Path: csiph.com!usenet.pasdenom.info!weretis.net!feeder1.news.weretis.net!feeder.erje.net!eu.feeder.erje.net!newsfeed.freenet.ag!news2.euro.net!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
Sender: grettke@gmail.com
In-Reply-To: <e180db33-272f-4a9d-bc1e-231f3c3580bf@googlegroups.com>
References: <e180db33-272f-4a9d-bc1e-231f3c3580bf@googlegroups.com>
Date: Fri, 21 Dec 2012 15:34:11 -0600
Subject: Re: Scrapy/XPath help
From: Grant Rettke <grettke@acm.org>
To: Always Learning <cbrowning@ou.edu>
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
Cc: python-list@python.org
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.1169.1356125659.29569.python-list@python.org>
Lines: 43
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:35321

You might have better luck if you share the python make, version, os,
error message, and some unit tests demonstrating what you expect.

On Fri, Dec 21, 2012 at 3:21 PM, Always Learning <cbrowning@ou.edu> wrote:
> Hello all. I'm new to Python, but have been playing around with it for a =
few weeks now, following tutorials, etc. I've spun off on my own and am try=
ing to do some basic web scraping. I've used Firebug/View XPath in Firefox =
for some help with the XPaths, however, I still am receiving errors when I =
try to run this script. If you could help, it would be greatly appreciated!
>
> from scrapy.spider import BaseSpider
> from scrapy.selector import HtmlXPathSelector
> from cbb_info.items import CbbInfoItem, Field
>
> class GameInfoSpider(BaseSpider):
>     name =3D "game_info"
>     allowed_domains =3D ["www.sbrforum.com"]
>     start_urls =3D [
>         'http://www.sbrforum.com/betting-odds/ncaa-basketball/',
>         ]
>
>     def parse(self, response):
>         hxs =3D HtmlXPathSelector(response)
>         toplevels =3D hxs.select("//div[@class=3D'eventLine-value']")
>         items =3D []
>         for toplevels in toplevels:
>             item =3D CbbInfoItem()
>             item ["teams"] =3D toplevels.select("/span[@class=3D'team-nam=
e'/text()").extract()
>             item ["lines"] =3D toplevels.select("/div[@rel=3D'19']").extr=
act()
>             item.append(item)
>         return items
> --
> http://mail.python.org/mailman/listinfo/python-list



--=20
Grant Rettke | ACM, AMA, COG, IEEE
grettke@acm.org | http://www.wisdomandwonder.com/
Wisdom begins in wonder.
((=CE=BB (x) (x x)) (=CE=BB (x) (x x)))