Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!feeds.phibee-telecom.net!newsfeed.xs4all.nl!newsfeed2.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
To: python-list@python.org
From: Mark Lawrence <breamoreboy@yahoo.co.uk>
Subject: Re: web scraping
Date: Sat, 12 Oct 2013 16:53:31 +0100
References: <28798DAD-5F0F-478B-B66F-91FD8DDF13EA@gmail.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
User-Agent: Mozilla/5.0 (Windows NT 6.1; rv:24.0) Gecko/20100101 Thunderbird/24.0.1
In-Reply-To: <28798DAD-5F0F-478B-B66F-91FD8DDF13EA@gmail.com>
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.1040.1381593226.18130.python-list@python.org>
Lines: 29
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:56746

On 12/10/2013 15:12, Ronald Routt wrote:
> 	I am new to programming and trying to figure out python.
>
> I am trying to learn which tools and tutorials I need to use along with some good beginner tutorials in scraping the the web.  The end result I am trying to come up with is scraping auto dealership sites for the following:
>
> 1.Name of dealership
> 2.  State where dealership is located
> 3.  Name of Owner, President or General Manager
> 4.  Email address of number 3 above
> 5.  Phone number of dealership
>
> Note:  Many times the Owner, President or General Manager and their email address is under a tab on the website such as "Meet our team" or "Support".  Sometimes this information is not available on the website.
>
> I sure would appreciate any help I can get to get me on the right track.  From what I have read so far, believe I have to use urllib but know nothing about how to us it..
>
> Thanks
> ronroutt@gmail.com
>

Take a look at this http://www.crummy.com/software/BeautifulSoup/

-- 
Roses are red,
Violets are blue,
Most poems rhyme,
But this one doesn't.

Mark Lawrence