Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #49644

Re: HTML Parser

From Neil Cerutti <neilc@norwich.edu>
Newsgroups comp.lang.python
Subject Re: HTML Parser
Date 2013-07-02 17:57 +0000
Organization Norwich University
Message-ID <b3gik2Fa73jU1@mid.individual.net> (permalink)
References <b4aba93c-f832-4af4-8c48-02b1f8f6b1cd@googlegroups.com>

Show all headers | View raw


On 2013-07-02, subhabangalore@gmail.com <subhabangalore@gmail.com> wrote:
> Dear Group,
>
> I was looking for a good tutorial for a "HTML Parser". My
> intention was to extract tables from web pages or information
> from tables in web pages. 
>
> I tried to make a search, I got HTMLParser, BeautifulSoup, etc.
> HTMLParser works fine for me, but I am looking for a good
> tutorial to learn it nicely.

Take a read of the topic "Parsing, creating, and Manipulating
HTML Documents" from chapter five of Text Processing in Python.

http://gnosis.cx/TPiP/chap5.txt

-- 
Neil Cerutti

Back to comp.lang.python | Previous | NextPrevious in thread | Next in thread | Find similar | Unroll thread


Thread

HTML Parser subhabangalore@gmail.com - 2013-07-02 10:43 -0700
  Re: HTML Parser Neil Cerutti <neilc@norwich.edu> - 2013-07-02 17:57 +0000
  Re: HTML Parser Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2013-07-02 23:53 +0000
  Re: HTML Parser Joshua Landau <joshua.landau.ws@gmail.com> - 2013-07-03 01:13 +0100

csiph-web