Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #49644

Re: HTML Parser

Path csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!news-1.dfn.de!news.dfn.de!news.informatik.hu-berlin.de!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From Neil Cerutti <neilc@norwich.edu>
Newsgroups comp.lang.python
Subject Re: HTML Parser
Date 2 Jul 2013 17:57:22 GMT
Organization Norwich University
Lines 18
Message-ID <b3gik2Fa73jU1@mid.individual.net> (permalink)
References <b4aba93c-f832-4af4-8c48-02b1f8f6b1cd@googlegroups.com>
Mime-Version 1.0
Content-Type text/plain; charset=us-ascii
Content-Transfer-Encoding 7bit
X-Trace individual.net FkcRvdtBhqevvbQICJJq+gE77tiPWwUfql2tms0V7B5AVKzWM3
Cancel-Lock sha1:g38IbjZ9tE5/u0HOCc1lbl739pA=
User-Agent slrn/0.9.9p1/mm/ao (Win32)
Xref csiph.com comp.lang.python:49644

Show key headers only | View raw


On 2013-07-02, subhabangalore@gmail.com <subhabangalore@gmail.com> wrote:
> Dear Group,
>
> I was looking for a good tutorial for a "HTML Parser". My
> intention was to extract tables from web pages or information
> from tables in web pages. 
>
> I tried to make a search, I got HTMLParser, BeautifulSoup, etc.
> HTMLParser works fine for me, but I am looking for a good
> tutorial to learn it nicely.

Take a read of the topic "Parsing, creating, and Manipulating
HTML Documents" from chapter five of Text Processing in Python.

http://gnosis.cx/TPiP/chap5.txt

-- 
Neil Cerutti

Back to comp.lang.python | Previous | NextPrevious in thread | Next in thread | Find similar | Unroll thread


Thread

HTML Parser subhabangalore@gmail.com - 2013-07-02 10:43 -0700
  Re: HTML Parser Neil Cerutti <neilc@norwich.edu> - 2013-07-02 17:57 +0000
  Re: HTML Parser Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2013-07-02 23:53 +0000
  Re: HTML Parser Joshua Landau <joshua.landau.ws@gmail.com> - 2013-07-03 01:13 +0100

csiph-web