Path: csiph.com!x330-a1.tempe.blueboxinc.net!usenet.pasdenom.info!gegeweb.org!de-l.enfer-du-nord.net!feeder2.enfer-du-nord.net!txtfeed1.tudelft.nl!tudelft.nl!txtfeed2.tudelft.nl!amsnews11.chello.com!newsfeed.xs4all.nl!newsfeed6.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.002 X-Spam-Evidence: '*H*': 1.00; '*S*': 0.00; 'xml,': 0.05; 'content- type:multipart/signed': 0.09; 'filename:fname piece:signature': 0.09; 'developer': 0.12; 'mon,': 0.15; 'adam': 0.16; 'content- type:application/pgp-signature': 0.16; 'filename:fname piece:asc': 0.16; 'filename:fname:signature.asc': 0.16; 'from:addr:awilliam': 0.16; 'from:addr:whitemice.org': 0.16; 'from:name:adam tauno williams': 0.16; 'received:72.14.190': 0.16; 'received:72.14.190.87': 0.16; 'received:mail.wmmi.net': 0.16; 'received:wmmi.net': 0.16; 'reply-to:addr:awilliam': 0.16; 'reply- to:addr:whitemice.org': 0.16; 'subject:lxml': 0.16; 'wrote:': 0.16; 'header:In-Reply-To:1': 0.22; 'import': 0.27; 'received:72.14': 0.29; 'html.': 0.30; 'to:addr:python-list': 0.33; 'received:192': 0.38; 'received:192.168.1': 0.39; 'subject:: ': 0.39; 'to:addr:python.org': 0.40; 'url:us': 0.60; 'your': 0.61; 'williams': 0.63; 'header:Reply-To:1': 0.71; 'reply-to:no real name:2**0': 0.72 Subject: Re: lxml to parse html From: Adam Tauno Williams To: python-list@python.org Date: Mon, 23 Jan 2012 10:30:40 -0500 In-Reply-To: References: Content-Type: multipart/signed; micalg="pgp-sha1"; protocol="application/pgp-signature"; boundary="=-GuE5ZQmrKXpxOZeDWWMY" X-Mailer: Evolution 3.2.1 Mime-Version: 1.0 X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.12 Precedence: list Reply-To: awilliam@whitemice.org List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 35 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1327332741 news.xs4all.nl 6857 [2001:888:2000:d::a6]:45942 X-Complaints-To: abuse@xs4all.nl Xref: x330-a1.tempe.blueboxinc.net comp.lang.python:19265 --=-GuE5ZQmrKXpxOZeDWWMY Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable On Mon, 2012-01-23 at 15:39 +0800, contro opinion wrote: > import lxml.html > myxml=3D''' > > > Use lxml.etree not lxml.html. Your content is XML, not HTML. --=20 System & Network Administrator [ LPI & NCLA ] OpenGroupware Developer Adam Tauno Williams --=-GuE5ZQmrKXpxOZeDWWMY Content-Type: application/pgp-signature; name="signature.asc" Content-Description: This is a digitally signed message part Content-Transfer-Encoding: 7bit -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.18 (GNU/Linux) iEYEABECAAYFAk8dfSUACgkQLRePpNle04OLsACfWg9Tl+E11mgMCzbPcXkbVxkV SIoAni2NYFGqU+r/RsRddjXXsJmM4hBc =E0TU -----END PGP SIGNATURE----- --=-GuE5ZQmrKXpxOZeDWWMY--