Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!xlned.com!feeder7.xlned.com!news2.euro.net!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.020 X-Spam-Evidence: '*H*': 0.96; '*S*': 0.00; 'subject:error': 0.03; 'parser': 0.07; 'issue:': 0.09; 'cc:addr:python-list': 0.11; 'parser.': 0.16; 'thursday,': 0.16; 'wrote:': 0.18; 'meant': 0.20; 'email addr:gmail.com>': 0.22; 'cc:addr:python.org': 0.22; 'parse': 0.24; 'cc:2**0': 0.24; 'cc:no real name:2**0': 0.24; '>': 0.26; 'header:In-Reply-To:1': 0.27; 'document.': 0.30; 'strongly': 0.30; 'message-id:@mail.gmail.com': 0.30; 'too.': 0.31; 'but': 0.35; 'received:google.com': 0.35; 'subject:can': 0.39; 'july': 0.63; 'to:addr:gmail.com': 0.65; 'jul': 0.74; 'actually,': 0.84; '2013': 0.98 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=gEr33/jeTinAZSiNTwMUvCQbdqJAipX7xXTGiNkAcwA=; b=gRg7SvqRuy7eINWdTYUMRpb2TtjE0F5N7+Kw4JT1J3JqgCEG43o82q6dux5Mg6T6ND 3IVq0wQ/GZIQw0qDptVaSUzMJ9UB8dZ1G8/zd+g7wNVUiEP4vXUmt31dOEN2x6dRa1gp ouzXbv32mPdln9avH1foIMBPR5QYGSwaFsWCLF7Ky+l1I3yYbz7R3vezW17dRIwXIA8g wNa3OUhiOSsSEWV3MQvUTHq57ukg+RCu/JnKxYAkRexUBLNVKa9Rc1HQ9hma+EuFWIMX XIjqun2fI1uZY202CWMm7JMdHm2beM8qLH4UPLllLaogBYOPpOWFzldPAtlmjkKIru1y 5nDQ== MIME-Version: 1.0 X-Received: by 10.224.147.145 with SMTP id l17mr19680608qav.3.1373538422790; Thu, 11 Jul 2013 03:27:02 -0700 (PDT) In-Reply-To: <3906a61a-1a52-48b8-b9df-fb30c5498e99@googlegroups.com> References: <51DE73E4.6040007@bluewin.ch> <3906a61a-1a52-48b8-b9df-fb30c5498e99@googlegroups.com> Date: Thu, 11 Jul 2013 11:27:02 +0100 Subject: Re: ElementTree: can't figure out a mismached-tag error From: =?ISO-8859-1?Q?F=E1bio_Santos?= To: fronagzen@gmail.com Content-Type: multipart/alternative; boundary=089e0160b624c69da604e139d52f Cc: python-list@python.org X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 36 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1373538425 news.xs4all.nl 15992 [2001:888:2000:d::a6]:59677 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:50436 --089e0160b624c69da604e139d52f Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable On 11 Jul 2013 10:24, wrote: > > Actually, I don't think etree has a HTML parser. And I would counter-recommend lxml if speed is an issue: BeautifulSoup takes a looooong time to parse a large document. > > On Thursday, July 11, 2013 5:08:04 PM UTC+8, F=E1bio Santos wrote: > > > > Use an HTML parser. I strongly recommend BeautifulSoup but I think etree has an HTML parser too. I am not sure.. I meant lxml. My apologies. --089e0160b624c69da604e139d52f Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable


On 11 Jul 2013 10:24, <fronagzen@= gmail.com> wrote:
>
> Actually, I don't think etree has a HTML parser. And I would count= er-recommend lxml if speed is an issue: BeautifulSoup takes a looooong time= to parse a large document.
>
> On Thursday, July 11, 2013 5:08:04 PM UTC+8, F=E1bio Santos wrote:
> >
> > Use an HTML parser. I strongly recommend=A0 BeautifulSoup but I t= hink etree has an HTML parser too. I am not sure..

I meant lxml. My apologies.

--089e0160b624c69da604e139d52f--