Re: Fetching data from a HTML file

Date Fri, 23 Mar 2012 16:28:56 +0100
Subject Re: Fetching data from a HTML file
From Daniel Fetchinson <fetchinson@googlemail.com>
To Python <python-list@python.org>
Newsgroups comp.lang.python
Message-ID <mailman.932.1332516539.3037.python-list@python.org> (permalink)



On 3/23/12, Sangeet <mrsangeet@gmail.com> wrote:
> Hi,
>
> I've got to fetch data from the snippet below and have been trying to match
> the digits in it to specific groups. But I can't seem to figure out how to
> go about stripping the tags! :(
>
> <tr><td align="center"><b>Sum</b></td><td></td><td align='center'
> class="green">245</td><td align='center' class="red">11</td><td
> align='center'>0</td><td align='center' >256</td><td align='center' >1.496
> [min]</td></tr>
> </table>

Try Beautiful Soup, an HTML parsing library for Python:
http://www.crummy.com/software/BeautifulSoup/
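
If installing a third-party package is not an option, here is a rough sketch of the same idea using only the standard library's html.parser instead of Beautiful Soup. It collects the text of every <td> in the row you posted and then picks out the plain integers. The names CellCollector and extract_cells are made up for this example, and I am only assuming that the integer cells are the values you want to group.

```python
from html.parser import HTMLParser

# The table row from the quoted message, copied as posted.
ROW = """<tr><td align="center"><b>Sum</b></td><td></td><td align='center'
class="green">245</td><td align='center' class="red">11</td><td
align='center'>0</td><td align='center' >256</td><td align='center' >1.496
[min]</td></tr>"""


class CellCollector(HTMLParser):
    """Hypothetical helper: gathers the text content of each <td> cell."""

    def __init__(self):
        super().__init__()
        self.cells = []    # finished cell texts, in document order
        self._buf = None   # text pieces of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "td":
            self._buf = []

    def handle_data(self, data):
        # Only keep text that appears inside a <td>.
        if self._buf is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._buf is not None:
            # Collapse runs of whitespace (including line breaks) to one space.
            self.cells.append(" ".join("".join(self._buf).split()))
            self._buf = None


def extract_cells(html):
    """Return the text of every <td> found in html."""
    parser = CellCollector()
    parser.feed(html)
    parser.close()
    return parser.cells


cells = extract_cells(ROW)
# Keep only the cells that are plain non-negative integers.
counts = [int(c) for c in cells if c.isdigit()]
print(cells)
print(counts)
```

With Beautiful Soup 4 installed, the cell extraction should shrink to roughly [td.get_text() for td in soup.find_all("td")], but I have not run that against your full report, so treat it as a starting point.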

> Actually, I'm working on Robot Framework, and haven't been able to figure
> out how to read data from HTML tables. Reading from the source is the best
> (read: rudimentary) way I could come up with. Any suggestions are welcome!
>
> Thanks,
> Sangeet


-- 
Psss, psss, put it down! - http://www.cafepress.com/putitdown


