Path: csiph.com!usenet.pasdenom.info!news.albasani.net!newsfeed.freenet.ag!news2.euro.net!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.018 X-Spam-Evidence: '*H*': 0.96; '*S*': 0.00; 'snippet': 0.07; 'fetch': 0.09; 'framework,': 0.09; 'subject:file': 0.13; 'url:software': 0.13; '(read': 0.16; 'from:addr:fetchinson': 0.16; 'from:name:daniel fetchinson': 0.16; 'psss,': 0.16; 'url:putitdown': 0.16; 'wrote:': 0.18; 'thanks,': 0.19; "haven't": 0.20; 'trying': 0.21; 'header:In-Reply-To:1': 0.22; 'subject:data': 0.25; 'figure': 0.26; 'url:mailman': 0.27; "i'm": 0.28; 'message-id:@mail.gmail.com': 0.29; 'source,': 0.29; 'seem': 0.29; 'specifically': 0.30; 'from:addr:googlemail.com': 0.31; 'url:listinfo': 0.32; "i've": 0.32; "can't": 0.33; 'match': 0.34; 'hi,': 0.34; 'to:addr:python-list': 0.35; 'url:python': 0.35; 'with.': 0.37; 'but': 0.37; 'received:google.com': 0.37; 'received:209.85': 0.38; 'could': 0.38; 'data': 0.38; 'url:org': 0.39; 'subject:from': 0.39; 'suggestions': 0.39; 'received:209': 0.39; 'put': 0.40; 'to:addr:python.org': 0.40; 'below': 0.62; 'groups.': 0.68; 'beautiful': 0.71; 'sum': 0.74; 'received:209.85.216.174': 0.80 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=H9KH6eXmQ8dYBSV/EZZkg8Z2RXGj/sOvF2Oh7CweExY=; b=QAvxBOm/RI5cvWwFitNob4+gCIJuKez10mv5MQF9EtOP4mSVERnnjjOdc4R85gtOfY pTEJHM0+Cd98h7InndW500UYE1evyyOg0uvsp03ad4NH1Rhop5DLupP74e7tSrtuu/xk c6pYDy6rJdKszQKTbUcMJZImRWQD5LoR4Cq/SSWieaH1J5San3NXiNsglq0qO4xpI3QE Gt5rCkOcGR5PrXxjkmS9YS/I/qNFBcCCYoUVzWdvnE3PNo3PcQ6RWsypycDbXaF/yuTs rp/k02WydjIr5YuVDZ5NJVF4QINqLFkcY1RDvGNSQSQmMXI2nwnQlgPtGd8OTzg1rhsW bTpA== MIME-Version: 1.0 In-Reply-To: <9362386.1094.1332510725414.JavaMail.geo-discussion-forums@ynlt15> References: <9362386.1094.1332510725414.JavaMail.geo-discussion-forums@ynlt15> Date: Fri, 23 Mar 2012 16:28:56 +0100 Subject: Re: Fetching data from a HTML file From: Daniel Fetchinson To: Python Content-Type: text/plain; charset=ISO-8859-1 X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.12 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 28 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1332516539 news.xs4all.nl 6910 [2001:888:2000:d::a6]:43161 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:22078 On 3/23/12, Sangeet wrote: > Hi, > > I've got to fetch data from the snippet below and have been trying to match > the digits in this to specifically to specific groups. But I can't seem to > figure how to go about stripping the tags! :( > > Sum class="green">24511 align='center'>02561.496 > [min] > Try beautiful soup: http://www.crummy.com/software/BeautifulSoup/ > Actually, I'm working on ROBOT Framework, and haven't been able to figure > out how to read data from HTML tables. Reading from the source, is the best > (read rudimentary) way I could come up with. Any suggestions are welcome! > > Thanks, > Sangeet > -- > http://mail.python.org/mailman/listinfo/python-list > -- Psss, psss, put it down! - http://www.cafepress.com/putitdown