Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!news.albasani.net!weretis.net!feeder4.news.weretis.net!feeds.phibee-telecom.net!newsfeed.xs4all.nl!newsfeed3a.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.018 X-Spam-Evidence: '*H*': 0.97; '*S*': 0.00; 'value,': 0.04; '(python': 0.07; 'level,': 0.07; 'things,': 0.09; 'topics,': 0.09; 'python': 0.11; 'checking,': 0.16; 'fetch': 0.16; 'soup': 0.16; 'topics': 0.16; 'subject:page': 0.19; 'thanks.': 0.20; 'print': 0.22; 'script': 0.25; 'pass': 0.26; 'message-id:@mail.gmail.com': 0.30; 'skip:c 30': 0.32; 'checking': 0.33; 'subject:from': 0.34; 'info': 0.35; 'received:google.com': 0.35; 'subject:data': 0.36; "i'll": 0.36; 'to:addr:python-list': 0.38; 'to:addr:python.org': 0.39; 'new': 0.61; 'account': 0.65; 'beautiful': 0.68; 'subject:get': 0.81; '<a': 0.84; 'url:app': 0.84; 'subject:Best': 0.91 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:from:date:message-id:subject:to:content-type; bh=v7T4DTZjdGEOLTk2F+sKt/F2pTtwbL585BHkxEwQP+Y=; b=C9yUmKDfDSSAfQw9g0Juz5GsRYQhbQcdUGO4PSvpX/yEq9Hi++lrZgWxVxFolk1d8q n6zSFURjQ6YwfOKyG6VsxmaMAs/WRfXLhk3nmqFYB6oIbL9zP+K8iQmydMvRf54TVbvN JIz5P7htp5MKXIghprWJzSqS99to9L3R+2kWcroljLRIJ5V74Ka9RjJEOu4HuaAfI9D9 +ILO1hNwn8PJ1sQIXY7R6Zltj029u4SjHOMsOenzjt6WFhY8xTbqj/80Hpy6PK+KC5dm 77XRW+b6AhYvHprhz6uK1E9nD7YTRMEFcNybp2VnZVjwVYvj1tTbLIGZ2iNloY5ejYY9 RpXA== X-Received: by 10.112.13.10 with SMTP id d10mr4433695lbc.10.1411047066756; Thu, 18 Sep 2014 06:31:06 -0700 (PDT) MIME-Version: 1.0 From: Juan Christian Date: Thu, 18 Sep 2014 10:30:46 -0300 Subject: Best approach to get data from web page continuously To: Python Content-Type: multipart/alternative; boundary=001a11c3a11a2cec66050356ff46 X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 37 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1411047074 news.xs4all.nl 2946 [2001:888:2000:d::a6]:56673 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:78014 --001a11c3a11a2cec66050356ff46 Content-Type: text/plain; charset=UTF-8 I'll write a python (Python 3.4.1) script to fetch for new data (topics) from this page (http://steamcommunity.com/app/440/tradingforum) continuously. All the topics follow this structure: It will work like that: I'll get the last topics, do some background checking regarding user level, inventory value, account age, and other things, if the user pass in the checking, I'll print some info and links in the terminal. The only thing I need to know to start is: What's the better way the get this data? Beautiful Soup 4 + requests? urllib? Others? Thanks. --001a11c3a11a2cec66050356ff46 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable

I'll write a python (Python 3.4.1) script to fetch for= new data (topics) from this page (http://steamcommunity.com/app/440/tradingforum) cont= inuously.

All the topics follow this structure: <a cl= ass=3D"forum_topic_overlay" href=3D"http://steamcommunity.com/app/= 440/tradingforum/TOPIC_ID/"> </a>

It will work like that: I'll get the last topics, do some backgr= ound checking regarding user level, inventory value, account age, and other= things, if the user pass in the checking, I'll print some info and lin= ks in the terminal. The only thing I need to know to start is: What's t= he better way the get this data?=C2=A0Beautiful Soup 4 + requests? urllib? = Others?

Thanks.

--001a11c3a11a2cec66050356ff46--