Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.python > #51625
| Path | csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!news.mixmin.net!eweka.nl!hq-usenetpeers.eweka.nl!xlned.com!feeder5.xlned.com!newsfeed.xs4all.nl!newsfeed3.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail |
|---|---|
| Return-Path | <joshua.landau.ws@gmail.com> |
| X-Original-To | python-list@python.org |
| Delivered-To | python-list@mail.python.org |
| X-Spam-Status | OK 0.029 |
| X-Spam-Evidence | '*H*': 0.95; '*S*': 0.01; 'subject:Python': 0.06; 'subject:help': 0.08; 'subject:script': 0.09; 'url:software': 0.09; 'cc:addr:python-list': 0.11; 'url:download': 0.12; 'fetch': 0.16; 'simpson': 0.16; 'sure.': 0.16; 'sender:addr:gmail.com': 0.17; 'wrote:': 0.18; 'email addr:gmail.com>': 0.22; 'cc:addr:python.org': 0.22; 'parse': 0.24; 'cc:2**0': 0.24; 'header:In-Reply-To:1': 0.27; 'message-id:@mail.gmail.com': 0.30; "i'm": 0.30; 'requests': 0.31; 'that.': 0.31; 'maybe': 0.34; "i'd": 0.34; 'received:google.com': 0.35; "you'll": 0.62; 'email addr:gmail.com': 0.63; 'july': 0.63; 'great': 0.65; 'url:0': 0.67; 'url:4': 0.69; 'links,': 0.74; 'lazy': 0.91; '2013': 0.98 |
| DKIM-Signature | v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:from:date :x-google-sender-auth:message-id:subject:to:cc:content-type; bh=7VD4PTaHMJPK0yNaWVbhiF3V6s+FvO8sByT//ih3iqY=; b=zWTNjCpGHh7SXEeG/c/6tjTt96+liXJUmDoQBnIgOOZArDhBRk1rlOBSvE9XtCxqFg TKmJ5yRGbfpSutQF78yGTtu20ecHD84EGj0heEyOIufGH/uxo+4BNVUhWItXAme84xEh 4y6qrRGU0XbAi2f9v7UQIvz/rK0Bc9FE4XJhOpCfhayGNju8Sb2JMwxD+JO5/eCbKuNn bDSyxqqqFi7WzBSIbG4FvXaJPGaMJniXJxEoArn7SPg+g7TIDVAfRTiSZgzxHPawlnpa uLPse7ES644LbOXZMsdTh8uyklJg8FqymiD4BzXldLpu1BbCW8OmlTWJ7keBEHt4OEZ0 wBkQ== |
| X-Received | by 10.112.5.199 with SMTP id u7mr29263011lbu.67.1375251893619; Tue, 30 Jul 2013 23:24:53 -0700 (PDT) |
| MIME-Version | 1.0 |
| Sender | joshua.landau.ws@gmail.com |
| In-Reply-To | <20130730214759.GA47935@cskk.homeip.net> |
| References | <7e9a50d5-3340-4cf0-a3b8-ef1109837529@googlegroups.com> <20130730214759.GA47935@cskk.homeip.net> |
| From | Joshua Landau <joshua@landau.ws> |
| Date | Wed, 31 Jul 2013 07:24:13 +0100 |
| X-Google-Sender-Auth | cQIvclb774XbSNHl_wcTxtfFh90 |
| Subject | Re: Python script help |
| To | Cameron Simpson <cs@zip.com.au> |
| Content-Type | multipart/alternative; boundary=14dae94ed6fd98903304e2c8c85e |
| Cc | python-list <python-list@python.org> |
| X-BeenThere | python-list@python.org |
| X-Mailman-Version | 2.1.15 |
| Precedence | list |
| List-Id | General discussion list for the Python programming language <python-list.python.org> |
| List-Unsubscribe | <http://mail.python.org/mailman/options/python-list>, <mailto:python-list-request@python.org?subject=unsubscribe> |
| List-Archive | <http://mail.python.org/pipermail/python-list/> |
| List-Post | <mailto:python-list@python.org> |
| List-Help | <mailto:python-list-request@python.org?subject=help> |
| List-Subscribe | <http://mail.python.org/mailman/listinfo/python-list>, <mailto:python-list-request@python.org?subject=subscribe> |
| Newsgroups | comp.lang.python |
| Message-ID | <mailman.5352.1375251900.3114.python-list@python.org> (permalink) |
| Lines | 46 |
| NNTP-Posting-Host | 2001:888:2000:d::a6 |
| X-Trace | 1375251900 news.xs4all.nl 15934 [2001:888:2000:d::a6]:53418 |
| X-Complaints-To | abuse@xs4all.nl |
| Xref | csiph.com comp.lang.python:51625 |
Show key headers only | View raw
[Multipart message — attachments visible in raw view] - view raw
On 30 July 2013 22:47, Cameron Simpson <cs@zip.com.au> wrote: > On 30Jul2013 09:12, cool1574@gmail.com <cool1574@gmail.com> wrote: > | ** urlib, urlib2 > > Sure. And I'd use BeautifulSoup to do the parse. You'll need to fetch that. > So: urllib[2] to fetch the document and BS to parse it for links, > then urllib[2] to fetch the links you want. > > http://www.crummy.com/software/BeautifulSoup/bs4/download/4.0/ Personally BeautifulSoup + requests is a great combination. Maybe I'm just lazy ;).
Back to comp.lang.python | Previous | Next — Previous in thread | Next in thread | Find similar | Unroll thread
Python script help cool1574@gmail.com - 2013-07-30 07:49 -0700
Re: Python script help Chris Angelico <rosuav@gmail.com> - 2013-07-30 16:38 +0100
Re: Python script help cool1574@gmail.com - 2013-07-30 08:49 -0700
Re: Python script help Chris Angelico <rosuav@gmail.com> - 2013-07-30 16:58 +0100
Re: Python script help cool1574@gmail.com - 2013-07-30 09:10 -0700
Re: Python script help cool1574@gmail.com - 2013-07-30 09:12 -0700
Re: Python script help Cameron Simpson <cs@zip.com.au> - 2013-07-31 07:47 +1000
Re: Python script help Joshua Landau <joshua@landau.ws> - 2013-07-31 07:24 +0100
Re: Python script help Chris Angelico <rosuav@gmail.com> - 2013-07-30 17:22 +0100
Re: Python script help Vincent Vande Vyvre <vincent.vandevyvre@swing.be> - 2013-07-30 18:58 +0200
Re: Python script help Ulrich Eckhardt <ulrich.eckhardt@dominolaser.com> - 2013-07-30 17:32 +0200
Re: Python script help Denis McMahon <denismfmcmahon@gmail.com> - 2013-07-31 05:08 +0000
Re: Python script help cool1574@gmail.com - 2013-07-31 01:15 -0700
Re: Python script help alex23 <wuwei23@gmail.com> - 2013-08-01 10:57 +1000
Re: Python script help Alister <alister.ware@ntlworld.com> - 2013-08-01 10:39 +0000
Re: Python script help Piet van Oostrum <piet@vanoostrum.org> - 2013-08-23 22:37 -0400
Re: Python script help cool1574@gmail.com - 2013-08-01 09:02 -0700
Re: Python script help Ulrich Eckhardt <ulrich.eckhardt@dominolaser.com> - 2013-08-02 10:44 +0200
Re: Python script help cool1574@gmail.com - 2013-08-02 02:46 -0700
Re: Python script help Chris Angelico <rosuav@gmail.com> - 2013-08-02 11:01 +0100
Re: Python script help cool1574@gmail.com - 2013-08-04 08:57 -0700
Re: Python script help Chris Angelico <rosuav@gmail.com> - 2013-08-04 17:20 +0100
Re: Python script help Michael Torrie <torriem@gmail.com> - 2013-08-04 16:58 -0600
Re: Python script help Jake Angulo <jake.angulo@gmail.com> - 2013-08-05 10:30 +1000
csiph-web