Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #60215

Re: Web Page Parsing/Downloading

Path csiph.com!usenet.pasdenom.info!news.etla.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed4.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
Return-Path <rosuav@gmail.com>
X-Original-To python-list@python.org
Delivered-To python-list@mail.python.org
X-Spam-Status OK 0.070
X-Spam-Evidence '*H*': 0.86; '*S*': 0.00; 'scripts': 0.03; '22,': 0.09; 'advice.': 0.09; 'python': 0.11; 'from:addr:rosuav': 0.16; 'from:name:chris angelico': 0.16; 'sorts': 0.16; 'worst': 0.16; 'wrote:': 0.18; 'trying': 0.19; 'example': 0.22; 'otherwise,': 0.22; 'separate': 0.22; 'error': 0.23; 'url:moin': 0.24; "i've": 0.25; 'posts': 0.26; 'subject:/': 0.26; 'header:In-Reply-To:1': 0.27; 'points': 0.29; 'strongly': 0.30; 'message- id:@mail.gmail.com': 0.30; 'along': 0.30; "i'm": 0.30; 'easier': 0.31; 'posting': 0.31; 'url:wiki': 0.31; 'own,': 0.31; 'post.': 0.31; 'time;': 0.31; 'anyone': 0.31; 'figure': 0.32; 'url:python': 0.33; 'running': 0.33; 'fri,': 0.33; "can't": 0.35; 'received:google.com': 0.35; 'google': 0.35; 'version': 0.36; 'really': 0.36; 'done': 0.36; 'doing': 0.36; 'url:org': 0.36; 'two': 0.37; 'starting': 0.37; 'nov': 0.38; 'to:addr:python-list': 0.38; 'pm,': 0.38; 'rather': 0.38; 'that,': 0.38; 'little': 0.38; 'short': 0.38; 'to:addr:python.org': 0.39; 'read': 0.60; 'tips': 0.61; 'lost': 0.61; 'new': 0.61; 'simple': 0.61; "you're": 0.61; 'first': 0.61; "you'll": 0.62; "you've": 0.63; 'become': 0.64; 'more': 0.64; 'note:': 0.66; 'cut': 0.74; 'from.': 0.93; '2013': 0.98
DKIM-Signature v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=l95PX0C41VVfMbGZoEOKEiM2HuWfgFs66gqvgSCfj7w=; b=Nfz4flf2mM4i+LkKypoXIliX353+1zx4geBWQIUrOxYbPIC66C5nw3E25tzUDuUQoS RKrb0ENXLdJqzBq4W8pyr3bBQoZdfxFrcU18zVdEzq4Igmnc/AJJ62mVwqEraHoPbdkV mTWb/aNOInmnysJrS5EIgQkgpORUIIYzPbIWQdCQD0WTbQ1JTxMbD9p+gkuqignflyai 97vaBdYLq/TC1JfpjCHPz6IikDMmaO7UYMBDULmDsD51L5BIl08oSax1/Py82KRCgicg pbhjtLPZicLrDFEZJJJoc9Yx1e9zxBuW1SDMNScbsdOHMoUeUVZz1pfMmU4qbnOLODs4 EQAQ==
MIME-Version 1.0
X-Received by 10.68.225.9 with SMTP id rg9mr2422814pbc.122.1385121757818; Fri, 22 Nov 2013 04:02:37 -0800 (PST)
In-Reply-To <fd8d82d5-f5cc-4a5b-99d1-a93a20895f63@googlegroups.com>
References <fd8d82d5-f5cc-4a5b-99d1-a93a20895f63@googlegroups.com>
Date Fri, 22 Nov 2013 23:02:37 +1100
Subject Re: Web Page Parsing/Downloading
From Chris Angelico <rosuav@gmail.com>
To python-list@python.org
Content-Type text/plain; charset=ISO-8859-1
X-BeenThere python-list@python.org
X-Mailman-Version 2.1.15
Precedence list
List-Id General discussion list for the Python programming language <python-list.python.org>
List-Unsubscribe <https://mail.python.org/mailman/options/python-list>, <mailto:python-list-request@python.org?subject=unsubscribe>
List-Archive <http://mail.python.org/pipermail/python-list/>
List-Post <mailto:python-list@python.org>
List-Help <mailto:python-list-request@python.org?subject=help>
List-Subscribe <https://mail.python.org/mailman/listinfo/python-list>, <mailto:python-list-request@python.org?subject=subscribe>
Newsgroups comp.lang.python
Message-ID <mailman.3034.1385121767.18130.python-list@python.org> (permalink)
Lines 25
NNTP-Posting-Host 2001:888:2000:d::a6
X-Trace 1385121767 news.xs4all.nl 15930 [2001:888:2000:d::a6]:45152
X-Complaints-To abuse@xs4all.nl
Xref csiph.com comp.lang.python:60215

Show key headers only | View raw


On Fri, Nov 22, 2013 at 9:10 PM, TheRandomPast <wishingforsam@gmail.com> wrote:
> Can anyone help because I've become a little stuck? None of the scripts are running for me and I can't see where I'm having issues

I'm rather lost in what you're trying to accomplish here. The first
thing to do would be to separate out your three scripts and just look
at one at a time; then cut each one down to just what it really needs
to be doing. Once you've done that, you'll have a simple example - see
http://sscce.org/ for tips on that - and you can figure out what it's
doing wrong. If you can't figure it out on your own, the short example
will be far more suitable for posting here, along with its error
backtrace (if it's throwing one), than a more verbose program listing.

Two general points of advice. Firstly, if you're just starting out, I
strongly recommend you use Python 3 instead of Python 2. All sorts of
things have been improved, and it's far easier to learn on the new
version than to learn on the old and then have to change your habits
later.

And secondly, please read this and take note:
https://wiki.python.org/moin/GoogleGroupsPython - otherwise, you'll
find that a lot of people don't want to see your post. Best would be
to avoid Google Groups altogether, as it's very approximately the
worst newsgroup client I've ever seen posts from.

ChrisA

Back to comp.lang.python | Previous | NextPrevious in thread | Find similar | Unroll thread


Thread

Web Page Parsing/Downloading TheRandomPast <wishingforsam@gmail.com> - 2013-11-22 02:10 -0800
  Re: Web Page Parsing/Downloading Chris Angelico <rosuav@gmail.com> - 2013-11-22 23:02 +1100

csiph-web