Re: Directory Web Site

Date Mon, 23 Sep 2013 17:14:30 +1000
Subject Re: Directory Web Site
From Chris Angelico <rosuav@gmail.com>
To python-list@python.org
Newsgroups comp.lang.python
Message-ID <mailman.255.1379920473.18130.python-list@python.org>


On Mon, Sep 23, 2013 at 11:14 AM,  <worthingtonclinton@gmail.com> wrote:
> Was hoping to get some tips or advice on scripting a program that would sort through my many links on my directory website and print out to me the ones that are broken or no longer functioning so that I could fix or remove them from the site. Any help, ideas, advice will be greatly appreciated.

The easiest solution is probably a non-Python tool like wget. Search
the web for "link checker" or "href verifier" or words to that effect,
and see what you find. This is an extremely common task, and I don't
think you need to write any code for it.

ChrisA
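
For anyone who would still rather script the check in Python, here is a
minimal sketch using only the standard library. It is not taken from this
thread: the names LinkCollector, extract_links, check_link and find_broken
are made up for illustration, and it assumes the directory page is plain
HTML whose links are ordinary <a href="..."> tags.

```python
from html.parser import HTMLParser
from urllib.error import HTTPError, URLError
from urllib.parse import urljoin, urlparse
from urllib.request import Request, urlopen


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against base_url."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links into absolute URLs.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return the unique http(s) links in html, in document order."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    unique = []
    for link in collector.links:
        # Skip mailto:, javascript: and similar non-web schemes.
        if urlparse(link).scheme in ("http", "https") and link not in unique:
            unique.append(link)
    return unique


def check_link(url, timeout=10):
    """Return None if the URL answers, else a short problem description."""
    # HEAD avoids downloading the body; the User-Agent value is arbitrary.
    request = Request(url, method="HEAD",
                      headers={"User-Agent": "link-check-sketch"})
    try:
        with urlopen(request, timeout=timeout):
            return None
    except HTTPError as err:
        # Raised for 4xx and 5xx responses.
        return f"HTTP {err.code}"
    except (URLError, OSError) as err:
        # DNS failures, refused connections, timeouts and the like.
        return str(err)


def find_broken(links, checker=check_link):
    """Return (url, problem) pairs for every link the checker rejects."""
    broken = []
    for url in links:
        problem = checker(url)
        if problem is not None:
            broken.append((url, problem))
    return broken
```

Given the page source in page_html and its address in page_url (both
placeholders), find_broken(extract_links(page_html, page_url)) would return
the links that failed along with the reason. Some servers refuse HEAD
requests, so a link reported as "HTTP 405" may be fine and is worth
retrying with a normal GET before removing it.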


Thread

Directory Web Site worthingtonclinton@gmail.com - 2013-09-22 18:14 -0700
  Re: Directory Web Site Joel Goldstick <joel.goldstick@gmail.com> - 2013-09-22 21:36 -0400
    Re: Directory Web Site worthingtonclinton@gmail.com - 2013-09-22 19:02 -0700
  Re: Directory Web Site Dave Angel <davea@davea.name> - 2013-09-23 01:59 +0000
    Re: Directory Web Site worthingtonclinton@gmail.com - 2013-09-22 19:04 -0700
  Re: Directory Web Site Chris Angelico <rosuav@gmail.com> - 2013-09-23 17:14 +1000