From: Dave Angel
Newsgroups: comp.lang.python
Subject: Re: Directory Web Site
Date: Mon, 23 Sep 2013 01:59:14 +0000 (UTC)
References: <9c1aee48-7d5e-49d3-ae8c-b5d422cecb9b@googlegroups.com>

On 22/9/2013 21:14, worthingtonclinton@gmail.com wrote:

> Was hoping to get some tips or advice on scripting a program that
> would sort through my many links on my directory website and print
> out to me the ones that are broken or no longer functioning so that
> I could fix or remove them from the site. Any help, ideas, advice
> will be greatly appreciated.

Since it's your own website, the best answer is probably to process the
source to that site. Was it written by a python script?

Otherwise, if the site is reasonably correct (as most are not), then
Beautiful Soup is probably the place to start.

-- 
DaveA
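For illustration only, here is a rough sketch of the kind of link checker the question describes. It is not from the original post. To keep it free of third-party installs it swaps in the standard library's html.parser for Beautiful Soup; with Beautiful Soup the extraction step would be roughly soup.find_all("a", href=True). The names LinkCollector, extract_links, is_alive and find_broken are made up for this example, and treating any failed HEAD request as "broken" is an assumption (some servers refuse HEAD even for pages that work).

```python
from html.parser import HTMLParser
from urllib.error import URLError
from urllib.parse import urljoin, urlparse
from urllib.request import Request, urlopen


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against base_url."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href:
            # Turn relative links such as "/about" into absolute URLs.
            self.links.append(urljoin(self.base_url, href))


def extract_links(html, base_url):
    """Return the unique http(s) links in html, in document order."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    unique = []
    for url in collector.links:
        # Skip mailto:, javascript: and similar non-web schemes.
        if urlparse(url).scheme in ("http", "https") and url not in unique:
            unique.append(url)
    return unique


def is_alive(url, timeout=10):
    """Hypothetical check: True if a HEAD request gets a status below 400."""
    request = Request(url, method="HEAD",
                      headers={"User-Agent": "link-check-sketch"})
    try:
        with urlopen(request, timeout=timeout) as response:
            return response.status < 400
    except (URLError, OSError, ValueError):
        # HTTP errors, DNS failures, timeouts and malformed URLs land here.
        return False


def find_broken(html, base_url, checker=is_alive):
    """Return the links in html for which checker(url) is false."""
    return [url for url in extract_links(html, base_url) if not checker(url)]
```

To try it, read one page of the site into a string and call find_broken(page_source, "http://your-site.example/"), where the URL is a placeholder for the page's own address. The checker argument is there so the network step can be replaced, for example with a GET-based check for servers that reject HEAD.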