Date: Mon, 16 Dec 2013 06:04:23 -0600
From: Tim Chase
To: python-list@python.org
Subject: Re: Question RE urllib
Newsgroups: comp.lang.python

On 2013-12-16 04:40, Jeff James wrote:
> These sites do not require a logon in order for the home
> page to come up. Could this be due to some port being blocked
> internally ? Only one of the sites reporting as down is "https" but
> all are internal sites. Is there some other component I should be
> including in the script ?

From your description, I'm unsure whether it's *only* the HTTPS
sites that are having issues, or a subset of your URL list that
includes both HTTP and HTTPS (where those sites happen to be
internal).

Either way, my first suspicion is that you have some sort of proxy
server in your network that web-browsers use. urlopen() accepts a
parameter to list proxy server information, which you should be able
to glean from your browser settings.

If your web-browser isn't using a proxy, the sites might be doing
unsavory things like filtering by user-agent. Dumping the contents of
the exception (and including it here) might help diagnose.

-tkc
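
A minimal sketch of the three suggestions above (explicit proxy settings, a browser-like User-Agent, and dumping the exception), assuming Python 3. There, urllib.request.urlopen() has no proxy parameter, so the sketch swaps in urllib.request.ProxyHandler, which plays the same role. The names make_opener and check, the proxy address, and the User-Agent string are all hypothetical placeholders, not anything from the original script.

```python
import http.client
import urllib.error
import urllib.request


def make_opener(proxies=None, user_agent=None):
    """Build an opener with optional proxy settings and User-Agent.

    proxies: dict such as {"http": "http://proxy.example.internal:8080"}
             (hypothetical address; copy the real one from the browser).
             None leaves urllib's default proxy detection in place;
             an empty dict disables proxies entirely.
    """
    handlers = []
    if proxies is not None:
        handlers.append(urllib.request.ProxyHandler(proxies))
    opener = urllib.request.build_opener(*handlers)
    if user_agent:
        # Replace the default "Python-urllib/x.y" header in case the
        # site filters on it.
        opener.addheaders = [("User-Agent", user_agent)]
    return opener


def check(url, opener, timeout=10):
    """Return ("up" | "down", detail) and keep the exception text."""
    try:
        with opener.open(url, timeout=timeout) as resp:
            return "up", "HTTP %d" % resp.status
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (403, 407, ...).
        return "down", "HTTPError %d %s" % (exc.code, exc.reason)
    except urllib.error.URLError as exc:
        # No usable answer: DNS failure, refused connection, SSL error, ...
        return "down", "URLError %r" % (exc.reason,)
    except (OSError, http.client.HTTPException) as exc:
        # Timeouts while reading, malformed responses, and similar.
        return "down", repr(exc)
```

Calling check(url, make_opener(proxies={...}, user_agent="Mozilla/5.0")) for each URL and printing the second element of the result gives the exception details worth posting. A 407 would point at proxy authentication, while a 403 that goes away with a browser-like User-Agent would point at user-agent filtering.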