In-Reply-To: <VjMcu.4720$J14.740@fx31.am4>
References: <7fb9c035-c663-4874-9597-ac47d1c30da7@googlegroups.com> <VjMcu.4720$J14.740@fx31.am4>
Date: Fri, 1 Nov 2013 10:14:16 -0400
Subject: Re: how to extract page-URL using BeautifulSoup
From: Joel Goldstick <joel.goldstick@gmail.com>
To: Alister <alister.ware@ntlworld.com>
Cc: "python-list@python.org" <python-list@python.org>
Newsgroups: comp.lang.python
Message-ID: <mailman.1929.1383315259.18130.python-list@python.org>

This is nearly the same question you asked under another name
yesterday.  It's not clear what you really want to do.  You are asking
for the URL of a page that you retrieved by supplying that same URL.
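
To illustrate the point, here is a minimal sketch, assuming Python 3's
urllib.request in place of urllib2. The helper name fetch_with_url is
made up for this example. BeautifulSoup only receives the markup, so
the URL has to come from the variable you already hold or from the
response object; the two differ only when the server redirects.

```python
from urllib.request import urlopen


def fetch_with_url(link):
    """Return (final_url, raw_bytes) for link.

    Hypothetical helper for illustration; not part of BeautifulSoup.
    """
    response = urlopen(link)
    try:
        # geturl() reports the URL that was actually retrieved. It
        # matches link unless the request was redirected.
        return response.geturl(), response.read()
    finally:
        response.close()
```

Calling fetch_with_url(link) gives you the URL alongside the page
bytes, and the bytes can then be passed to BeautifulSoup exactly as in
your own code.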

On Fri, Nov 1, 2013 at 7:33 AM, Alister <alister.ware@ntlworld.com> wrote:
> On Thu, 31 Oct 2013 08:59:00 -0700, bhaktanishant wrote:
>
>> I want to extract the page URL. For example,
>> if I have this code:
>>
>> import urllib2
>> from bs4 import BeautifulSoup
>> link = "http://www.google.com"
>> page = urllib2.urlopen(link).read()
>> soup = BeautifulSoup(page)
>>
>> then I can extract the title of the page with:
>>
>> title = soup.title
>>
>> but I want to know how to extract the page URL from "soup", which
>> will be "http://www.google.com"
>
> I must be missing something here: the page URL is what you use to open
> the page in the first place, in your case link.



-- 
Joel Goldstick
http://joelgoldstick.com