Path: csiph.com!fu-berlin.de!uni-berlin.de!not-for-mail
From: Chris Angelico <rosuav@gmail.com>
Newsgroups: comp.lang.python
Subject: Re: Screen scraper to get all 'a title' elements
Date: Thu, 26 Nov 2015 09:10:43 +1100
Lines: 17
Message-ID: <mailman.99.1448489447.20593.python-list@python.org>
References: <23ed6f4b-0ef2-4c9e-ade6-e597e7e03ca2@googlegroups.com> <a6f3a0a7-acc3-46db-a36b-c3d774293347@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
In-Reply-To: <a6f3a0a7-acc3-46db-a36b-c3d774293347@googlegroups.com>
Precedence: list
Xref: csiph.com comp.lang.python:99496

On Thu, Nov 26, 2015 at 9:04 AM, ryguy7272 <ryanshuell@gmail.com> wrote:
> Ok, I guess that makes sense.  So, I just tried the script below, and got nothing...
>
> import requests
> from bs4 import BeautifulSoup
>
> r = requests.get("https://en.wikipedia.org/wiki/Wikipedia:Unusual_place_names")
> soup = BeautifulSoup(r.content)
> print soup.find_all("a",{"title"})

The second argument to find_all is supposed to be a dict, not a set,
and it's only useful if you want to put some restriction on the
titles. To simply enumerate all the titles, try this:

[a.get("title") for a in soup.find_all("a")]

ChrisA