Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!newsfeed.xs4all.nl!newsfeed4.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
In-Reply-To: <03c8b5d0-363e-4287-80d0-a43b0266f2a3@googlegroups.com>
References: <84eb4c69-d43d-4777-8a99-34eed9be73d6@googlegroups.com> <03c8b5d0-363e-4287-80d0-a43b0266f2a3@googlegroups.com>
Date: Sun, 23 Mar 2014 11:49:11 -0600
Subject: Re: help with for loop----python 2.7.2
From: Ian Kelly <ian.g.kelly@gmail.com>
To: Python <python-list@python.org>
Content-Type: multipart/alternative; boundary=047d7b111dd988e48a04f549bc3d
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.8418.1395596955.18130.python-list@python.org>
Lines: 89
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:68818

--047d7b111dd988e48a04f549bc3d
Content-Type: text/plain; charset=ISO-8859-1

On Mar 23, 2014 11:31 AM, "tad na" <teddybubu@gmail.com> wrote:
> OK . second problem :)
> I can print the date.  not sure how to do this one..

Why not? What happens when you try?

> try:
>     from urllib2 import urlopen
> except ImportError:
>     from urllib.request import urlopen
> import urllib2
> from bs4 import BeautifulSoup
>
> soup = BeautifulSoup(urlopen('http://bl.ocks.org/mbostock.rss'))
> #print soup.find_all('item')
> #print (soup)
> data = soup.find_all("item")
>
> x=0
> for item in soup.find_all('item'):
>     title = item.find('title').text
>     link = item.find('link').text
>     date = item.find('pubDate')
>    # print date
>     print('+++++++++++++++++')
>     print data[x].title.text
>     print data[x].link.text
>     print data[x].guid.text
>     print data[x].pubDate
>     x = x + 1

data[x] should be the same object as item, no? If you want to keep track of
the current iteration index, a cleaner way to do that is by using enumerate:

    for x, item in enumerate(soup.find_all('item')):

As far as printing the pubDate goes, why not start by getting its text
property as you do with the other tags? From there you can either print the
string out directly or parse it into a datetime object.

--047d7b111dd988e48a04f549bc3d
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<p dir=3D"ltr"><br>
On Mar 23, 2014 11:31 AM, &quot;tad na&quot; &lt;<a href=3D"mailto:teddybub=
u@gmail.com">teddybubu@gmail.com</a>&gt; wrote:<br>
&gt; OK . second problem :)<br>
&gt; I can print the date. =A0not sure how to do this one..</p>
<p dir=3D"ltr">Why not? What happens when you try?</p>
<p dir=3D"ltr">&gt; try:<br>
&gt; =A0 =A0 from urllib2 import urlopen<br>
&gt; except ImportError:<br>
&gt; =A0 =A0 from urllib.request import urlopen<br>
&gt; import urllib2<br>
&gt; from bs4 import BeautifulSoup<br>
&gt;<br>
&gt; soup =3D BeautifulSoup(urlopen(&#39;<a href=3D"http://bl.ocks.org/mbos=
tock.rss&#39;">http://bl.ocks.org/mbostock.rss&#39;</a>))<br>
&gt; #print soup.find_all(&#39;item&#39;)<br>
&gt; #print (soup)<br>
&gt; data =3D soup.find_all(&quot;item&quot;)<br>
&gt;<br>
&gt; x=3D0<br>
&gt; for item in soup.find_all(&#39;item&#39;):<br>
&gt; =A0 =A0 title =3D item.find(&#39;title&#39;).text<br>
&gt; =A0 =A0 link =3D item.find(&#39;link&#39;).text<br>
&gt; =A0 =A0 date =3D item.find(&#39;pubDate&#39;)<br>
&gt; =A0 =A0# print date<br>
&gt; =A0 =A0 print(&#39;+++++++++++++++++&#39;)<br>
&gt; =A0 =A0 print data[x].title.text<br>
&gt; =A0 =A0 print data[x].link.text<br>
&gt; =A0 =A0 print data[x].guid.text<br>
&gt; =A0 =A0 print data[x].pubDate<br>
&gt; =A0 =A0 x =3D x + 1</p>
<p dir=3D"ltr">data[x] should be the same object as item, no? If you want t=
o keep track of the current iteration index, a cleaner way to do that is by=
 using enumerate:</p>
<p dir=3D"ltr">=A0=A0=A0 for x, item in enumerate(soup.find_all(&#39;item&#=
39;)):</p>
<p dir=3D"ltr">As far as printing the pubDate goes, why not start by gettin=
g its text property as you do with the other tags? From there you can eithe=
r print the string out directly or parse it into a datetime object.<br>
</p>

--047d7b111dd988e48a04f549bc3d--