Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!goblin2!goblin.stu.neva.ru!newsfeed1.swip.net!uio.no!news.tele.dk!news.tele.dk!small.news.tele.dk!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
Date: Thu, 28 Nov 2013 15:19:48 +0000
Subject: Downloading/Saving to a Directory
From: "TheRandomPast ." <wishingforsam@gmail.com>
To: python-list@python.org
Content-Type: multipart/alternative; boundary=047d7bae4922891d1804ec3e3e1d
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.3359.1385653859.18130.python-list@python.org>
Lines: 77
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:60698

--047d7bae4922891d1804ec3e3e1d
Content-Type: text/plain; charset=ISO-8859-1

Hi,

I've created a script that allows me to see how many images are on a
webpage and their URL however now I want to download all .jpg images from
this website and save them onto my computer. I've never done this before
and I've become a little confused as to where I should go next. Can some
kind person take a look at my code and tell me if I'm completely in the
wrong direction?

Just to clarify what I want to do is download all .jpg images on
dogpicturesite.com and save them to a directory on my computer.

Sorry if this is a really stupid question.

import traceback
import sys
from urllib import urlretrieve

try:

        print ' imagefiles()'
        images = re.findall(r'([-\w]+\.(?:jpg))', webpage)
        urlretrieve('http://dogpicturesite.com/', 'C:/images)
        print "Downloading Images....."
        time.sleep(5)
        print "Images Downloaded."
except:
        print "Failed to Download Images"
        raw_input('Press Enter to exit...')
        sys.exit()

def main():
    sys.argv.append('http://dogpicturesite.com/')
    if len(sys.argv) != 2:
        print '[-] Image Files'
        return
    page = webpage.webpage(sys.argv[1])
    imagefiles(webpage)

--047d7bae4922891d1804ec3e3e1d
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Hi,=A0<div><br></div><div>I&#39;ve created a script that a=
llows me to see how many images are on a webpage and their URL however now =
I want to download all .jpg images from this website and save them onto my =
computer. I&#39;ve never done this before and I&#39;ve become a little conf=
used as to where I should go next. Can some kind person take a look at my c=
ode and tell me if I&#39;m completely in the wrong direction?=A0</div>
<div><br></div><div>Just to clarify what I want to do is download all .jpg =
images on <a href=3D"http://dogpicturesite.com">dogpicturesite.com</a> and =
save them to a directory on my computer.=A0</div><div><br></div><div>Sorry =
if this is a really stupid question.=A0</div>
<div><br></div><div><div>import traceback</div><div>import sys</div><div>fr=
om urllib import urlretrieve</div><div><br></div><div>try:</div><div><br></=
div><div>=A0 =A0 =A0 =A0 print &#39; imagefiles()&#39;</div><div>=A0 =A0 =
=A0 =A0 images =3D re.findall(r&#39;([-\w]+\.(?:jpg))&#39;, webpage)</div>
<div>=A0 =A0 =A0 =A0 urlretrieve(&#39;<a href=3D"http://dogpicturesite.com/=
">http://dogpicturesite.com/</a>&#39;, &#39;C:/images)</div><div>=A0 =A0 =
=A0 =A0 print &quot;Downloading Images.....&quot;</div><div>=A0 =A0 =A0 =A0=
 time.sleep(5)</div><div>
=A0 =A0 =A0 =A0 print &quot;Images Downloaded.&quot;</div><div>except:</div=
><div>=A0 =A0 =A0 =A0 print &quot;Failed to Download Images&quot;</div><div=
>=A0 =A0 =A0 =A0 raw_input(&#39;Press Enter to exit...&#39;)</div><div>=A0 =
=A0 =A0 =A0 sys.exit()</div>
<div><br></div><div>def main():</div><div>=A0 =A0 sys.argv.append(&#39;<a h=
ref=3D"http://dogpicturesite.com/">http://dogpicturesite.com/</a>&#39;)</di=
v><div>=A0 =A0 if len(sys.argv) !=3D 2:</div><div>=A0 =A0 =A0 =A0 print =
9;[-] Image Files&#39;</div>
<div>=A0 =A0 =A0 =A0 return</div><div>=A0 =A0 page =3D webpage.webpage(sys.=
argv[1])</div><div>=A0 =A0 imagefiles(webpage)</div></div><div><br></div><d=
iv><br></div></div>

--047d7bae4922891d1804ec3e3e1d--