Re: Urllib's urlopen and urlretrieve

From rh <richard_hubbe11@lavabit.com>
Subject Re: Urllib's urlopen and urlretrieve
Date 2013-02-21 09:55 -0800
References <34998ea2-6b19-4a98-8ea0-389aca0192ca@googlegroups.com>
Newsgroups comp.lang.python
Message-ID <mailman.2187.1361469299.2939.python-list@python.org>


On Thu, 21 Feb 2013 04:12:52 -0800 (PST)
qoresucks@gmail.com wrote:

> I only just started Python and given that I know nothing about
> network programming or internet programming of any kind really, I
> thought it would be interesting to try to write something that could
> create an archive of a website for myself. With this I started trying
> to use the urllib library, however I am having a problem
> understanding why certain things won't work with the
> urllib.urlretrieve and urllib.urlopen then reading.
> 
> Why is it that when using urllib.urlopen then reading or
> urllib.urlretrieve, does it only give me parts of the sites, losing
> the formatting, images, etc...? How can I get around this?

In Python 2.7.3 the standard library module to use is urllib2; in 3.3
it is urllib (urlopen lives in urllib.request). Straight from the
2.7 doc page:

import urllib2
f = urllib2.urlopen('http://www.python.org/')
print f.read(100)
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<?xml-stylesheet href="./css/ht2html
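
For anyone on Python 3, here is a rough equivalent of the snippet above.
It is only a sketch: read_prefix is a name made up for illustration, not
something from the docs, and the example call is commented out because
it needs network access.

```python
from urllib.request import urlopen


def read_prefix(url, nbytes=100):
    """Return the first nbytes bytes of the response body for url.

    read_prefix is a hypothetical helper name used for illustration.
    """
    # In Python 3 the response object works as a context manager,
    # so the connection is closed when the block exits.
    with urlopen(url) as response:
        # read() returns bytes in Python 3, not str.
        return response.read(nbytes)


# Example call (needs network access):
# print(read_prefix('http://www.python.org/'))
```

Note that the result is a bytes object, so it has to be decoded before
it can be treated as text.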

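And so your journey begins. urlopen gives you only the HTML of that one
page; the images, stylesheets, and linked pages are separate URLs, so
you have to parse the page and recurse into the links yourself.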
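
Recursing into links starts with pulling the URLs out of the HTML you
fetched. Below is a minimal sketch of that step in Python 3 using only
the standard library. The names ResourceLinkParser and extract_links,
and the choice of tags to look at, are assumptions made for
illustration; a real archiver would also need to download each URL,
avoid revisiting pages, and rewrite links to point at local copies.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ResourceLinkParser(HTMLParser):
    """Collect absolute URLs referenced by a page (hypothetical helper)."""

    # Which attribute carries the URL for each tag we care about.
    # This table is an illustrative choice, not a complete list.
    URL_ATTRS = {'a': 'href', 'link': 'href', 'img': 'src', 'script': 'src'}

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.urls = []

    def handle_starttag(self, tag, attrs):
        wanted = self.URL_ATTRS.get(tag)
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value:
                # Resolve relative references against the page's own URL.
                self.urls.append(urljoin(self.base_url, value))


def extract_links(html_text, base_url):
    """Return the URLs found in html_text, in document order."""
    parser = ResourceLinkParser(base_url)
    parser.feed(html_text)
    parser.close()
    return parser.urls
```

Each URL returned this way is a separate request: stylesheets and images
can be saved as they are, while linked HTML pages can be fed back through
the same function to go one level deeper.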
> 
> Lastly, while it's a bit off topic, I lack a good understanding of
> network programming as a whole. From making programs communicate or
> to simply extract data from URL's, I don't know where to even begin,
> which has led me to learning python to better understand it
> hopefully then carry it over to other languages I know. Can anyone
> give me some advice on where to begin learning this information? Even
> if it's in another language.

Also, since you're new, you may want to work with Python 3, but that's
not a requirement.

There are lots of free books online; search this list for links.
(You can search this list at gmane and probably elsewhere.)
