Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #39439

Re: Urllib's urlopen and urlretrieve

Date 2013-02-21 10:56 -0500
From Dave Angel <davea@davea.name>
Subject Re: Urllib's urlopen and urlretrieve
References <34998ea2-6b19-4a98-8ea0-389aca0192ca@googlegroups.com>
Newsgroups comp.lang.python
Message-ID <mailman.2179.1361462189.2939.python-list@python.org> (permalink)

Show all headers | View raw


On 02/21/2013 07:12 AM, qoresucks@gmail.com wrote:
> I only just started Python and given that I know nothing about network programming or internet programming of any kind really, I thought it would be interesting to try write something that could create an archive of a website for myself.

Please send your emails as text, not html;  this is a text-based mailing 
list.

To archive your website, use the rsync command.  No need to write any 
code, as rsync will descend into all the directories as needed, and 
it'll get the actual website data, not the stuff that the web server 
feeds to the browsers.

If for some reason you don't have rsync, you could use scp.  But it 
doesn't seem to be able to preserve attributes.  It's also not smart 
enough to only copy stuff that's been changed, when you want to update 
incrementally.


-- 
DaveA

Back to comp.lang.python | Previous | NextPrevious in thread | Next in thread | Find similar | Unroll thread


Thread

Urllib's urlopen and urlretrieve qoresucks@gmail.com - 2013-02-21 04:12 -0800
  Re: Urllib's urlopen and urlretrieve Michael Herman <hermanmu@gmail.com> - 2013-02-21 04:59 -0800
    Re: Urllib's urlopen and urlretrieve qoresucks@gmail.com - 2013-02-21 21:09 -0800
      Re: Urllib's urlopen and urlretrieve Dave Angel <davea@davea.name> - 2013-02-22 12:05 -0500
      Re: Urllib's urlopen and urlretrieve MRAB <python@mrabarnett.plus.com> - 2013-02-22 17:18 +0000
    Re: Urllib's urlopen and urlretrieve qoresucks@gmail.com - 2013-02-21 21:09 -0800
  Re: Urllib's urlopen and urlretrieve Dave Angel <davea@davea.name> - 2013-02-21 10:56 -0500
  Re: Urllib's urlopen and urlretrieve rh <richard_hubbe11@lavabit.com> - 2013-02-21 09:47 -0800
  Re: Urllib's urlopen and urlretrieve rh <richard_hubbe11@lavabit.com> - 2013-02-21 09:55 -0800
  Re: Urllib's urlopen and urlretrieve Dave Angel <davea@davea.name> - 2013-02-21 13:04 -0500
  Re: Urllib's urlopen and urlretrieve Dave Angel <davea@davea.name> - 2013-02-21 13:53 -0500

csiph-web