Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.python > #108045
| From | DFS <nospam@dfs.com> |
|---|---|
| Newsgroups | comp.lang.python |
| Subject | Re: You gotta love a 2-line python solution |
| Date | 2016-05-02 23:56 -0400 |
| Organization | A noiseless patient Spider |
| Message-ID | <ng97bn$4fi$1@dont-email.me> (permalink) |
| References | <ng6hur$tiu$1@dont-email.me> <c1424b89-32fb-46b1-bc2f-7d1fb57dd268@googlegroups.com> <ng8tmt$bqa$1@dont-email.me> <c9aa3139-a713-4460-9165-93080529e610@googlegroups.com> |
On 5/2/2016 11:27 PM, jfong@ms4.hinet.net wrote:
> DFS at 2016/5/3 9:12:24AM wrote:
>> try
>>
>> from urllib.request import urlretrieve
>>
>> http://stackoverflow.com/questions/21171718/urllib-urlretrieve-file-python-3-3
>>
>>
>> I'm running python 2.7.11 (32-bit)
>
> Alright, it works...someway.
>
> I try to get a zip file. It works, the file can be unzipped correctly.
>
>>>> from urllib.request import urlretrieve
>>>> urlretrieve("http://www.caprilion.com.tw/fed.zip", "d:\\temp\\temp.zip")
> ('d:\\temp\\temp.zip', <http.client.HTTPMessage object at 0x03102C50>)
>>>>
>
> But when I try to get this forum page, it does get a html file but can't be viewed normally.
>
>>>> urlretrieve("https://groups.google.com/forum/#!topic/comp.lang.python/jFl3GJ
> bmR7A", "d:\\temp\\temp.html")
> ('d:\\temp\\temp.html', <http.client.HTTPMessage object at 0x03102A90>)
>>>>
>
> I suppose the html is a much complex situation where more processes need to be done before it can be opened by a web browser:-)
Who knows what Google has done... it won't open in Opera. The tab title
shows up, but after 20-30 seconds the screen just stays blank and the
cursor quits loading.
It's a mess - try running it thru BeautifulSoup.prettify() and it looks
better.
------------------------------------------------------------
import BeautifulSoup
from urllib.request import urlretrieve
webfile = "D:\\afile.html"
urllib.urlretrieve("https://groups.google.com/forum/#!topic/comp.lang.python/jFl3GJbmR7A",webfile)
f = open(webfile)
soup = BeautifulSoup.BeautifulSoup(f)
f.close()
print soup.prettify()
------------------------------------------------------------
Back to comp.lang.python | Previous | Next — Previous in thread | Next in thread | Find similar | Unroll thread
You gotta love a 2-line python solution DFS <nospam@dfs.com> - 2016-05-01 23:39 -0400
Re: You gotta love a 2-line python solution Stephen Hansen <me+python@ixokai.io> - 2016-05-01 21:31 -0700
Re: You gotta love a 2-line python solution DFS <nospam@dfs.com> - 2016-05-02 00:51 -0400
Re: You gotta love a 2-line python solution Stephen Hansen <me+python@ixokai.io> - 2016-05-01 22:02 -0700
Re: You gotta love a 2-line python solution DFS <nospam@dfs.com> - 2016-05-02 01:08 -0400
Re: You gotta love a 2-line python solution Stephen Hansen <me+python@ixokai.io> - 2016-05-01 22:21 -0700
Re: You gotta love a 2-line python solution Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2016-05-02 15:51 +1000
Re: You gotta love a 2-line python solution DFS <nospam@dfs.com> - 2016-05-02 01:23 -0400
Re: You gotta love a 2-line python solution Stephen Hansen <me+python@ixokai.io> - 2016-05-01 22:37 -0700
Re: You gotta love a 2-line python solution DFS <nospam@dfs.com> - 2016-05-02 02:13 -0400
Re: You gotta love a 2-line python solution Terry Reedy <tjreedy@udel.edu> - 2016-05-02 02:46 -0400
Re: You gotta love a 2-line python solution BartC <bc@freeuk.com> - 2016-05-02 10:26 +0100
Re: You gotta love a 2-line python solution Marko Rauhamaa <marko@pacujo.net> - 2016-05-02 13:12 +0300
Re: You gotta love a 2-line python solution Steven D'Aprano <steve@pearwood.info> - 2016-05-02 22:05 +1000
Re: You gotta love a 2-line python solution DFS <nospam@dfs.com> - 2016-05-02 11:15 -0400
Re: You gotta love a 2-line python solution Larry Martell <larry.martell@gmail.com> - 2016-05-02 11:24 -0400
Re: You gotta love a 2-line python solution Manolo MartÃnez <manolo@austrohungaro.com> - 2016-05-02 17:32 +0200
Re: You gotta love a 2-line python solution jfong@ms4.hinet.net - 2016-05-02 17:45 -0700
Re: You gotta love a 2-line python solution DFS <nospam@dfs.com> - 2016-05-02 21:12 -0400
Re: You gotta love a 2-line python solution jfong@ms4.hinet.net - 2016-05-02 20:27 -0700
Re: You gotta love a 2-line python solution Stephen Hansen <me+python@ixokai.io> - 2016-05-02 20:49 -0700
Re: You gotta love a 2-line python solution jfong@ms4.hinet.net - 2016-05-02 20:57 -0700
Re: You gotta love a 2-line python solution Stephen Hansen <me+python@ixokai.io> - 2016-05-03 09:09 -0700
Re: You gotta love a 2-line python solution DFS <nospam@dfs.com> - 2016-05-02 23:56 -0400
Re: You gotta love a 2-line python solution Steven D'Aprano <steve@pearwood.info> - 2016-05-04 11:20 +1000
Re: You gotta love a 2-line python solution Dennis Lee Bieber <wlfraed@ix.netcom.com> - 2016-05-03 08:14 -0400
csiph-web