Path: csiph.com!usenet.pasdenom.info!aioe.org!eternal-september.org!feeder.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Denis McMahon Newsgroups: comp.lang.python Subject: Re: python to enable javascript , tried selinium, ghost, pyQt4 already Date: Sat, 18 Jan 2014 21:40:38 +0000 (UTC) Organization: A noiseless patient Spider Lines: 24 Message-ID: References: <91184b5c-aa05-42e1-81de-15252023a15b@googlegroups.com> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Injection-Date: Sat, 18 Jan 2014 21:40:38 +0000 (UTC) Injection-Info: mx05.eternal-september.org; posting-host="66ffcfa4470a58bcddbdcd1913f98ab4"; logging-data="30470"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19NaA7Ggg28bIO533vdSGX5JC6vkwly694=" User-Agent: Pan/0.136 (I'm far too busy being delicious; GIT 926a150 git://git.gnome.org/pan2) Cancel-Lock: sha1:W+8VWajyiCLFdrDHSr8rq6Vrt5U= Xref: csiph.com comp.lang.python:64260 On Sun, 19 Jan 2014 05:13:57 +1100, Chris Angelico wrote: > On Sat, Jan 18, 2014 at 10:54 PM, Jaiprakash Singh > wrote: >> hi, >> >> can you please suggest me some method for study so that i can >> scrap a site having JavaScript behind it >> >> >> i have tried selenium, ghost, pyQt4, but it is slow and as a am >> working with thread it sinks my ram memory very fast. > > Do you mean "scrape"? You're trying to retrieve the displayed contents > of a web page that uses JavaScript? If so, that's basically impossible > without actually executing the JS code, which means largely replicating > the web browser. Oh, you think he meant scrape? I thought he was trying to scrap (as in throw away / replace) an old javascript heavy website with something using python instead. -- Denis McMahon, denismfmcmahon@gmail.com