Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.java.programmer > #2817

Re: JavaScript and Screenscraping

From Dr J R Stockton <reply1113@merlyn.demon.co.uk>
Newsgroups comp.lang.java.programmer
Subject Re: JavaScript and Screenscraping
Date 2011-04-03 17:27 +0100
Organization Home
Message-ID <vovilNYX$JmNFwIi@invalid.uk.co.demon.merlyn.invalid> (permalink)
References <rvc6p6toumdlevjb48ohjnlf1gur128eqe@4ax.com> <AwQpTOIkQllNFw1B@invalid.uk.co.demon.merlyn.invalid> <t64dp61er3n5cbkpuippmpji0dlaijbsnm@4ax.com>

Show all headers | View raw


In comp.lang.java.programmer message <t64dp61er3n5cbkpuippmpji0dlaijbsnm
@4ax.com>, Fri, 1 Apr 2011 20:00:27, Roedy Green <see_website@mindprod.c
om.invalid> posted:

>On Fri, 1 Apr 2011 23:39:32 +0100, Dr J R Stockton
><reply1113@merlyn.demon.co.uk> wrote, quoted or indirectly quoted
>someone who said :
>
>>But JavaScript used as you describe does not necessarily generate HTML,
>>but can manipulate the DOM tree directly.
>>
>>Or are you thinking of server-side scripting with .php?
>
>I am just trying to go to motherboard manufacturer websites and
>collect specs from the webpages. The webpages often contain a lot of
>Javascript.  The data does not appear in any form. Presumably the Java
>script loads more Java script or resources then formats it.

Probably but not entirely presumably; if using an iframe, there could be
no need for reformatting.

Given a URL or two as examples, and a clear indication of what is to be
scraped, one might be able to understand the situation better.

-- 
 (c) John Stockton, nr London, UK.    ?@merlyn.demon.co.uk     Turnpike v6.05.
 Website  <http://www.merlyn.demon.co.uk/> - w. FAQish topics, links, acronyms
 PAS EXE etc. : <http://www.merlyn.demon.co.uk/programs/> - see in 00index.htm
 Dates - miscdate.htm estrdate.htm js-dates.htm pas-time.htm critdate.htm etc.

Back to comp.lang.java.programmer | Previous | NextPrevious in thread | Next in thread | Find similar


Thread

JavaScript and Screenscraping Roedy Green <see_website@mindprod.com.invalid> - 2011-03-30 06:51 -0700
  Re: JavaScript and Screenscraping Michal Kleczek <kleku75@gmail.com> - 2011-03-30 16:27 +0200
    Re: JavaScript and Screenscraping Tom Anderson <twic@urchin.earth.li> - 2011-03-31 00:28 +0100
  Re: JavaScript and Screenscraping Peter Duniho <NpOeStPeAdM@NnOwSlPiAnMk.com> - 2011-03-30 07:40 -0700
    Re: JavaScript and Screenscraping Roedy Green <see_website@mindprod.com.invalid> - 2011-03-30 18:27 -0700
  Re: JavaScript and Screenscraping Dr J R Stockton <reply1113@merlyn.demon.co.uk> - 2011-04-01 23:39 +0100
    Re: JavaScript and Screenscraping Roedy Green <see_website@mindprod.com.invalid> - 2011-04-01 20:00 -0700
      Re: JavaScript and Screenscraping Dr J R Stockton <reply1113@merlyn.demon.co.uk> - 2011-04-03 17:27 +0100
  Re: JavaScript and Screenscraping RedGrittyBrick <RedGrittyBrick@spamweary.invalid> - 2011-04-05 17:15 +0100

csiph-web