Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #91105 > unrolled thread

Extract email address from Java script in html source using python

Started bysavitha devi <savithad8@gmail.com>
First post2015-05-23 12:16 +0530
Last post2015-05-25 18:39 +0000
Articles 3 — 3 participants

Back to article view | Back to comp.lang.python


Contents

  Extract email address from Java script in html source using python savitha devi <savithad8@gmail.com> - 2015-05-23 12:16 +0530
    Re: Extract email address from Java script in html source using python Peter Pearson <pkpearson@nowhere.invalid> - 2015-05-23 16:57 +0000
    Re: Extract email address from Java script in html source using python Denis McMahon <denismfmcmahon@gmail.com> - 2015-05-25 18:39 +0000

#91105 — Extract email address from Java script in html source using python

Fromsavitha devi <savithad8@gmail.com>
Date2015-05-23 12:16 +0530
SubjectExtract email address from Java script in html source using python
Message-ID<mailman.265.1432369272.17265.python-list@python.org>

[Multipart message — attachments visible in raw view] — view raw

I am developing a web scraper code using HTMLParser. I need to extract
text/email address from java script with in the HTMLCode.I am beginner
level in python coding and totally lost here. Need some help on this. The
java script code is as below:

<script type='text/javascript'>
 //<!--
 document.getElementById('cloak48218').innerHTML = '';
 var prefix = '&#109;a' + 'i&#108;' + '&#116;o';
 var path = 'hr' + 'ef' + '=';
 var addy48218 = '&#105;nf&#111;' + '&#64;';
 addy48218 = addy48218 + 'tsv-n&#101;&#117;r&#105;&#101;d' + '&#46;' +
'd&#101;';
 document.getElementById('cloak48218').innerHTML += '<a ' + path +
'\'' + prefix + ':' + addy48218 + '\'>' + addy48218+'<\/a>';
 //-->

[toc] | [next] | [standalone]


#91141

FromPeter Pearson <pkpearson@nowhere.invalid>
Date2015-05-23 16:57 +0000
Message-ID<csbprgF7c4pU1@mid.individual.net>
In reply to#91105
On Sat, 23 May 2015 12:16:06 +0530, savitha devi <savithad8@gmail.com> wrote:
>
> I am developing a web scraper code using HTMLParser. I need to extract
> text/email address from java script with in the HTMLCode.

Would be be correct in suspecting that you are assembling a list
of email addresses for use in spamming?  After all, that is the
problem that motivates people to hide their email addresses behind
Javascript.

-- 
To email me, substitute nowhere->runbox, invalid->com.

[toc] | [prev] | [next] | [standalone]


#91205

FromDenis McMahon <denismfmcmahon@gmail.com>
Date2015-05-25 18:39 +0000
Message-ID<mjvq9o$9i3$11@dont-email.me>
In reply to#91105
On Sat, 23 May 2015 12:16:06 +0530, savitha devi wrote:

> I am developing a web scraper code using HTMLParser. I need to extract
> text/email address from java script with in the HTMLCode.I am beginner
> level in python coding and totally lost here. Need some help on this.

(a) Try a less ambitious learning project.
(b) Start reading the relevant documentation.

Pick one and go with it.

-- 
Denis McMahon, denismfmcmahon@gmail.com

[toc] | [prev] | [standalone]


Back to top | Article view | comp.lang.python


csiph-web