


Extract email address from JavaScript in HTML source using Python

Date Sat, 23 May 2015 12:16:06 +0530
Subject Extract email address from Java script in html source using python
From savitha devi <savithad8@gmail.com>
To python-list@python.org
Newsgroups comp.lang.python
Message-ID <mailman.265.1432369272.17265.python-list@python.org> (permalink)


I am developing a web scraper using HTMLParser. I need to extract the
text/email address from JavaScript embedded within the HTML code. I am a
beginner in Python and totally lost here, so I need some help with this. The
JavaScript code is as follows:

<script type='text/javascript'>
 //<!--
 document.getElementById('cloak48218').innerHTML = '';
 var prefix = '&#109;a' + 'i&#108;' + '&#116;o';
 var path = 'hr' + 'ef' + '=';
 var addy48218 = '&#105;nf&#111;' + '&#64;';
 addy48218 = addy48218 + 'tsv-n&#101;&#117;r&#105;&#101;d' + '&#46;' + 'd&#101;';
 document.getElementById('cloak48218').innerHTML += '<a ' + path + '\'' + prefix + ':' + addy48218 + '\'>' + addy48218+'<\/a>';
 //-->
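
One possible way to approach this, as a rough sketch rather than a definitive answer: collect the text of each script element with HTMLParser, pull the quoted string pieces out of the addyNNNNN assignments with a regular expression, join them, and decode the numeric character references with html.unescape. The names ScriptCollector and decode_cloaked_addresses are hypothetical, the code uses the Python 3 module layout, and the closing script tag in the sample is assumed because the snippet above ends before it. The sketch also assumes that each assignment only appends single-quoted literals without escaped quotes, as in the snippet.

```python
import html
import re
from html.parser import HTMLParser

# Sample input: the script from the post. The closing </script> tag is an
# assumption, since the posted snippet stops before it.
PAGE = r"""<script type='text/javascript'>
 //<!--
 document.getElementById('cloak48218').innerHTML = '';
 var prefix = '&#109;a' + 'i&#108;' + '&#116;o';
 var path = 'hr' + 'ef' + '=';
 var addy48218 = '&#105;nf&#111;' + '&#64;';
 addy48218 = addy48218 + 'tsv-n&#101;&#117;r&#105;&#101;d' + '&#46;' +
'd&#101;';
 document.getElementById('cloak48218').innerHTML += '<a ' + path +
'\'' + prefix + ':' + addy48218 + '\'>' + addy48218+'<\/a>';
 //-->
</script>"""


class ScriptCollector(HTMLParser):
    """Hypothetical helper: gathers the raw text of every <script> element."""

    def __init__(self):
        super().__init__()
        self.scripts = []
        self._inside_script = False

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self._inside_script = True
            self.scripts.append("")

    def handle_endtag(self, tag):
        if tag == "script":
            self._inside_script = False

    def handle_data(self, data):
        # Script text may arrive in more than one piece, so accumulate it.
        if self._inside_script:
            self.scripts[-1] += data


# An assignment to addyNNNNN: quoted literals, names and "+" up to the final
# semicolon. Semicolons inside the quotes (from &#NN;) are skipped because
# each quoted literal is matched as a unit.
ASSIGNMENT = re.compile(r"(?:var\s+)?(addy\d+)\s*=\s*((?:'[^']*'|[\w\s+])+);")
STRING_LITERAL = re.compile(r"'([^']*)'")


def decode_cloaked_addresses(script_text):
    """Return {variable name: decoded text} for each addyNNNNN variable.

    Assumes every assignment only appends string literals to the variable.
    """
    pieces = {}
    for name, expression in ASSIGNMENT.findall(script_text):
        pieces.setdefault(name, []).extend(STRING_LITERAL.findall(expression))
    return {name: html.unescape("".join(parts)) for name, parts in pieces.items()}


collector = ScriptCollector()
collector.feed(PAGE)
collector.close()

emails = {}
for script in collector.scripts:
    emails.update(decode_cloaked_addresses(script))
print(emails)
```

This does not execute the JavaScript; it only pattern-matches this particular cloaking layout, so a page that builds the address differently would need a different pattern or a real JavaScript engine.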



Thread

Extract email address from Java script in html source using python savitha devi <savithad8@gmail.com> - 2015-05-23 12:16 +0530
  Re: Extract email address from Java script in html source using python Peter Pearson <pkpearson@nowhere.invalid> - 2015-05-23 16:57 +0000
  Re: Extract email address from Java script in html source using python Denis McMahon <denismfmcmahon@gmail.com> - 2015-05-25 18:39 +0000
