Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #54495

Re: Antispam measures circumventing

Date 2013-09-20 17:45 +0100
From Jugurtha Hadjar <jugurtha.hadjar@gmail.com>
Subject Re: Antispam measures circumventing
References <523C6402.7090501@gmail.com>
Newsgroups comp.lang.python
Message-ID <mailman.193.1379695530.18130.python-list@python.org> (permalink)

Show all headers | View raw


Chris, Vlastimil, great insights gentlemen! Thanks

Chris Angelico wrote:

 >Instead of matching the ones that are the same as their uppercase
 >version, why not instead keep the ones that are the same as their
 >lowercase?


That's why I started off doing, and then lost track a bit. It didn't 
cross my mind that '.' and '@' are uncased characters and I'm a bit 
ashamed of not thinking about that before running the code

(i.e:

'.'.lower() gives False
'.'.upper() gives False

And the same for '@'. So unless you specifically "spare" them, they'll 
be whacked if you exclude upper case characters, or only include lower 
case characters).

 >Ah, now you're getting into the realm of CAPTCHAs. I'll be quite frank
 >with you: Don't bother. Many MANY experts are already looking into it

Yeah.. I thought of writing "My e-mail is my first name, dot, my last 
name at gmail dot com".

Some "riddling" can be viable to a certain extent. Or if your e-mail is 
ba86rockstar@gm.bu

ba, then 86, then rock, then star, at gm dot bu.

Or the e-mail can be generated dynamically calling a script that 
assembles pieces and displays it. This way, it can escape scrapers and 
all and will make it hard to manually harvest e-mails.. Which brings us 
to your next point about e-mail harvesters and that kind of labor (which 
is astounding !).




 > email = 'REMOVEMEjohn.doSPAMeSPAM@REMOVEMEhotmail.com'
 > ''.join(filter(lambda x: x==x.lower(),email))
 >'john.doe@hotmail.com'

Nice ! As well as Vlastimil's suggestions. The things I found on the net 
weren't that well written. There were *way* too many lines that made me 
think "No way. There's gotta be a better way".






-- 
~Jugurtha Hadjar,

Back to comp.lang.python | Previous | Next | Find similar | Unroll thread


Thread

Re: Antispam measures circumventing Jugurtha Hadjar <jugurtha.hadjar@gmail.com> - 2013-09-20 17:45 +0100

csiph-web