Path: csiph.com!usenet.pasdenom.info!aioe.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed5.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
Date: Wed, 17 Oct 2012 11:00:11 -0400
From: Dave Angel <d@davea.name>
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:14.0) Gecko/20120714 Thunderbird/14.0
MIME-Version: 1.0
To: nwaits <nowaits@gmail.com>
Subject: Re: Script for finding words of any size that do NOT contain vowels with acute diacritic marks?
References: <a7454cb7-e6dc-4167-b72a-56a67a5873a7@googlegroups.com>
In-Reply-To: <a7454cb7-e6dc-4167-b72a-56a67a5873a7@googlegroups.com>
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit
Cc: python-list@python.org
Precedence: list
Reply-To: d@davea.name
Newsgroups: comp.lang.python
Message-ID: <mailman.2350.1350486045.27098.python-list@python.org>
Lines: 17
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:31516

On 10/17/2012 10:31 AM, nwaits wrote:
> I'm very impressed with python's wordlist script for plain text.  Is there a script for finding words that do NOT have certain diacritic marks, like acute or grave accents (utf-8), over the vowels?  
> Thank you.

If you can construct a list of "illegal" characters, then you can simply
check each character of the word against the list, and if none of the
characters is in the list, the word is a winner.
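
A minimal sketch of that check, assuming Python 3 and words that are
already decoded to str.  The ILLEGAL set and the names is_clean and
clean_words are just illustrative; put whatever characters you want to
reject in the set.  The normalize call is an extra precaution of mine,
in case an accent is stored as a separate combining character.

```python
import unicodedata

# Hypothetical "illegal" characters: vowels with acute or grave accents.
ILLEGAL = set("áéíóúÁÉÍÓÚàèìòùÀÈÌÒÙ")

def is_clean(word):
    """Return True if no character of word is in ILLEGAL."""
    # Compose "e" + combining accent into a single character first,
    # so both spellings of an accented vowel are caught.
    word = unicodedata.normalize("NFC", word)
    return not any(ch in ILLEGAL for ch in word)

def clean_words(words):
    """Keep only the words that contain no illegal character."""
    return [w for w in words if is_clean(w)]
```

Calling clean_words on a list of words gives back just the ones with
no accented vowel from the set.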

If that's not fast enough, you can build a translation table from the
list of illegal characters, and use translate on each word.  Then it
becomes a question of checking whether the translated word is all zeroes
(assuming the table maps every legal character to zero).  More setup
time, but much faster looping for each word.
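
Here is a rough sketch of the translate idea, assuming Python 3.  Note
it is a variant, not the all-zeroes check: the table deletes the illegal
characters, and a word passes if it comes through translate unchanged.
ILLEGAL, DELETE_ILLEGAL and is_clean are illustrative names, and the
text is assumed to use precomposed accented characters.

```python
# Hypothetical "illegal" characters: vowels with acute or grave accents.
ILLEGAL = "áéíóúÁÉÍÓÚàèìòùÀÈÌÒÙ"

# Built once, up front.  With three arguments, str.maketrans treats the
# third as the set of characters to delete.
DELETE_ILLEGAL = str.maketrans("", "", ILLEGAL)

def is_clean(word):
    """Return True if translate removed nothing from word."""
    return word.translate(DELETE_ILLEGAL) == word

def clean_words(words):
    """Keep only the words that contain no illegal character."""
    return [w for w in words if is_clean(w)]
```

The table is the setup cost; after that each word needs only one
translate call and one comparison.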

-- 

DaveA