Path: csiph.com!usenet.pasdenom.info!news.albasani.net!newsfeed.freenet.ag!news2.euro.net!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
In-Reply-To: <0becd1f8-e760-49c1-88d6-1c11b49e203c@googlegroups.com>
References: <0becd1f8-e760-49c1-88d6-1c11b49e203c@googlegroups.com>
Date: Tue, 23 Oct 2012 22:36:40 +0200
Subject: Re: regex function driving me nuts
From: Vlastimil Brom <vlastimil.brom@gmail.com>
To: "MartinD." <cyberdicks@gmail.com>
Content-Type: text/plain; charset=ISO-8859-1
Cc: python-list@python.org
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.2690.1351024603.27098.python-list@python.org>
Lines: 42
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:31957

2012/10/23 MartinD. <cyberdicks@gmail.com>:
> Hi,
>
> I'm new to Python.
> Does someone has an idea what's wrong.  I tried everything. The only regex that is tested is the last one in a whole list of regex in keywords.txt
> Thanks!
> Martin
>
>
> ########
> def checkKeywords( str, lstKeywords ):
>
>         for regex in lstKeywords:
>                 match = re.search(regex, str,re.IGNORECASE)
>                 # If-statement after search() tests if it succeeded
>                 if match:
>                         print match.group() ##just debugging
>                         return match.group() ## 'found!
>
>         return
>
> #########
>
> keywords1 = [line for line in open('keywords1.txt')]
> resultKeywords1 = checkKeywords("string_to_test",keywords1)
> print resultKeywords1
>
> --
> http://mail.python.org/mailman/listinfo/python-list

Hi,
just a wild guess, as I don't have access to  containing the list of
potentially problematic regex patterns
does:
keywords1 = [line.strip() for line in open('keywords1.txt')]
possibly fix yout problem?
the lines of the file iterator also preserve newlines, which might not
be expected in your keywords, strip() removes (be default) any
starting and tryiling whitespace.

hth,
  vbr