Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #45884

Re: Utility to locate errors in regular expressions

From Roy Smith <roy@panix.com>
Newsgroups comp.lang.python
Subject Re: Utility to locate errors in regular expressions
Date 2013-05-24 09:12 -0400
Organization PANIX Public Access Internet and UNIX, NYC
Message-ID <roy-14A84E.09121624052013@news.panix.com> (permalink)
References <mailman.2062.1369400329.3114.python-list@python.org>

Show all headers | View raw


In article <mailman.2062.1369400329.3114.python-list@python.org>,
 Malte Forkel <malte.forkel@berlin.de> wrote:

> Finding out why a regular expression does not match a given string can
> very tedious. I would like to write a utility that identifies the
> sub-expression causing the non-match. My idea is to use a parser to
> create a tree representing the complete regular expression. Then I could
> simplify the expression by dropping sub-expressions one by one from
> right to left and from bottom to top until the remaining regex matches.
> The last sub-expression dropped should be (part of) the problem.
> 
> As a first step, I am looking for a parser for Python regular
> expressions, or a Python regex grammar to create a parser from.
> 
> But may be my idea is flawed? Or a similar (or better) tools already
> exists? Any advice will be highly appreciated!

I think this would be a really cool tool.  The debugging process I've 
always used is essentially what you describe.  I start try progressively 
shorter sub-patterns until I get a match, then try to incrementally add 
back little bits of the original pattern until it no longer matches.  
With luck, the problem will become obvious at that point.

Having a tool which automated this would be really useful.

Of course, most of Python user community are wimps and shy away from big 
hairy regexes [ducking and running].

Back to comp.lang.python | Previous | NextPrevious in thread | Next in thread | Find similar | Unroll thread


Thread

Utility to locate errors in regular expressions Malte Forkel <malte.forkel@berlin.de> - 2013-05-24 14:58 +0200
  Re: Utility to locate errors in regular expressions Roy Smith <roy@panix.com> - 2013-05-24 09:12 -0400
    Re: Utility to locate errors in regular expressions Neil Cerutti <neilc@norwich.edu> - 2013-05-24 13:58 +0000
  Re: Utility to locate errors in regular expressions rusi <rustompmody@gmail.com> - 2013-05-24 07:09 -0700
  Re: Utility to locate errors in regular expressions Christian Gollwitzer <auriocus@gmx.de> - 2013-05-24 20:21 +0200

csiph-web