From: Paul Rubin
Newsgroups: comp.lang.python
Subject: Re: psss...I want to move from Perl to Python
Date: Sun, 31 Jan 2016 09:49:45 -0800

Cameron Simpson writes:
> Adzapper. It has many many regexps matching URLs. (Actually a more
> globlike syntax, but it gets turned into a regexp.) You plug it into
> your squid proxy.

Oh cool, is that out there in circulation? It sounds like the approach
of merging all the regexes into one and compiling to a FSM could be a
big win. I wouldn't expect too big a state space explosion.
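
For anyone following along, here is a rough sketch of the "merge all the regexes into one" step. The glob patterns and the helper names (glob_to_regex, merge_globs, is_ad) are made up for illustration and are not taken from Adzapper, whose real pattern syntax may differ. Note also that Python's re module is a backtracking matcher, so this only shows the merging; getting the actual FSM benefit would presumably need a DFA-based engine.

```python
import re

# Hypothetical glob-like patterns, invented for this example.
GLOBS = [
    "http://ads.example.com/*",
    "http://*.example.net/banners/*",
    "http://example.org/*/popup?.js",
]

def glob_to_regex(glob):
    """Translate a simple glob ('*' and '?' only) into a regex fragment."""
    out = []
    for ch in glob:
        if ch == "*":
            out.append(".*")           # any run of characters
        elif ch == "?":
            out.append(".")            # exactly one character
        else:
            out.append(re.escape(ch))  # literal character
    return "".join(out)

def merge_globs(globs):
    """Join every translated glob into one alternation and compile it once."""
    branches = "|".join("(?:%s)" % glob_to_regex(g) for g in globs)
    return re.compile(branches, re.DOTALL)

MERGED = merge_globs(GLOBS)

def is_ad(url):
    """Return True if the whole URL matches any of the merged patterns."""
    return MERGED.fullmatch(url) is not None
```

With this, each URL is tested with a single call against one compiled pattern rather than a loop over many separate ones.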