Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.compilers > #3546
| From | cross@spitfire.i.gajendra.net |
|---|---|
| Newsgroups | comp.compilers |
| Subject | Re: Interesting paper on regex NFA matching |
| Date | 2024-01-26 04:15 +0000 |
| Organization | Compilers Central |
| Message-ID | <24-01-004@comp.compilers> (permalink) |
| References | <24-01-003@comp.compilers> |
In article <24-01-003@comp.compilers>, John R Levine <johnl@taugh.com> wrote: >The easiest way to implement regular expressions is to turn them into >NFAs, but that has well known problems that if you feed hostile input >to the NFAs, they can take exponential time and it's a way to DDoS the >server. If you translate the NFA to a DFA it runs in linear time, but >if the regex is ambiguous, as many are, the DFA may match something >different from the NFA. NFAs won't take exponential time if implemented carefully, though they may be superlinear. Still, we're talking quadratic, not exponential. Backtracking implementations can take exponential time, but those aren't all NFAs. Converting an NFA to a DFA could be exponential, however. https://swtch.com/~rsc/regexp/regexp1.html >This paper on the Arxiv preprint server proposes some rather complex >tweaks to NFA regex matching to fix many performance issues while giving >the same answers as before. > >https://arxiv.org/abs/2401.12639 If I'm reading the abstract right, they modify a backtracking implementation so that they can efficiently match some extended regex's. Note however, that such so-called "extended" regexs are not really regular expressions in the formal sense; that is, they do not describe members of the regular languages, but rather languages beyond those expressible as a DFA. Regardless this work is interesting, since AFAIK the speedup for this class of regex to linear time is novel. Thanks for passing it on. - Dan C.
Back to comp.compilers | Previous | Next — Previous in thread | Next in thread | Find similar
Interesting paper on regex NFA matching John R Levine <johnl@taugh.com> - 2024-01-25 22:10 -0500 Re: Interesting paper on regex NFA matching cross@spitfire.i.gajendra.net - 2024-01-26 04:15 +0000 Re: Interesting paper on regex NFA matching Kaz Kylheku <433-929-6894@kylheku.com> - 2024-01-26 04:16 +0000
csiph-web