Path: csiph.com!weretis.net!feeder6.news.weretis.net!news.misty.com!news.iecc.com!.POSTED.news.iecc.com!nerds-end From: Hans-Peter Diettrich Newsgroups: comp.compilers Subject: Re: State-of-the-art algorithms for lexical analysis? Date: Wed, 8 Jun 2022 05:32:40 +0200 Organization: Compilers Central Lines: 15 Sender: news@iecc.com Approved: comp.compilers@iecc.com Message-ID: <22-06-021@comp.compilers> References: <22-06-006@comp.compilers> <22-06-007@comp.compilers> <22-06-008@comp.compilers> <22-06-013@comp.compilers> <22-06-015@comp.compilers> <22-06-019@comp.compilers> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Injection-Info: gal.iecc.com; posting-host="news.iecc.com:2001:470:1f07:1126:0:676f:7373:6970"; logging-data="64848"; mail-complaints-to="abuse@iecc.com" Keywords: lex, history Posted-Date: 09 Jun 2022 12:33:49 EDT X-submission-address: compilers@iecc.com X-moderator-address: compilers-request@iecc.com X-FAQ-and-archives: http://compilers.iecc.com Xref: csiph.com comp.compilers:3058 On 6/7/22 6:40 PM, Christopher F Clark wrote: > And, as our moderator pointed out, this makes a terrible regular > expression, NFA, DFA, but it is actually quite easy in nearly any > programming language. Now I know what made me think of Hollerith constants with the "H" :-) I doubt that it's "quite easy" to use Hollerith constants for humans - how often do you have to check whether you got the right number of characters when reading or writing such a constant? So the delimited form of strings is easier to handle by both humans and DFA's, a win-win situation :-) DoDi