Path: csiph.com!newsfeed.hal-mli.net!feeder3.hal-mli.net!newsfeed.hal-mli.net!feeder1.hal-mli.net!border3.nntp.dca.giganews.com!border1.nntp.dca.giganews.com!border4.nntp.dca.giganews.com!border2.nntp.dca.giganews.com!nntp.giganews.com!novia!news-out.readnews.com!news-xxxfer.readnews.com!panix!panix.com!presence From: Cydrome Leader Newsgroups: comp.text,comp.unix.shell Subject: Re: character classes & regular expressions Date: Sun, 6 May 2012 02:31:05 +0000 (UTC) Organization: PANIX Public Access Internet and UNIX, NYC Lines: 35 Message-ID: References: <20120503180629.531@kylheku.com> <7QGor.22191$em4.6868@newsfe21.iad> <20120503211141.506@kylheku.com> <86r4uz3kzo.fsf@gray.siamics.net> <20120504103639.939@kylheku.com> <86mx5n2w3m.fsf_-_@gray.siamics.net> NNTP-Posting-Host: panix1.panix.com X-Trace: reader1.panix.com 1336271465 20605 166.84.1.1 (6 May 2012 02:31:05 GMT) X-Complaints-To: abuse@panix.com NNTP-Posting-Date: Sun, 6 May 2012 02:31:05 +0000 (UTC) User-Agent: tin/1.8.3-20070201 ("Scotasay") (UNIX) (NetBSD/5.1.2 (i386)) Xref: csiph.com comp.text:26 comp.unix.shell:4922 In comp.unix.shell Ivan Shmakov wrote: >>>>>> Kaz Kylheku writes: >>>>>> On 2012-05-04, Ivan Shmakov wrote: >>>>>> Kaz Kylheku writes: > > [Cross-posting to news:comp.text, for the subject being > discussed is hardly specific to Unix shells; really, this time.] > > [...] > > >>> In my defense, I've never used this committee-designed dog of a > >>> syntax until today, which was only because I was groping for a > >>> quick workaround, and likely never will again. > > >>> (I also refuse to implement it in my regex engine, though I have > >>> caved in to Perl's \w, \d, \s, \W, \D and \S, which is probably as > >>> far as I will go.) > > >> How do you specify a "single character, either an upper-case letter > >> or a digit" within such a regular expression, then? > > > [A-Z0-9] > > [A-Z\d] > > It happens that the native languages of the most people of the > world either use extensions to the Latin script (beyond those in > ASCII, such as J or W), or use a script not derived from Latin > at all. (Greek-based scripts are not uncommon, for instance; > FWIW, the Latin script is based on the Greek one itself.) > > Good luck selling your product to anyone speaking French, Greek, > Polish or Russian. like greeks have money to buy software or a russian has ever made a legit software purchase.