Path: csiph.com!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail From: Stefan Wiens Newsgroups: de.comp.os.unix.shell Subject: Re: System und Shell-Scripte: Latin-1 zu UTF-8 Date: Sun, 08 Mar 2026 10:47:53 +0100 Organization: none Lines: 34 Message-ID: <87jyvmpsnd.fsf@s-bot.de> References: <87ms0jqp7f.fsf@s-bot.de> <1ppu7m-b4u.ln1@tempo.martinkl.dialup.fu-berlin.de> <83fr6b11ti.fsf@helmutwaitzmann.news.arcor.de> <87h5qrq61l.fsf@s-bot.de> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: individual.net bBrqy+weUsakWokbLsKXigs67OWhjJsc/jfv64TdbsPQgcbP0= Cancel-Lock: sha1:OIbvgYeq5H3uzq+Aq2u+Q4FhaqI= sha1:HA3ApmgehYeyZfGcRwOfQnsQoSw= sha256:JW93ixZTtmZxqo1bzEiXQFQMmplhIDFfSGPrAhGl/Rg= User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.2 (gnu/linux) Xref: csiph.com de.comp.os.unix.shell:14665 Ingrid Stefan Wiens writes: > tr(1) ist ein gutes Beispiel, wo es hakeln könnte. > Hier (GNU coreutils) 9.1: [...] > $ echo klöä | LC_ALL=de_DE.UTF8 tr [:lower:] [:upper:] > KLöä > § Da gibt es eigentlich keinen Gestaltungsspielraum: ,----[ ] | Otherwise, only character class names lower or upper are valid in | string2 and then only if the corresponding character class ( upper and | lower, respectively) is specified in the same relative position in | string1. Such a specification shall be interpreted as a request for | case conversion. When [: lower:] appears in string1 and [: upper:] | appears in string2, the arrays shall contain the characters from the | toupper mapping in the LC_CTYPE category of the current locale. When | [: upper:] appears in string1 and [: lower:] appears in string2, the | arrays shall contain the characters from the tolower mapping in the | LC_CTYPE category of the current locale. The first character from each | mapping pair shall be in the array for string1 and the second | character from each mapping pair shall be in the array for string2 in | the same relative position. `---- Die exzessiven Leerzeichen bei "[: lower:]" etc. stehen dort fälschlicherweise. Jedenfalls ist GNU tr im Unrecht. -- Stefan