Path: csiph.com!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!eternal-september.org!.POSTED!not-for-mail From: Tim Rentsch Newsgroups: comp.lang.c Subject: Re: Rationale for aligning data on even bytes in a Unix shell file? Date: Sun, 04 May 2025 21:47:13 -0700 Organization: A noiseless patient Spider Lines: 54 Message-ID: <86ldrb64vi.fsf@linuxsc.com> References: <214QP.3042635$TBhc.1825340@fx16.iad> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Injection-Date: Mon, 05 May 2025 06:47:19 +0200 (CEST) Injection-Info: dont-email.me; posting-host="b2f20e442295d938e3644788cf940be7"; logging-data="3880418"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19aJub0FgSSFmV3ek8wECxnuS9MrKhA3Nc=" User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.4 (gnu/linux) Cancel-Lock: sha1:EYvcg/iN6IvvJfaWZxHz1kopD2U= sha1:0euitl8cqfPbXQks5wvoQ35Pty4= Xref: csiph.com comp.lang.c:393158 scott@slp53.sl.home (Scott Lurndal) writes: > Bonita Montero writes: > >> Am 29.04.2025 um 02:28 schrieb Scott Lurndal: >> >>> Bonita Montero writes: >>> >>>> Am 28.04.2025 um 20:47 schrieb Richard Harnden: >>>> >>>>> On 28/04/2025 19:36, Bonita Montero wrote: >>>>> >>>>>> Am 28.04.2025 um 18:59 schrieb Scott Lurndal: >>>>>> >>>>>>> Not really. UTF-8 is UTF-8, regardless of the locale. >>>>>> >>>>>> But UTF-8 isn't the standard locale for Unix filesystems >>>>>> except with macOS. >>>>> >>>>> UTF-8 isn't a locale - it's an encoding. >>>> >>>> Idiot. >>>> Type "locale" in the shell and thenn return. >>> >>> $ locale >>> LANG=C >>> LC_CTYPE="C" >>> LC_NUMERIC=C >>> LC_TIME="C" >>> LC_COLLATE="C" >>> LC_MONETARY="C" >>> LC_MESSAGES="C" >>> LC_PAPER="C" >>> LC_NAME="C" >>> LC_ADDRESS="C" >>> LC_TELEPHONE="C" >>> LC_MEASUREMENT="C" >>> LC_IDENTIFICATION="C" >>> LC_ALL= >> >> For me: >> >> boni@Raubtier-Asyl:/mnt/c/Users/Boni$ locale >> LANG=C.UTF-8 >> LANGUAGE= > > Same locale, different encoding. As has been pointed out > to you repeatedly. I don't want to take sides in this exchange, but I feel obliged to point out that, in the terminology of the ISO C standard, having a different encoding implies being a different locale. So it might be good to specify a frame of reference for where the term "locale" has its source, for the various respective comments being offered.