Path: csiph.com!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!eternal-september.org!.POSTED!not-for-mail
From: Tim Rentsch
Newsgroups: comp.lang.c
Subject: Re: Rationale for aligning data on even bytes in a Unix shell file?
Date: Sun, 04 May 2025 21:47:13 -0700
Organization: A noiseless patient Spider
Lines: 54
Message-ID: <86ldrb64vi.fsf@linuxsc.com>
References: <214QP.3042635$TBhc.1825340@fx16.iad>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Injection-Date: Mon, 05 May 2025 06:47:19 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="b2f20e442295d938e3644788cf940be7"; logging-data="3880418"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19aJub0FgSSFmV3ek8wECxnuS9MrKhA3Nc="
User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.4 (gnu/linux)
Cancel-Lock: sha1:EYvcg/iN6IvvJfaWZxHz1kopD2U= sha1:0euitl8cqfPbXQks5wvoQ35Pty4=
Xref: csiph.com comp.lang.c:393158
scott@slp53.sl.home (Scott Lurndal) writes:
> Bonita Montero writes:
>
>> Am 29.04.2025 um 02:28 schrieb Scott Lurndal:
>>
>>> Bonita Montero writes:
>>>
>>>> Am 28.04.2025 um 20:47 schrieb Richard Harnden:
>>>>
>>>>> On 28/04/2025 19:36, Bonita Montero wrote:
>>>>>
>>>>>> Am 28.04.2025 um 18:59 schrieb Scott Lurndal:
>>>>>>
>>>>>>> Not really. UTF-8 is UTF-8, regardless of the locale.
>>>>>>
>>>>>> But UTF-8 isn't the standard locale for Unix filesystems
>>>>>> except with macOS.
>>>>>
>>>>> UTF-8 isn't a locale - it's an encoding.
>>>>
>>>> Idiot.
>>>> Type "locale" in the shell and thenn return.
>>>
>>> $ locale
>>> LANG=C
>>> LC_CTYPE="C"
>>> LC_NUMERIC=C
>>> LC_TIME="C"
>>> LC_COLLATE="C"
>>> LC_MONETARY="C"
>>> LC_MESSAGES="C"
>>> LC_PAPER="C"
>>> LC_NAME="C"
>>> LC_ADDRESS="C"
>>> LC_TELEPHONE="C"
>>> LC_MEASUREMENT="C"
>>> LC_IDENTIFICATION="C"
>>> LC_ALL=
>>
>> For me:
>>
>> boni@Raubtier-Asyl:/mnt/c/Users/Boni$ locale
>> LANG=C.UTF-8
>> LANGUAGE=
>
> Same locale, different encoding. As has been pointed out
> to you repeatedly.
I don't want to take sides in this exchange, but I feel obliged to
point out that, in the terminology of the ISO C standard, having a
different encoding implies being a different locale. So it might be
good to specify a frame of reference for where the term "locale" has
its source, for the various respective comments being offered.