From: Joshua Cranmer
Newsgroups: comp.lang.java.programmer
Subject: Re: A proposal to handle file encodings
Date: Sun, 25 Nov 2012 11:06:14 -0600

On 11/25/2012 3:07 AM, Peter J. Holzer wrote:
> On 2012-11-24 14:50, Roedy Green wrote:
>> On Sat, 24 Nov 2012 00:53:51 +0100, Jan Burse
>> wrote, quoted or indirectly quoted someone who said :
>>> I guess Roedy wants the charset.
>>
>> In HTTP the meta information is in the HTTP header. This is all very
>> well except that the server is just guessing.
>
> No. Normally it isn't guessing at all. It just uses the configured
> charset.

And how is the configured charset not guessing? If a server is serving
static files from a directory, I'm willing to bet that most
administrators won't bother changing the default setting and will
instead just hope that the default works.

I've had enough charset pains to know that much of the trouble
(particularly in English-language regions) comes from people blindly
using default settings. And I also know that not all tools agree on
their default settings.
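
To make the "defaults disagree" point concrete, here is a minimal sketch, not taken from any particular server or tool. It assumes a file that was saved as UTF-8 and then read back by something that assumes ISO-8859-1. The class name CharsetGuessDemo and the helper decode are made up for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class CharsetGuessDemo {
    // Hypothetical helper: decode the same raw bytes under a caller-chosen charset.
    static String decode(byte[] raw, Charset cs) {
        return new String(raw, cs);
    }

    public static void main(String[] args) {
        // "café" as it would sit on disk if the file had been saved as UTF-8.
        byte[] onDisk = "caf\u00e9".getBytes(StandardCharsets.UTF_8);

        // Whatever this JVM would pick if nobody specified a charset.
        System.out.println("platform default: " + Charset.defaultCharset());

        // Same bytes, two different assumptions about the encoding.
        String asUtf8 = decode(onDisk, StandardCharsets.UTF_8);
        String asLatin1 = decode(onDisk, StandardCharsets.ISO_8859_1);

        System.out.println("as UTF-8:      " + asUtf8 + " (" + asUtf8.length() + " chars)");
        System.out.println("as ISO-8859-1: " + asLatin1 + " (" + asLatin1.length() + " chars)");
    }
}
```

The bytes never change; only the assumption does. Decoded as UTF-8 the string has four characters, while decoded as ISO-8859-1 it has five, because the two bytes of the é become two separate characters. Pure ASCII content decodes identically either way, which is one reason a wrong default can go unnoticed for a long time.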
-- 
Beware of bugs in the above code; I have only proved it correct, not
tried it. -- Donald E. Knuth