Path: csiph.com!usenet.pasdenom.info!news.albasani.net!.POSTED!not-for-mail From: Jan Burse Newsgroups: comp.lang.java.programmer Subject: Re: A proposal to handle file encodings Date: Mon, 26 Nov 2012 02:20:42 +0100 Organization: albasani.net Lines: 24 Message-ID: References: <9kava8lk1ignppq7rso7gmcb541gnerf8q@4ax.com> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Trace: news.albasani.net s1BO4HX8zeN0+HK0qujXYriC3k+BYd5mo2qeO1wCjBgy8gcYUkdpH6evyGHHo5Cu2W9m0G8niwtWsAavqPUyVw== NNTP-Posting-Date: Mon, 26 Nov 2012 01:20:43 +0000 (UTC) Injection-Info: news.albasani.net; logging-data="S3w5RoTfii7snPGeYmVnk8llmYT91yh35pri5ANr0FEdzdxCOBnxv0dK6SMs/Fye8nB0I9x6bmfiMHL8KHcM/eZJ8Jd/Kh/YLRJw9S8KvGczTGHT+LSCe6YIcUE666lD"; mail-complaints-to="abuse@albasani.net" User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:17.0) Gecko/17.0 Firefox/17.0 SeaMonkey/2.14 In-Reply-To: Cancel-Lock: sha1:LUMBadIoE+zMJtvFlTE3gCG3xHs= Xref: csiph.com comp.lang.java.programmer:19971 BTW: This is a nice read: http://www.transbay.net/~enf/ascii/ascii.pdf Shows history of ASCII, EBCDIC, ISO-646, etc.. Jan Burse schrieb: > Joshua Cranmer schrieb: >> >> Well, there's also the minor issue that some encodings use the same name >> for slightly (or sometimes greatly) different variants--I think Big5 is >> an offender here in having a few different variants in mapping >> multioctet chars to Unicode code points, and "ASCII" and "EBCDIC" are >> both laughably useless, since they pretend that the 8th bit is never set. > > According to Wiki: > > "Generally, this encoding form is rarely used, even on EBCDIC based > mainframes for which it was designed. IBM EBCDIC based mainframe > operating systems, like z/OS, usually use UTF-16 for complete Unicode > support. For example, DB2 UDB, COBOL, PL/I, Java and the IBM XML toolkit > support UTF-16 on IBM mainframes." > > http://en.wikipedia.org/wiki/UTF-EBCDIC