X-Received: by 10.236.126.72 with SMTP id a48mr6587416yhi.49.1398996169192; Thu, 01 May 2014 19:02:49 -0700 (PDT) X-Received: by 10.182.107.136 with SMTP id hc8mr40160obb.23.1398996169074; Thu, 01 May 2014 19:02:49 -0700 (PDT) Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!news.glorb.com!s7no485713qap.0!news-out.google.com!gi6ni709igc.0!nntp.google.com!c1no362966igq.0!postnews.google.com!glegroupsg2000goo.googlegroups.com!not-for-mail Newsgroups: comp.lang.python Date: Thu, 1 May 2014 19:02:48 -0700 (PDT) In-Reply-To: Complaints-To: groups-abuse@google.com Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=59.95.16.79; posting-account=mBpa7woAAAAGLEWUUKpmbxm-Quu5D8ui NNTP-Posting-Host: 59.95.16.79 References: <5361d4f9$0$11109$c3e8da3@news.astraweb.com> <82067b83-a6f5-4b16-b012-385535ea5607@googlegroups.com> User-Agent: G2/1.0 MIME-Version: 1.0 Message-ID: Subject: Re: Unicode 7 From: Rustom Mody Injection-Date: Fri, 02 May 2014 02:02:49 +0000 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Xref: csiph.com comp.lang.python:70834 On Friday, May 2, 2014 5:03:21 AM UTC+5:30, MRAB wrote: > On 2014-05-01 23:38, Terry Reedy wrote: > > On 5/1/2014 2:04 PM, Rustom Mody wrote: > >>>> Since its Unicode-troll time, here's my contribution > >>>> http://blog.languager.org/2014/04/unicode-and-unix-assumption.html > > I will not comment on the Unix-assumption part, but I think you go wron= g > > with this: "Unicode is a Headache". The major headache is that unicode > > and its very few encodings are not universally used. The headache is al= l > > the non-unicode legacy encodings still being used. So you better title > > this section 'Non-Unicode is a Headache'. > [snip] > I think he's right when he says "Unicode is a headache", but only > because it's being used to handle languages which are, themselves, a > "headache": left-to-right versus right-to-left, sometimes on the same > line; diacritics, possibly several on a glyph; etc. Yes, the headaches go a little further back than Unicode. There is a certain large old book... In which is described the building of a 'tower that reached up to heaven'..= . At which point 'it was decided'=B6 to do something to prevent that. And our headaches started. I dont know how one causally connects the 'headaches' but Ive seen - mojibake - unicode 'number-boxes' (what are these called?) - Worst of all what we *dont* see -- how many others dont see what we see? I never knew of any of this in the good ol days of ASCII =B6 Passive voice is often the best choice in the interests of political co= rrectness It would be a pleasant surprise if everyone sees a pilcrow at start of line= above