From: Antoon Pardon
Date: Wed, 04 Sep 2013 10:01:50 +0200
Newsgroups: comp.lang.python
Subject: Re: UnicodeDecodeError issue

On 03-09-13 17:23, wxjmfauth@gmail.com wrote:

> --------
>
> The Latin alphabet uses Greek lettering.
>
> The Cyrillic alphabet uses Greek lettering.
>
> Greek: One should not confuse modern Greek
> with ancient Greek, polytonic Greek full
> of diacritics.
>
> Plenty of European languages (~15) based on the Latin
> alphabet use some ancient Greek diacritics.
>
> Now unicode.
>
> Everything is working very smoothly with the endorsed coding
> schemes of Unicode.org.
>
> Expectedly it fails (behaves badly) with Python and its
> Flexible String Representation, mainly because it relies on
> the latin-1 (iso-8859-1) set.

You really seem obsessed. There is no reason at all to think that
this problem is related to the FSR. You are only bringing this up
because you are looking for opportunities to complain about the FSR.

> To take the problem the other way, one can take these
> linguistic aspects to illustrate the wrong design of
> the FSR.

No, you can't. You are just assuming so because you feel it would
confirm your bias against the FSR.

-- 
Antoon Pardon