Path: csiph.com!usenet.pasdenom.info!gegeweb.org!eternal-september.org!feeder.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Marko Rauhamaa Newsgroups: comp.lang.python Subject: Re: Micro Python -- a lean and efficient implementation of Python 3 Date: Wed, 04 Jun 2014 15:17:31 +0300 Organization: A noiseless patient Spider Lines: 18 Message-ID: <871tv4vpgk.fsf@elektro.pacujo.net> References: <20140603194949.3147497d@x34f> <44acd692-5dcd-4e5f-8238-7fbe0de4db2a@googlegroups.com> <538ecdef$0$11109$c3e8da3@news.astraweb.com> <7xoay9w1h0.fsf@ruckus.brouhaha.com> <20140604060152.2cdf8198@bigbox.christie.dr> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Info: mx05.eternal-september.org; posting-host="ff5cf27ef3d5b31f034d3b72bdc27a41"; logging-data="21587"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+emx8z/aig6JPo0LV6kLCX" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (gnu/linux) Cancel-Lock: sha1:BmDkaHsd9M85+efuWofHY0BviV4= sha1:J1r/JKVxWq/Z5q73F92jPdDVIw0= Xref: csiph.com comp.lang.python:72630 Robin Becker : >>>> u'\xc5ngstr\xf6m'==u'\xc5ngstro\u0308m' > False Now *that* would be a valid reason for our resident Unicode expert to complain! Py3 in no way solves text representation issues definitively. > I know this is artificial Not at all. It probably is out of scope for Python, but it is a real cause for human suffering. What's Unicode for "résumé"? Note, for example, that Google manages to sort out issues like these. It sees past diacritics and even case ending. Marko