| Path | csiph.com!usenet.pasdenom.info!aioe.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed4.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail |
|---|---|
| Return-Path | <bsk16@case.edu> |
| X-Original-To | python-list@python.org |
| Delivered-To | python-list@mail.python.org |
| X-Received | by 10.60.37.68 with SMTP id w4mr6872347oej.62.1366480974782; Sat, 20 Apr 2013 11:02:54 -0700 (PDT) |
| MIME-Version | 1.0 |
| X-Received | by 10.60.37.68 with SMTP id w4mr6872342oej.62.1366480974688; Sat, 20 Apr 2013 11:02:54 -0700 (PDT) |
| In-Reply-To | <5172CEE4.9070403@nedbatchelder.com> |
| References | <d9798b4e-2825-4a36-93a3-f8a03d37a4bc@b3g2000vbo.googlegroups.com> <5172CEE4.9070403@nedbatchelder.com> |
| Date | Sat, 20 Apr 2013 11:02:54 -0700 |
| Subject | Re: Is Unicode support so hard... |
| From | Benjamin Kaplan <benjamin.kaplan@case.edu> |
| To | "python-list@python.org" <python-list@python.org> |
| Content-Type | text/plain; charset=UTF-8 |
| Content-Transfer-Encoding | quoted-printable |
| X-Junkmail-Whitelist | YES (by domain whitelist at mpv2.tis.cwru.edu) |
| X-BeenThere | python-list@python.org |
| X-Mailman-Version | 2.1.15 |
| Precedence | list |
| List-Id | General discussion list for the Python programming language <python-list.python.org> |
| List-Unsubscribe | <http://mail.python.org/mailman/options/python-list>, <mailto:python-list-request@python.org?subject=unsubscribe> |
| List-Archive | <http://mail.python.org/pipermail/python-list/> |
| List-Post | <mailto:python-list@python.org> |
| List-Help | <mailto:python-list-request@python.org?subject=help> |
| List-Subscribe | <http://mail.python.org/mailman/listinfo/python-list>, <mailto:python-list-request@python.org?subject=subscribe> |
| Newsgroups | comp.lang.python |
| Message-ID | <mailman.858.1366481215.3114.python-list@python.org> (permalink) |
| Lines | 53 |
| NNTP-Posting-Host | 2001:888:2000:d::a6 |
| X-Trace | 1366481215 news.xs4all.nl 2260 [2001:888:2000:d::a6]:56966 |
| X-Complaints-To | abuse@xs4all.nl |
| Xref | csiph.com comp.lang.python:43961 |
On Sat, Apr 20, 2013 at 10:22 AM, Ned Batchelder <ned@nedbatchelder.com> wrote:
> On 4/20/2013 1:12 PM, jmfauth wrote:
>>
>> In a previous post,
>>
>> http://groups.google.com/group/comp.lang.python/browse_thread/thread/6aec70817705c226#
>> ,
>>
>> Chris “Kwpolska” Warrick wrote:
>>
>> “Is Unicode support so hard, especially in the 21st century?”
>>
>> --
>>
>> Unicode is not really complicated and it works very well (more
>> than two decades of development if you take into account
>> iso-14****).
>>
>> But, - I can say, "as usual" - people prefer to spend their
>> time to make a "better Unicode than Unicode" and it usually
>> fails. Python does not escape this rule.
>>
>> -----
>>
>> I'm "busy" with TeX (unicode engine variant), fonts and typography.
>> This gives me plenty of ideas to test the "flexible string
>> representation" (FSR). I should recognize this FSR is failing
>> particularly very well...
>>
>> I can almost say, a delight.
>>
>> jmf
>> Unicode lover
>
> I'm totally confused about what you are saying. What does "make a better
> Unicode than Unicode" mean? Are you saying that Python is guilty of this?
> In what way? Can you provide specifics? Or are you saying that you like
> how Python has implemented it? "FSR is failing ... a delight"? I don't
> know what you mean.
>
> --Ned.

Don't bother trying to figure this out. jmfauth has been hijacking
every thread that mentions Unicode to complain about the flexible
string representation introduced in Python 3.3. Apparently, having
proper Unicode semantics (indexing is based on code points, not code
units) at the expense of performance when calling .replace on the
only non-ASCII or non-BMP character in the string is a horrible bug.
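
For readers who have not met the flexible string representation (PEP 393), here is a rough sketch of the trade-off described above. It is illustrative only: `width_class` is a hypothetical helper written for this note, not part of Python, and `sys.getsizeof` reports CPython-specific numbers, so the printed sizes are an assumption about CPython 3.3 or later.

```python
import sys


def width_class(s: str) -> int:
    """Hypothetical helper: bytes per code point that a PEP 393 style
    representation would need for s (1, 2 or 4)."""
    highest = max(map(ord, s), default=0)
    if highest < 0x100:       # fits in Latin-1
        return 1
    if highest < 0x10000:     # fits in the Basic Multilingual Plane
        return 2
    return 4                  # needs the full code point range


ascii_only = "a" * 1000
one_bmp = "a" * 999 + "\u20ac"          # one BMP character (euro sign)
one_astral = "a" * 999 + "\U0001F600"   # one non-BMP character

for label, s in (("ascii", ascii_only), ("bmp", one_bmp), ("astral", one_astral)):
    # len() counts code points, so all three strings have length 1000.
    print(label, len(s), width_class(s), sys.getsizeof(s))

# Indexing returns a whole code point, never half of a surrogate pair.
last = one_astral[-1]

# Replacing the only wide character lets the result use the narrow form
# again; building that narrower copy is the cost complained about.
replaced = one_astral.replace("\U0001F600", "a")
print(width_class(one_astral), "->", width_class(replaced))
```

On CPython 3.3 or later the reported sizes are expected to grow with the width class, while the length stays at 1000 code points in every case.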
Is Unicode support so hard... jmfauth <wxjmfauth@gmail.com> - 2013-04-20 10:12 -0700
Re: Is Unicode support so hard... Ned Batchelder <ned@nedbatchelder.com> - 2013-04-20 13:22 -0400
Re: Is Unicode support so hard... Benjamin Kaplan <benjamin.kaplan@case.edu> - 2013-04-20 11:02 -0700
Re: Is Unicode support so hard... Chris Angelico <rosuav@gmail.com> - 2013-04-21 04:14 +1000
Re: Is Unicode support so hard... Chris “Kwpolska” Warrick <kwpolska@gmail.com> - 2013-04-20 20:15 +0200
Re: Is Unicode support so hard... Mark Lawrence <breamoreboy@yahoo.co.uk> - 2013-04-20 19:18 +0100
Re: Is Unicode support so hard... Neil Hodgson <nhodgson@iinet.net.au> - 2013-04-21 09:03 +1000
Re: Is Unicode support so hard... rusi <rustompmody@gmail.com> - 2013-04-20 18:37 -0700
Re: Is Unicode support so hard... Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2013-04-21 03:36 +0000
Re: Is Unicode support so hard... Chris Angelico <rosuav@gmail.com> - 2013-04-21 13:42 +1000
Re: Is Unicode support so hard... Terry Jan Reedy <tjreedy@udel.edu> - 2013-04-21 05:02 -0400
Re: Is Unicode support so hard... Mark Lawrence <breamoreboy@yahoo.co.uk> - 2013-04-21 13:03 +0100
Re: Is Unicode support so hard... Ethan Furman <ethan@stoneleaf.us> - 2013-04-20 18:06 -0700
Re: Is Unicode support so hard... 88888 Dihedral <dihedral88888@googlemail.com> - 2013-04-20 23:09 -0700