Path: csiph.com!fu-berlin.de!uni-berlin.de!not-for-mail
From: Ian Kelly <ian.g.kelly@gmail.com>
Newsgroups: comp.lang.python
Subject: Re: How to waste computer memory?
Date: Fri, 18 Mar 2016 01:00:05 -0600
Lines: 24
Message-ID: <mailman.302.1458284448.12893.python-list@python.org>
References: <a2639027-c69c-46df-a7a5-45a677b9e01d@googlegroups.com> <265377f4-741d-4aa2-9338-239f56f8bc57@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
In-Reply-To: <265377f4-741d-4aa2-9338-239f56f8bc57@googlegroups.com>
Precedence: list
Xref: csiph.com comp.lang.python:105182

On Thu, Mar 17, 2016 at 1:21 PM, Rick Johnson
<rantingrickjohnson@gmail.com> wrote:
> In the event that i change my mind about Unicode, and/or for
> the sake of others, who may want to know, please provide a
> list of languages that *YOU* think handle Unicode better than
> Python, starting with the best first. Thanks.

jmf has been asked this before, and as I recall he seems to feel that
UTF-8 should be used for all purposes, ignoring the limitations of
that encoding such as that indexing becomes a O(n) operation. He has
pointed at Go as an example of a language wherein Unicode "just
works", although I think that others do not necessarily agree [1].

He also seems to have a strange notion of the meaning of the word
"buggy". He frequently uses that word to describe the Python 3.3
Unicode implementation, although he can't seem to demonstrate any
actual bugs. Instead, he points at cherry-picked micro-benchmarks that
show Python's old "narrow" Unicode implementation (which did not
properly support SMP characters, unlike the "wide" implementation
which was a much greater memory hog than the version he's now
complaining about) outperforming the PEP-393 implementation while
completely ignoring any real-world benchmarks.

[1] https://coderwall.com/p/k7zvyg/dealing-with-unicode-in-go