Path: csiph.com!fu-berlin.de!uni-berlin.de!not-for-mail
From: Chris Angelico <rosuav@gmail.com>
Newsgroups: comp.lang.python
Subject: Re: Not x.islower() has different output than x.isupper() in list output...
Date: Thu, 5 May 2016 00:37:17 +1000
Lines: 25
Message-ID: <mailman.384.1462372639.32212.python-list@python.org>
References: <572407AE.1070703@icloud.com> <1461979797.3824480.593944273.0B8D8DF3@webmail.messagingengine.com> <57241097.7020801@icloud.com> <mailman.242.1461981344.32212.python-list@python.org> <e1e5bfe4-7998-4cf7-a4f8-53cf5426c7c5@googlegroups.com> <CAPTjJmq4ce8qBTFW5aszP4tZSu8tB5Dc6Y9im4pH1znDGbR0GQ@mail.gmail.com> <mailman.339.1462271120.32212.python-list@python.org> <lf5r3dje63l.fsf@ling.helsinki.fi> <CAPTjJmqt77gQSm11VDMYviZeFq6uTYUJdkPa8toUA2TejS1Lnw@mail.gmail.com> <mailman.341.1462276849.32212.python-list@python.org> <nga79f$gau$1@dont-email.me> <CAPTjJmq0GJ+22m7YJw_v3+ykSrUySRA665PqZKsJ+SCCw9np7Q@mail.gmail.com> <mailman.344.1462281210.32212.python-list@python.org> <nga89p$ku2$1@dont-email.me> <lf5eg9jdwmr.fsf@ling.helsinki.fi> <57296c7a$0$1589$c3e8da3$5496439d@news.astraweb.com> <ngcvjm$3ou$1@dont-email.me> <CAPTjJmo7LE-cJLE+v3yO4uqUeRn8v6TvNm=SvpcOzomVTQbj9g@mail.gmail.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
In-Reply-To: <ngcvjm$3ou$1@dont-email.me>
Precedence: list
Xref: csiph.com comp.lang.python:108132

On Thu, May 5, 2016 at 12:09 AM, DFS <nospam@dfs.com> wrote:
> On 5/3/2016 11:28 PM, Steven D'Aprano wrote:
>> [ lengthy piece about text, Unicode, and letter case ]
>
> Linguist much?

As an English-only speaker who writes code that needs to be used
around the world, you end up accruing tidbits of language and text
trivia in the form of edge cases that you need to remember to test.
Among them:

* Turkish dotless and dotted i
* Greek medial and final sigma
* German eszett
* Hebrew and Arabic right-to-left text
* Chinese non-BMP characters
* Combining characters (eg diacriticals starting U+0300)
* Non-characters eg U+FFFE

And then a post like Steven's basically comes from pulling up all
those from your memory, and maybe doing a spot of quick testing and/or
research to get some explanatory details. You don't have to be a
linguist, necessarily - just a competent debugger.

ChrisA