Path: csiph.com!usenet.pasdenom.info!weretis.net!feeder1.news.weretis.net!feeder.erje.net!newsfeed.xs4all.nl!newsfeed5.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
Date: Sun, 14 Oct 2012 18:31:20 +0100
From: MRAB <python@mrabarnett.plus.com>
User-Agent: Mozilla/5.0 (Windows NT 5.1; rv:16.0) Gecko/20121010 Thunderbird/16.0.1
MIME-Version: 1.0
To: python-list@python.org
Subject: Re: pyw program not displaying unicode characters properly
References: <MPG.2ae50ce060f7e130989681@news.free.fr>
In-Reply-To: <MPG.2ae50ce060f7e130989681@news.free.fr>
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
Precedence: list
Reply-To: python-list@python.org
Newsgroups: comp.lang.python
Message-ID: <mailman.2178.1350235875.27098.python-list@python.org>
Lines: 39
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:31255

On 2012-10-14 17:55, jjmeric wrote:
>
> Hi everybody !
>
> Our language lab at INALCO is using a nice language parsing and analysis
> program written in Python. As you well know a lot of languages use
> characters that can only be handled by unicode.
>
> Here is an example of the problem we have on some Windows computers.
> In the attached screen-shot (DELETED),
> the bambara character (a sort of epsilon)  is displayed as a square.
>
> The fact that it works fine on some computers and fails to display the
> characters on others suggests that it is a user configuration issue:
> Recent observations: it's OK on Windows 7 but not on Vista computers,
> it's OK on some Windows XP computers, it's not on others Windows XP...
>
> On the computers where it fails, we've tried to play with options in the
> International settings, but are not able to fix it.
>
> Any idea that would help us go in the right direction, or just fix it,
> is welcome !
>
> Thanks!
> I ni ce! (in bambara, a language spoken in Mali, West Africa)
>
A square is shown when the font being used doesn't contain a visible
glyph for the codepoint.

Which codepoint is it? What is the codepoint's name?

Here's how to find out:

 >>> hex(ord("Ɛ"))
'0x190'
 >>> import unicodedata
 >>> unicodedata.name("Ɛ")
'LATIN CAPITAL LETTER OPEN E'