Path: csiph.com!usenet.pasdenom.info!aioe.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed6.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
Newsgroups: comp.lang.python
Date: Wed, 29 Aug 2012 08:43:05 -0700 (PDT)
In-Reply-To: <mailman.3929.1346241717.4697.python-list@python.org>
Complaints-To: groups-abuse@google.com
Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=62.203.125.238; posting-account=ung4FAoAAAC46zhHJ0Nsnuox7M5gDvs_
References: <mailman.3784.1345854291.4697.python-list@python.org> <1cb3f062-eb45-4b0c-977b-76afb099923c@googlegroups.com> <k1a40u$r47$2@ger.gmane.org> <mailman.3793.1345888006.4697.python-list@python.org> <f6266544-d67c-4589-a3ed-c14428ead237@googlegroups.com> <mailman.3816.1345933655.4697.python-list@python.org> <mailman.3831.1345964382.4697.python-list@python.org> <503a0d51$0$6574$c3e8da3$5496439d@news.astraweb.com> <mailman.3841.1345995646.4697.python-list@python.org> <503a8361$0$6574$c3e8da3$5496439d@news.astraweb.com> <mailman.3853.1346014938.4697.python-list@python.org> <2e92da71-fbd2-467f-9088-1c79fa7bcf69@googlegroups.com> <UIOdnTQtcNTRlKHNnZ2dnUVZ_vednZ2d@westnet.com.au> <a15ab72d-996e-4aff-a70b-440b7baa6d68@j9g2000pbg.googlegroups.com> <mailman.3920.1346213765.4697.python-list@python.org> <62566024-df1d-4948-a27a-45c7820ddc6c@googlegroups.com> <mailman.3929.1346241717.4697.python-list@python.org>
User-Agent: G2/1.0
MIME-Version: 1.0
Subject: Re: Flexible string representation, unicode, typography, ...
From: wxjmfauth@gmail.com
To: comp.lang.python@googlegroups.com
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
Cc: python-list@python.org, wxjmfauth@gmail.com, d@davea.name
Precedence: list
Message-ID: <mailman.3938.1346254994.4697.python-list@python.org>
Lines: 58
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:28067

Le mercredi 29 ao=FBt 2012 14:01:57 UTC+2, Dave Angel a =E9crit=A0:
> On 08/29/2012 07:40 AM, wxjmfauth@gmail.com wrote:
>=20
> > <snip>
>=20
>=20
>=20
> > Forget Python and all these benchmarks. The problem is on an other
>=20
> > level. Coding schemes, typography, usage of characters, ... For a
>=20
> > given coding scheme, all code points/characters are equivalent.
>=20
> > Expecting to handle a sub-range in a coding scheme without shaking
>=20
> > that coding scheme is impossible. If a coding scheme does not give
>=20
> > satisfaction, the only valid solution is to create a new coding
>=20
> > scheme, cp1252, mac-roman, EBCDIC, ... or the interesting "TeX" case,
>=20
> > where the "internal" coding depends on the fonts! Unicode (utf***), as
>=20
> > just one another coding scheme, does not escape to this rule. This
>=20
> > "Flexible String Representation" fails. Not only it is unable to stick
>=20
> > with a coding scheme, it is a mixing of coding schemes, the worst of
>=20
> > all possible implementations. jmf=20
>=20
>=20
>=20
> Nonsense.  The discussion was not about an encoding scheme, but an
>=20
> internal representation.  That representation does not change the
>=20
> programmer's interface in any way other than performance (cpu and memory
>=20
> usage).   Most of the rest of your babble is unsupported opinion.
>=20

I can hit the nail a little more.
I have even a better idea and I'm serious.

If "Python" has found a new way to cover the set
of the Unicode characters, why not proposing it
to the Unicode consortium?

Unicode has already three schemes covering practically
all cases: memory consumption, maximum flexibility and
an intermediate solution.
It would be to bad, to not share it.

What do you think? ;-)

jmf