From: Ian Kelly
Newsgroups: comp.lang.python
Subject: Re: Unicode in Python
Date: Sat, 26 Apr 2014 09:38:29 -0400


On Apr 26, 2014 3:46 AM, "Frank Millman" <frank@chagford.com> wrote:
>
>
> <wxjmfauth@gmail.com> wrote in message
> news:03bb12d8-93be-4ef6-94ae-4a02789aea2d@googlegroups.com...
> > ==========
> >
> > I wrote once 90 % of Python 2 apps (a generic term) supposed to
> > process text, strings are not working.
> >
> > In Python 3, that's 100 %. It is somehow only by chance, apps may
> > give the illusion they are properly working.
> >
>
> It is quite frustrating when you make these statements without explaining
> what you mean by 'not working'.

As far as anybody has been able to determine, what jmf means by "not working" is that strings containing the € character are handled less efficiently than strings that do not contain it in certain contrived test cases.
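
For anyone who wants to see the kind of difference involved, here is a rough sketch. It assumes CPython 3.3 or later, where (as far as I know) PEP 393 stores each string using 1, 2, or 4 bytes per character depending on its widest code point. That this is the behaviour the complaint is about is my assumption, and per_char_cost is just a helper name made up for the illustration.

```python
import sys


def per_char_cost(ch, n=1000):
    """Estimate bytes used per character (hypothetical helper).

    Comparing two lengths cancels out the fixed object header, so only
    the per-character storage remains. Relies on sys.getsizeof, which
    reports CPython-specific sizes.
    """
    short = ch * n
    longer = ch * (2 * n)
    return (sys.getsizeof(longer) - sys.getsizeof(short)) / n


ascii_cost = per_char_cost("a")       # pure ASCII text
euro_cost = per_char_cost("\u20ac")   # the euro sign, U+20AC

print("ASCII bytes per char:", ascii_cost)
print("Euro  bytes per char:", euro_cost)
```

The euro string takes more memory per character, and some operations that mix it with narrower text have to widen the result. The strings still compare, slice, and encode correctly either way, so this is a question of efficiency rather than of anything "not working".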
