Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!news.glorb.com!newsfeed.xs4all.nl!newsfeed1.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
In-Reply-To: <f6f88950-979d-4f6d-8265-310d73ae3a60@w15g2000vbn.googlegroups.com>
References: <zrOdnZU2U6syDxvMnZ2dnUVZ_v2dnZ2d@giganews.com> <5186aeb6$0$29997$c3e8da3$5496439d@news.astraweb.com> <af561505-fffe-4e2a-8446-35a98da6ded7@googlegroups.com> <CAA=1kxSQE2uCkgTQbr3cfmg4PFQEj=OcYpTi5F+Chd+j0Q8Tuw@mail.gmail.com> <CAPTjJmoF1CK661GDovgw0MmveOv-tAOGSW2odquM4eh7kei-eg@mail.gmail.com> <mailman.1320.1367826603.3114.python-list@python.org> <f6f88950-979d-4f6d-8265-310d73ae3a60@w15g2000vbn.googlegroups.com>
Date: Tue, 7 May 2013 14:35:05 +0100
Subject: Re: Why do Perl programmers make more money than Python programmers
From: =?ISO-8859-1?Q?F=E1bio_Santos?= <fabiosantosart@gmail.com>
To: jmfauth <wxjmfauth@gmail.com>
Content-Type: multipart/alternative; boundary=047d7b6da4c89bddf804dc20e241
Cc: python-list@python.org
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.1410.1367933714.3114.python-list@python.org>
Lines: 167
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:44891

--047d7b6da4c89bddf804dc20e241
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

>
>
> -----
>
>
> 1) The memory gain for many of us (usually non ascii users)
> just become irrelevant.
>
> >>> sys.getsizeof('ma=C3=A7=C3=A3')
> 41
> >>> sys.getsizeof('abcd')
> 29
>
> 2) More critical, Py 3.3, just becomes non unicode compliant,
> (eg European languages or "ascii" typographers !)
>
> >>> import timeit
> >>> timeit.timeit("'abcd'*1000 + 'a'")
> 2.186670111428325
> >>> timeit.timeit("'abcd'*1000 + '=E2=82=AC'")
> 2.9951699820528432
> >>> timeit.timeit("'abcd'*1000 + '=C5=93'")
> 3.0036780444886233
> >>> timeit.timeit("'abcd'*1000 + '=E1=BA=9E'")
> 3.004992278824048
> >>> timeit.timeit("'ma=C3=A7=C3=A3'*1000 + '=C5=93'")
> 3.231025618708202
> >>> timeit.timeit("'ma=C3=A7=C3=A3'*1000 + '=E2=82=AC'")
> 3.215894398100758
> >>> timeit.timeit("'ma=C3=A7=C3=A3'*1000 + '=C5=93'")
> 3.224407974255655
> >>> timeit.timeit("'ma=C3=A7=C3=A3'*1000 + '=E2=80=99'")
> 3.2206342273566406
> >>> timeit.timeit("'abcd'*1000 + '=E2=80=99'")
> 2.9914403449067777
>
> 3) Python is "pround" to cover the whole unicode range,
> unfortunately it "breaks" the BMP range.
> Small GvR exemple (ascii) from the the bug list,
> but with non ascii characters.
>
> # Py 3.2, all chars
>
> >>> timeit.repeat("a =3D 'hundred'; 'x' in a")
> [0.09087790617297742, 0.07456871885972305, 0.07449940353376405]
> >>> timeit.repeat("a =3D 'ma=C3=A7=C3=A3=C3=A9=E2=82=AC=E1=BA=9E'; 'x' in=
 a")
> [0.10088136800095526, 0.07488497003487282, 0.07497594640028638]
>
>
> # Py 3.3 ascii and non ascii chars
> >>> timeit.repeat("a =3D 'hundred'; 'x' in a")
> [0.11426985953005442, 0.10040049292649655, 0.09920834808588097]
> >>> timeit.repeat("a =3D 'ma=C3=A7=C3=A3=C3=A9=E2=82=AC=E1=BA=9E'; '=C3=
=A9' in a")
> [0.2345595188256766, 0.21637172864154763, 0.2179096624382737]
>
>
> There are plenty of good reasons to use Python. There are
> also plenty of good reasons to not use (or now to drop)
> Python and to realize that if you wish to process text
> seriously, you are better served by using "corporate
> products" or tools using Unicode properly.
>
> jmf

This is so off-topic that, after reading this, I feel I have just returned
from the Moon.

OTOH, it would seem like you know the Portuguese word for apple, so I also
feel home.

I am so confused.

--047d7b6da4c89bddf804dc20e241
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<p dir=3D"ltr"><br>
&gt;<br>
&gt;<br>
&gt; -----<br>
&gt;<br>
&gt;<br>
&gt; 1) The memory gain for many of us (usually non ascii users)<br>
&gt; just become irrelevant.<br>
&gt;<br>
&gt; &gt;&gt;&gt; sys.getsizeof(&#39;ma=C3=A7=C3=A3&#39;)<br>
&gt; 41<br>
&gt; &gt;&gt;&gt; sys.getsizeof(&#39;abcd&#39;)<br>
&gt; 29<br>
&gt;<br>
&gt; 2) More critical, Py 3.3, just becomes non unicode compliant,<br>
&gt; (eg European languages or &quot;ascii&quot; typographers !)<br>
&gt;<br>
&gt; &gt;&gt;&gt; import timeit<br>
&gt; &gt;&gt;&gt; timeit.timeit(&quot;&#39;abcd&#39;*1000 + &#39;a&#39;&quo=
t;)<br>
&gt; 2.186670111428325<br>
&gt; &gt;&gt;&gt; timeit.timeit(&quot;&#39;abcd&#39;*1000 + &#39;=E2=82=AC&=
#39;&quot;)<br>
&gt; 2.9951699820528432<br>
&gt; &gt;&gt;&gt; timeit.timeit(&quot;&#39;abcd&#39;*1000 + &#39;=C5=93&#39=
;&quot;)<br>
&gt; 3.0036780444886233<br>
&gt; &gt;&gt;&gt; timeit.timeit(&quot;&#39;abcd&#39;*1000 + &#39;=E1=BA=9E&=
#39;&quot;)<br>
&gt; 3.004992278824048<br>
&gt; &gt;&gt;&gt; timeit.timeit(&quot;&#39;ma=C3=A7=C3=A3&#39;*1000 + &#39;=
=C5=93&#39;&quot;)<br>
&gt; 3.231025618708202<br>
&gt; &gt;&gt;&gt; timeit.timeit(&quot;&#39;ma=C3=A7=C3=A3&#39;*1000 + &#39;=
=E2=82=AC&#39;&quot;)<br>
&gt; 3.215894398100758<br>
&gt; &gt;&gt;&gt; timeit.timeit(&quot;&#39;ma=C3=A7=C3=A3&#39;*1000 + &#39;=
=C5=93&#39;&quot;)<br>
&gt; 3.224407974255655<br>
&gt; &gt;&gt;&gt; timeit.timeit(&quot;&#39;ma=C3=A7=C3=A3&#39;*1000 + &#39;=
=E2=80=99&#39;&quot;)<br>
&gt; 3.2206342273566406<br>
&gt; &gt;&gt;&gt; timeit.timeit(&quot;&#39;abcd&#39;*1000 + &#39;=E2=80=99&=
#39;&quot;)<br>
&gt; 2.9914403449067777<br>
&gt;<br>
&gt; 3) Python is &quot;pround&quot; to cover the whole unicode range,<br>
&gt; unfortunately it &quot;breaks&quot; the BMP range.<br>
&gt; Small GvR exemple (ascii) from the the bug list,<br>
&gt; but with non ascii characters.<br>
&gt;<br>
&gt; # Py 3.2, all chars<br>
&gt;<br>
&gt; &gt;&gt;&gt; timeit.repeat(&quot;a =3D &#39;hundred&#39;; &#39;x&#39; =
in a&quot;)<br>
&gt; [0.09087790617297742, 0.07456871885972305, 0.07449940353376405]<br>
&gt; &gt;&gt;&gt; timeit.repeat(&quot;a =3D &#39;ma=C3=A7=C3=A3=C3=A9=E2=82=
=AC=E1=BA=9E&#39;; &#39;x&#39; in a&quot;)<br>
&gt; [0.10088136800095526, 0.07488497003487282, 0.07497594640028638]<br>
&gt;<br>
&gt;<br>
&gt; # Py 3.3 ascii and non ascii chars<br>
&gt; &gt;&gt;&gt; timeit.repeat(&quot;a =3D &#39;hundred&#39;; &#39;x&#39; =
in a&quot;)<br>
&gt; [0.11426985953005442, 0.10040049292649655, 0.09920834808588097]<br>
&gt; &gt;&gt;&gt; timeit.repeat(&quot;a =3D &#39;ma=C3=A7=C3=A3=C3=A9=E2=82=
=AC=E1=BA=9E&#39;; &#39;=C3=A9&#39; in a&quot;)<br>
&gt; [0.2345595188256766, 0.21637172864154763, 0.2179096624382737]<br>
&gt;<br>
&gt;<br>
&gt; There are plenty of good reasons to use Python. There are<br>
&gt; also plenty of good reasons to not use (or now to drop)<br>
&gt; Python and to realize that if you wish to process text<br>
&gt; seriously, you are better served by using &quot;corporate<br>
&gt; products&quot; or tools using Unicode properly.<br>
&gt;<br>
&gt; jmf<br></p>
<p dir=3D"ltr">This is so off-topic that, after reading this, I feel I have=
 just returned from the Moon.</p>
<p dir=3D"ltr">OTOH, it would seem like you know the Portuguese word for ap=
ple, so I also feel home.</p>
<p dir=3D"ltr">I am so confused.<br>
</p>

--047d7b6da4c89bddf804dc20e241--