Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!newsfeed.xs4all.nl!newsfeed4.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
In-Reply-To: <CAN1F8qUqVnhjxXEUnub36GEwK_DD+ukKK2GwA5wYku0s=wpdgA@mail.gmail.com>
References: <lennh4$kpm$1@cabale.usenet-fr.net> <CAPTjJmomgZSFj7TanBn8_qXP4ULhCm85abJ5=2PcikXJkrH6GQ@mail.gmail.com> <CAN1F8qU8AJ+Z-Rb4+wGsYWd_bS5aOA3bsjd5NjF46MUvnecSeQ@mail.gmail.com> <mailman.7473.1393598638.18130.python-list@python.org> <XnsA2E95FA1E1EB6duncanbooth@127.0.0.1> <bnvctpF5vanU1@mid.individual.net> <mailman.7920.1394252278.18130.python-list@python.org> <87eh2d3x8h.fsf_-_@elektro.pacujo.net> <CAGGBd_qU3Zp3A4pymnDQfWynWZwFVrdHJpG=U0WZTap4HiymdA@mail.gmail.com> <lffv32$mqo$1@ger.gmane.org> <CAN1F8qU=2K6ysbpnu-JoUtfWfTToRsOaRek0dSQzjPx_sYzPKQ@mail.gmail.com> <CAMMy=OsYUzULRTRHwytiTszd44f3anz43=2DtGALHR3nBGe8JQ@mail.gmail.com> <CAN1F8qUqVnhjxXEUnub36GEwK_DD+ukKK2GwA5wYku0s=wpdgA@mail.gmail.com>
From: Daniel Stutzbach <stutzbach@google.com>
Date: Mon, 17 Mar 2014 18:01:24 -0700
Subject: Re: Balanced trees
To: Joshua Landau <joshua@landau.ws>
Content-Type: multipart/alternative; boundary=047d7b2e0b29a3b10004f4d715b1
Cc: python-list <python-list@python.org>
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.8233.1395104535.18130.python-list@python.org>
Lines: 226
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:68474

--047d7b2e0b29a3b10004f4d715b1
Content-Type: text/plain; charset=UTF-8

On Mon, Mar 17, 2014 at 5:08 PM, Joshua Landau <joshua@landau.ws> wrote:

> Thanks.  First, I want to state that there are two aspects to my
>  claim. The first is that these benchmarks to not represent typical
> use-cases. I will not go too far into this, though, because it's
> mostly obvious.
>

I would love to have include macro-benchmarks.  I keep waiting for the PyPy
benchmark suite to get ported to Python 3...


> "Create from an iterator" gives me relatively different results when I
> run it (Python 3).
>

The graphs were originally created to compare vanilla Python with a Python
modified to use blist as the built-in list type.  I think I used Python
3.1, but I'm not certain.  As I recall, the built-in type has a few small
advantages over any third-party extension type, so that might be what
you're seeing.  Alternately, something may have changed between Python
versions.


> "Delete a slice" is fudged from its inclusion of multiplication, which
> is far faster on blists. I admit that it's not obvious how to fix
> this.
>

I could move the initialization into the timed part, similar to what I did
for sort (see below).  That has downsides too, of course, but it might be
an improvement.


> "First in, first out (FIFO)" should be "x.append(0); x.pop(0)".
>

Wow, I mangled that one badly.


> "Last in, first out (LIFO)" should use "pop()" over "pop(-1)",
> although I admit it shouldn't make a meaningful difference.
>

I like pop(-1) because it's explicit rather than implicit.  I agree it
shouldn't make a meaningful difference.


> "Sort *" are really unfair because they put initialisation in the
> timed part


That's a limitation of timeit.  The setup step is only executed once.  If I
put the initialization there, every sort after the first one would be
sorting a pre-sorted list.  If you compare the "Create form an iterator"
and "Sort a random list", you'll see that the initialization cost is
dwarfed by the sorting cost for n > 15 or so.


> and all have keys.


If you use classes with __lt__ methods instead of keys, the cost is
dominated by the calls to __lt__.  You're right that I should include both,
though.


> >>> python -m timeit -s "from random import choice; import blist; lst =
> blist.blist(range(10**0))" "choice(lst)"
> 1000000 loops, best of 3: 1.18 usec per loop
>
> >>> python -m timeit -s "from random import choice; import blist; lst =
> blist.blist(range(10**8))" "choice(lst)"
> 1000000 loops, best of 3: 1.56 usec per loop
>
> Lower size ranges are hidden by the function-call overhead.
> Perhaps this effect is to do with caching, in which case the limits of
> the cache should be explained more readily.
>

That's definitely a cache issue, which is always a risk with
micro-benchmarks.  I see growth even for the built-in list:

gnusto:~$ python -m timeit -s "from random import choice; lst =
list(range(10**0))" "choice(lst)"
1000000 loops, best of 3: 0.349 usec per loop
gnusto:~$ python -m timeit -s "from random import choice; lst =
list(range(10**8))" "choice(lst)"
1000000 loops, best of 3: 0.634 usec per loop

I agree it's more interesting to pick items randomly instead of always
querying the same index.  The overhead of choice() is kind of a problem,
though.  Since I'm only plotting up to 10**5, I'd expect these to look more
or less flat.

Thanks for all of the feedback.  I filed a bug with myself to improve the
metrics:
https://github.com/DanielStutzbach/blist/issues/64

-- 
Daniel Stutzbach

--047d7b2e0b29a3b10004f4d715b1
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"gmail_quote">On M=
on, Mar 17, 2014 at 5:08 PM, Joshua Landau <span dir=3D"ltr">&lt;<a href=3D=
"mailto:joshua@landau.ws" target=3D"_blank">joshua@landau.ws</a>&gt;</span>=
 wrote:<br>

<blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-=
left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;p=
adding-left:1ex"><div class=3D""><div class=3D"h5"><span style=3D"color:rgb=
(34,34,34)">Thanks. =C2=A0First, I want to state that there are two aspects=
 to my</span><br>

</div></div>
claim. The first is that these benchmarks to not represent typical<br>
use-cases. I will not go too far into this, though, because it&#39;s<br>
mostly obvious.<br></blockquote><div><br></div><div>I would love to have in=
clude macro-benchmarks. =C2=A0I keep waiting for the PyPy benchmark suite t=
o get ported to Python 3...</div><div>=C2=A0</div><blockquote class=3D"gmai=
l_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-lef=
t-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">

&quot;Create from an iterator&quot; gives me relatively different results w=
hen I<br>
run it (Python 3).<br></blockquote><div><br></div><div>The graphs were orig=
inally created to compare vanilla Python with a Python modified to use blis=
t as the built-in list type. =C2=A0I think I used Python 3.1, but I&#39;m n=
ot certain. =C2=A0As I recall, the built-in type has a few small advantages=
 over any third-party extension type, so that might be what you&#39;re seei=
ng. =C2=A0Alternately, something may have changed between Python versions.<=
/div>

<div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px =
0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-l=
eft-style:solid;padding-left:1ex">
&quot;Delete a slice&quot; is fudged from its inclusion of multiplication, =
which<br>
is far faster on blists. I admit that it&#39;s not obvious how to fix<br>
this.<br></blockquote><div><br></div><div>I could move the initialization i=
nto the timed part, similar to what I did for sort (see below). =C2=A0That =
has downsides too, of course, but it might be an improvement.</div><div>=C2=
=A0</div>

<blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-=
left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;p=
adding-left:1ex">
&quot;First in, first out (FIFO)&quot; should be &quot;x.append(0); x.pop(0=
)&quot;.<br></blockquote><div><br></div><div>Wow, I mangled that one badly.=
</div><div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0p=
x 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);bo=
rder-left-style:solid;padding-left:1ex">


&quot;Last in, first out (LIFO)&quot; should use &quot;pop()&quot; over &qu=
ot;pop(-1)&quot;,<br>
although I admit it shouldn&#39;t make a meaningful difference.<br></blockq=
uote><div><br></div><div>I like pop(-1) because it&#39;s explicit rather th=
an implicit. =C2=A0I agree it shouldn&#39;t make a meaningful difference.</=
div>

<div>=C2=A0<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);bord=
er-left-style:solid;padding-left:1ex">
&quot;Sort *&quot; are really unfair because they put initialisation in the=
<br>
timed part</blockquote><div><br></div><div>That&#39;s a limitation of timei=
t. =C2=A0The setup step is only executed once. =C2=A0If I put the initializ=
ation there, every sort after the first one would be sorting a pre-sorted l=
ist. =C2=A0If you compare the &quot;Create form an iterator&quot; and &quot=
;Sort a random list&quot;, you&#39;ll see that the initialization cost is d=
warfed by the sorting cost for n &gt; 15 or so.</div>

<div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px =
0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-l=
eft-style:solid;padding-left:1ex"> and all have keys.</blockquote><div><br>=
</div>

<div>If you use classes with __lt__ methods instead of keys, the cost is do=
minated by the calls to __lt__. =C2=A0You&#39;re right that I should includ=
e both, though.</div><div>=C2=A0</div><blockquote class=3D"gmail_quote" sty=
le=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(=
204,204,204);border-left-style:solid;padding-left:1ex">

&gt;&gt;&gt; python -m timeit -s &quot;from random import choice; import bl=
ist; lst =3D blist.blist(range(10**0))&quot; &quot;choice(lst)&quot;<br>
1000000 loops, best of 3: 1.18 usec per loop<br>
<br>
&gt;&gt;&gt; python -m timeit -s &quot;from random import choice; import bl=
ist; lst =3D blist.blist(range(10**8))&quot; &quot;choice(lst)&quot;<br>
1000000 loops, best of 3: 1.56 usec per loop<br>
<br>
Lower size ranges are hidden by the function-call overhead.<br>
Perhaps this effect is to do with caching, in which case the limits of<br>
the cache should be explained more readily.<br></blockquote><div><br></div>=
<div>That&#39;s definitely a cache issue, which is always a risk with micro=
-benchmarks. =C2=A0I see growth even for the built-in list:</div><div><br><=
/div>

<div><div><div>gnusto:~$ python -m timeit -s &quot;from random import choic=
e; lst =3D list(range(10**0))&quot; &quot;choice(lst)&quot;</div><div>10000=
00 loops, best of 3: 0.349 usec per loop</div></div><div>gnusto:~$ python -=
m timeit -s &quot;from random import choice; lst =3D list(range(10**8))&quo=
t; &quot;choice(lst)&quot;</div>

<div>1000000 loops, best of 3: 0.634 usec per loop</div><div><br></div></di=
v><div>I agree it&#39;s more interesting to pick items randomly instead of =
always querying the same index. =C2=A0The overhead of choice() is kind of a=
 problem, though. =C2=A0Since I&#39;m only plotting up to 10**5, I&#39;d ex=
pect these to look more or less flat.</div>

<div><br></div><div>Thanks for all of the feedback. =C2=A0I filed a bug wit=
h myself to improve the metrics:</div><div><a href=3D"https://github.com/Da=
nielStutzbach/blist/issues/64">https://github.com/DanielStutzbach/blist/iss=
ues/64</a><br>

</div><div><br></div></div>-- <br>Daniel Stutzbach
</div></div>

--047d7b2e0b29a3b10004f4d715b1--