MIME-Version: 1.0
Sender: joshua.landau.ws@gmail.com
In-Reply-To: <ksgk53$e6e$1@ger.gmane.org>
References: <mailman.4865.1374240179.3114.python-list@python.org> <51e967bb$0$29971$c3e8da3$5496439d@news.astraweb.com> <ksbt1a$5q1$1@ger.gmane.org> <CAN1F8qXttLWtMFDED-+gEdOR_5tZmDKtqDsF2kdojCRSMAn7eg@mail.gmail.com> <ksdtu7$ctm$1@ger.gmane.org> <CAN1F8qVv0N1D=JEkvWeQePhZg7fT7ULPPXp+ZTOmu=wQCzXiHQ@mail.gmail.com> <ksg3gh$ts9$1@ger.gmane.org> <CAN1F8qXcyMZ=6od+V75=9F2fwXHKiZ643PtbnrFer5i+S3i_BQ@mail.gmail.com> <ksgk53$e6e$1@ger.gmane.org>
From: Joshua Landau <joshua@landau.ws>
Date: Sun, 21 Jul 2013 13:49:29 +0100
Subject: Re: Find and Replace Simplification
To: Serhiy Storchaka <storchaka@gmail.com>
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
Cc: python-list <python-list@python.org>
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.4953.1374411017.3114.python-list@python.org>
Lines: 32
NNTP-Posting-Host: 2001:888:2000:d::a6
Path: csiph.com!usenet.pasdenom.info!news.franciliens.net!feed.ac-versailles.fr!nerim.net!novso.com!newsfeed.xs4all.nl!newsfeed4.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
Xref: csiph.com comp.lang.python:51011

On 21 July 2013 13:28, Serhiy Storchaka <storchaka@gmail.com> wrote:
> 21.07.13 14:29, Joshua Landau =D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0=D0=B2(=
=D0=BB=D0=B0):
>
>> On 21 July 2013 08:44, Serhiy Storchaka <storchaka@gmail.com> wrote:
>>>
>>> 20.07.13 20:03, Joshua Landau =D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0=D0=
=B2(=D0=BB=D0=B0):
>>>
>>>> Still, it seems to me that it should be optimizable for sensible
>>>> builtin types such that .translate is significantly faster, as there's
>>>> no theoretical extra work that .translate *has* to do that .replace
>>>> does not, and .replace also has to rebuild the string a lot of times.
>>>
>>>
>>> You should analyze overall mapping and reorder items in right order (if
>>> it
>>> possible), i.e. '&' should be replaced before '<' in html.escape. This
>>> extra
>>> work is too large for most real input.
>>
>>
>> I don't understand. What items are you reordering?
>
>
> mapping.items(). We can implement s.translate({ord('<'): '&lt;', ord('&')=
:
> '&amp;'}) as s.replace('&', '&amp;').replace('<', '&lt;'), but not as
> s.replace('<', '&lt;').replace('&', '&amp;').

I see -- that won't always be the case though, as there can be "loops"
aka "ab" -> "ba".