Path: csiph.com!usenet.pasdenom.info!weretis.net!feeder4.news.weretis.net!npeer.de.kpn-eurorings.net!npeer-ng0.de.kpn-eurorings.net!news.tele.dk!news.tele.dk!small.news.tele.dk!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.031 X-Spam-Evidence: '*H*': 0.94; '*S*': 0.00; 'encoding': 0.05; 'string.': 0.05; 'url:pipermail': 0.05; 'utf-8': 0.07; 'ascii': 0.09; 'differently.': 0.09; 'observation': 0.09; 'width': 0.09; 'python': 0.11; 'agree.': 0.16; 'complained': 0.16; 'denying': 0.16; 'wrote:': 0.18; 'users.': 0.18; 'now?': 0.24; 'string,': 0.24; 'regardless': 0.24; 'header:In-Reply-To:1': 0.27; 'rest': 0.29; 'points': 0.29; 'message-id:@mail.gmail.com': 0.30; 'largest': 0.30; "i'm": 0.30; 'code': 0.31; 'argue': 0.31; "d'aprano": 0.31; 'steven': 0.31; "we're": 0.32; 'url:python': 0.33; 'entirely': 0.33; 'fri,': 0.33; 'implemented': 0.33; 'sense': 0.34; 'except': 0.35; 'no,': 0.35; 'but': 0.35; 'received:google.com': 0.35; 'scheme': 0.36; 'view,': 0.36; 'similar': 0.36; 'url:org': 0.36; 'being': 0.38; 'to:addr:python- list': 0.38; 'issue': 0.38; 'fact': 0.38; 'pm,': 0.38; 'does': 0.39; 'expensive': 0.39; 'itself': 0.39; 'to:addr:python.org': 0.39; 'url:mail': 0.40; 'simply': 0.61; 'first': 0.61; 'back': 0.62; 'making': 0.63; 'myself': 0.63; 'believe': 0.68; 'benefit': 0.68; '26,': 0.68; 'jul': 0.74; 'day': 0.76; 'fourth': 0.84; 'points,': 0.84; 'watched': 0.91; '2013': 0.98 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=MbIK0XiBLr4lPRL3wE3H/QvsOWhtim70Gk0pmPMHVAA=; b=KrmKbGbycWJYSWCGbl4MJxxh0egwErdwq+jWDP6bHfpuI4D/lYVgUnUSOfbftpOfFh WHpYzMJgK6E3H4YT28nssH1VRkFRqltVeKqSBIz/9hElDV1+dt3AkMu7BOhBMPSJpLP0 COvsLAMiau7KeZd+LNk0bk4eOht0bMc7cmsY4uAzhUG8hVhAKQWL88ZKusjdaWcsB54E XLpoPoqfx0tjYNrYwU5+2PFYO79SlstRdmmLtQ3VssY11s4/cujryUb4ugHUj//9sY0t IAQNN9Ob0vpLB1Mo6eBif9AnsaTm30OEHQgZoVGm/ZuWCtebr9qdGacq85h4bSQdxA/t FyBw== X-Received: by 10.66.219.135 with SMTP id po7mr58470205pac.21.1374898396994; Fri, 26 Jul 2013 21:13:16 -0700 (PDT) MIME-Version: 1.0 In-Reply-To: <51f3406f$0$29971$c3e8da3$5496439d@news.astraweb.com> References: <571a6dfe-fd66-42cf-92fc-8b97cbe6e9e4@googlegroups.com> <51DFDE65.5040001@Gmail.com> <4f1067f6-bc99-42ad-9166-37fb228b90e8@googlegroups.com> <51f14395$0$29971$c3e8da3$5496439d@news.astraweb.com> <51f15e03$0$29971$c3e8da3$5496439d@news.astraweb.com> <51f1e371$0$29971$c3e8da3$5496439d@news.astraweb.com> <51f3406f$0$29971$c3e8da3$5496439d@news.astraweb.com> From: Ian Kelly Date: Fri, 26 Jul 2013 22:12:36 -0600 Subject: Re: RE Module Performance To: Python Content-Type: text/plain; charset=ISO-8859-1 X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 22 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1374898818 news.xs4all.nl 15968 [2001:888:2000:d::a6]:52258 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:51334 On Fri, Jul 26, 2013 at 9:37 PM, Steven D'Aprano wrote: > See the similarity now? Both flexibly change the width used by code- > points, UTF-8 based on the code-point itself regardless of the rest of > the string, Python based on the largest code-point in the string. No, I think we're just using the word "flexible" differently. In my view, simply being variable-width does not make an encoding "flexible" in the sense of the FSR. But I'm not going to keep repeating myself in order to argue about it. > Having watched this issue from Day One when JMF first complained about > it, I believe this is entirely about denying any benefit to ASCII users. > Had Python implemented a system identical to the current FSR except that > it added a fourth category, "all ASCII", which used an eight-byte > encoding scheme (thus making ASCII strings twice as expensive as strings > including code points from the Supplementary Multilingual Planes), JMF > would be the scheme's number one champion. I agree. In fact I made a similar observation back in December: http://mail.python.org/pipermail/python-list/2012-December/636942.html