Path: csiph.com!usenet.pasdenom.info!weretis.net!feeder1.news.weretis.net!feeder.erje.net!eu.feeder.erje.net!eweka.nl!lightspeed.eweka.nl!194.109.133.87.MISMATCH!newsfeed.xs4all.nl!newsfeed1.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.002 X-Spam-Evidence: '*H*': 1.00; '*S*': 0.00; 'from:addr:yahoo.co.uk': 0.04; 'skip:[ 20': 0.04; 'subject:Python': 0.06; 'correct,': 0.09; 'directions': 0.09; 'lawrence': 0.09; 'received:80.91': 0.09; 'received:80.91.229': 0.09; 'received:gmane.org': 0.09; 'received:list': 0.09; 'subject: [': 0.09; 'python': 0.11; '(just': 0.16; 'opposite': 0.16; 'received:80.91.229.3': 0.16; 'received:plane.gmane.org': 0.16; 'subject: \n ': 0.16; 'wrote:': 0.18; 'header:User-Agent:1': 0.23; 'mathematical': 0.24; 'unicode': 0.24; 'url:moin': 0.24; '(or': 0.24; '---': 0.24; 'subject:/': 0.26; 'header:X-Complaints-To:1': 0.27; 'header:In- Reply-To:1': 0.27; "i'm": 0.30; 'url:wiki': 0.31; '>>>>': 0.31; 'text': 0.33; 'url:python': 0.33; 'comment': 0.34; '"the': 0.34; 'skip:d 20': 0.34; 'problem.': 0.35; 'german': 0.35; 'really': 0.36; 'url:org': 0.36; 'subject:]': 0.38; 'to:addr:python-list': 0.38; 'to:addr:python.org': 0.39; 'received:org': 0.40; 'read': 0.60; 'french': 0.61; "you're": 0.61; 'feeling': 0.68; 'bare': 0.84; 'compose': 0.84; 'subject:long': 0.84; 'walking': 0.91; 'serious': 0.97 X-Injected-Via-Gmane: http://gmane.org/ To: python-list@python.org From: Mark Lawrence Subject: Re: flaming vs accuracy [was Re: Performance of int/long in Python 3] Date: Sun, 31 Mar 2013 13:55:33 +0100 References: <5153a12d$0$29998$c3e8da3$5496439d@news.astraweb.com> <5153d313$0$29984$c3e8da3$5496439d@news.astraweb.com> <0b6be19c-ff11-4e24-a7dc-fec0af411393@kw7g2000pbb.googlegroups.com> <5153f5ce$0$29984$c3e8da3$5496439d@news.astraweb.com> <11ef1d36-0783-4cb2-b29f-9ae573ed7e47@googlegroups.com> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Gmane-NNTP-Posting-Host: host-92-18-43-197.as13285.net User-Agent: Mozilla/5.0 (Windows NT 6.0; rv:17.0) Gecko/20130307 Thunderbird/17.0.4 In-Reply-To: X-Antivirus: avast! (VPS 130331-0, 31/03/2013), Outbound message X-Antivirus-Status: Clean X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 46 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1364734495 news.xs4all.nl 6844 [2001:888:2000:d::a6]:52798 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:42366 On 31/03/2013 08:35, jmfauth wrote: > ------ > > Neil Hodgson: > > "The counter-problem is that a French document that needs to include > one mathematical symbol (or emoji) outside Latin-1 will double in size > as a Python string." > > Serious developers/typographers/users know that you can not compose > a text in French with "latin-1". This is now also the case with > German (Germany). > > --- > > Neil's comment is correct, > >>>> sys.getsizeof('a' * 1000 + 'z') > 1026 >>>> sys.getsizeof('a' * 1000 + '€') > 2040 > > This is not really the problem. "Serious users" may > notice sooner or later, Python and Unicode are walking in > opposite directions (technically and in spirit). > >>>> timeit.repeat("'a' * 1000 + 'ẞ'") > [1.1088995672090292, 1.0842266613261913, 1.1010779011941594] >>>> timeit.repeat("'a' * 1000 + 'z'") > [0.6362570846925735, 0.6159128762502917, 0.6200501673623791] > > > (Just an opinion) > > jmf > I'm feeling very sorry for this horse, it's been flogged so often it's down to bare bones. -- If you're using GoogleCrap™ please read this http://wiki.python.org/moin/GoogleGroupsPython. Mark Lawrence