Path: csiph.com!usenet.pasdenom.info!aioe.org!rt.uk.eu.org!newsfeed.xs4all.nl!newsfeed4.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.063 X-Spam-Evidence: '*H*': 0.88; '*S*': 0.01; 'essentially': 0.04; 'indexing': 0.07; 'subject:file': 0.07; 'string': 0.09; 'received:80.91': 0.09; 'received:80.91.229': 0.09; 'received:gmane.org': 0.09; 'received:list': 0.09; 'jan': 0.12; 'received:80.91.229.3': 0.16; 'received:plane.gmane.org': 0.16; 'reedy': 0.16; 'subject:String': 0.16; 'demonstrate': 0.16; 'wrote:': 0.18; 'examples': 0.20; 'work,': 0.20; 'header:User- Agent:1': 0.23; 'replace': 0.24; 'unicode': 0.24; 'least': 0.26; 'header:X-Complaints-To:1': 0.27; 'header:In-Reply-To:1': 0.27; 'reply.': 0.31; 'apparently': 0.31; 'subject:the': 0.34; 'something': 0.35; 'there': 0.35; 'thanks': 0.36; 'possible': 0.36; 'wrong': 0.37; 'too': 0.37; 'being': 0.38; 'to:addr:python- list': 0.38; 'pm,': 0.38; 'to:addr:python.org': 0.39; 'received:org': 0.40; 'space': 0.40; 'new': 0.61; 'success': 0.61; 'received:173': 0.61; 'save': 0.62; "you've": 0.63; 'show': 0.63; 'overall': 0.69; 'saving': 0.69; 'respect': 0.70; 'goal': 0.75; 'behavior': 0.77; 'goals': 0.81; 'batchelder': 0.84; 'developed.': 0.84; 'penalty': 0.84; 'received:fios.verizon.net': 0.84; 'sensibly': 0.91; 'contrary': 0.95; 'exceeded': 0.97 X-Injected-Via-Gmane: http://gmane.org/ To: python-list@python.org From: Terry Reedy Subject: Re: Chardet, file, ... and the Flexible String Representation Date: Mon, 09 Sep 2013 16:47:43 -0400 References: <4ce85ea8-4a4c-46cf-a546-ad999576a5f7@googlegroups.com> <04abbe99-ca1e-40b5-86c7-64b0e5d9de9c@googlegroups.com> <522DF982.7060106@nedbatchelder.com> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Gmane-NNTP-Posting-Host: pool-173-75-251-66.phlapa.fios.verizon.net User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:17.0) Gecko/20130801 Thunderbird/17.0.8 In-Reply-To: <522DF982.7060106@nedbatchelder.com> X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 25 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1378759675 news.xs4all.nl 16012 [2001:888:2000:d::a6]:39778 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:53891 On 9/9/2013 12:38 PM, Ned Batchelder wrote: > jmf, thanks for your reply. You've calmed my fears that there is > something wrong with the Flexible String Representation. None of the > examples you show demonstrate any behavior contrary to the Unicode spec. The goals of the new unicode implementation: 1. one implementation on all platforms, working the same on all platforms. 2. works correctly 3. O(1) indexing 4. save as much space as sensibly possible 5. not too much time penalty for the space saving. The new implementation succeeded on all points. It exceeded the goal for 5. With much optimization work, there essentially is no overall time penalty left. Jmf's size examples show success with respect to goal 4. He apparently disagrees with that goal and would replace it with something else. At least some of his time examples show that saving space can save time, as was predicted when the FSR was being developed. -- Terry Jan Reedy