From: Robin Becker
Subject: Re: python 3.3 repr
Date: Fri, 15 Nov 2013 15:39:04 +0000
To: python-list@python.org
Newsgroups: comp.lang.python

.........
>
> Dealing with bytes and Unicode is complicated, and the 2->3 transition is not easy, but let's please not spread the misunderstanding that somehow the Flexible String Representation is at fault. However you store Unicode code points, they are different than bytes, and it is complex having to deal with both. You can't somehow make the dichotomy go away, you can only choose where you want to think about it.
>
> --Ned.
.......

I don't think that's what I said; the flexible representation is just an added complexity that has come about because of the wish to store strings in a compact way. The need for such complexity comes from the unicode type itself (especially its storage requirements), which necessitated some remedial action.

There's no point in fighting the change to using unicode. The type wasn't required for any technical reason, since other languages didn't go this route and are reasonably OK, but there's no doubt the change made things more difficult.
-- 
Robin Becker
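
For anyone following the thread, both points (code points are not bytes, and the flexible representation exists to keep storage compact) can be checked at the prompt. This is only a minimal sketch, assuming CPython 3.3 or later: the per-character figures come from sys.getsizeof and are an implementation detail, not something the language guarantees. The helper name per_char_cost is made up for illustration.

```python
import sys

euro = "\u20ac"          # EURO SIGN, a single code point in the BMP
face = "\U0001F600"      # a single code point outside the BMP

# A str is a sequence of code points; bytes only appear once you encode.
print(len(euro), len(euro.encode("utf-8")))   # 1 code point, 3 bytes
print(len(face), len(face.encode("utf-8")))   # 1 code point, 4 bytes


def per_char_cost(ch, n=100):
    """Bytes added to a string of n copies of ch by appending one more copy.

    Hypothetical helper: it relies on sys.getsizeof, so the result reflects
    CPython's internal storage and will differ (or fail) on other
    implementations.
    """
    return sys.getsizeof(ch * (n + 1)) - sys.getsizeof(ch * n)


# With the flexible representation, the widest code point in a string
# decides how many bytes every character in that string occupies.
for label, ch in (("ASCII", "a"), ("BMP", euro), ("non-BMP", face)):
    print(label, per_char_cost(ch))
```

On a CPython 3.3+ build I would expect the loop to report 1, 2 and 4 bytes per character respectively, which is the compactness the flexible representation was introduced for; the str/bytes split shown in the first two prints is there regardless of how the interpreter stores the code points.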