Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!xlned.com!feeder7.xlned.com!newsfeed.xs4all.nl!newsfeed1a.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.008 X-Spam-Evidence: '*H*': 0.98; '*S*': 0.00; 'subject:Python': 0.06; 'string': 0.09; '32-bit': 0.09; 'bash': 0.09; 'cc:addr:python- list': 0.11; 'python': 0.11; 'programs.': 0.14; '(just': 0.16; '8-bit': 0.16; 'from:addr:rosuav': 0.16; 'from:name:chris angelico': 0.16; 'integers.': 0.16; 'demonstrate': 0.16; 'wrote:': 0.18; "python's": 0.19; 'cc:addr:python.org': 0.22; 'text.': 0.24; 'cc:2**0': 0.24; 'nearly': 0.26; 'header:In-Reply-To:1': 0.27; 'point': 0.28; 'am,': 0.29; 'unix': 0.29; 'message- id:@mail.gmail.com': 0.30; 'program,': 0.31; 'subject:some': 0.31; 'testing.': 0.31; 'another': 0.32; 'text': 0.33; 'linux': 0.33; 'fri,': 0.33; 'received:google.com': 0.35; 'sequence': 0.36; 'rich': 0.38; 'itself': 0.39; 'skip:u 10': 0.60; "you're": 0.61; 'talking': 0.65; 'here': 0.66; 'treats': 0.84; 'understand,': 0.84; 'to:none': 0.92 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:cc :content-type; bh=0BqumP2Bp1GiFNDMKS9FkKZlPnSs9mG9eZvPJkYHaBc=; b=WD1REzyuuAxIZSDdg+IJs3vCyjdwzmfPBR8kTK77w1aDKoAwjRAVogXWrKnpvSvDVw /71p+LMxVhEa9ZgJ9nX4dxdTyUdbJvUlERmrjLDEJDKlOvxXt20gkJKVNvCLhZE6T9S1 AB16161QVuO2sVWQGKHBb/cnxsS4Rkimx59ZplJrgy+UGFtyUZYaHkmypT1Tme9hrYgm Gsxy/xg5jpRcr75Y3pHGhrhpIvAnaUSWXUeosNdKTvu2K+esQswmZFXazg2HTt7UTbiO UBbW4zK2q5S6u+WPwtVwRZT/7slAzXs+EeJOtjlvu8Uxm9Z07pA00+NTVQBmcO/G35ei Krjw== MIME-Version: 1.0 X-Received: by 10.220.81.194 with SMTP id y2mr12835967vck.29.1401989323176; Thu, 05 Jun 2014 10:28:43 -0700 (PDT) In-Reply-To: <87ha3zti2h.fsf@elektro.pacujo.net> References: <538a8f48$0$29978$c3e8da3$5496439d@news.astraweb.com> <538bcfff$0$29978$c3e8da3$5496439d@news.astraweb.com> <538C5BB8.1020702@chamonix.reportlab.co.uk> <538f1a61$0$29978$c3e8da3$5496439d@news.astraweb.com> <53902bb1$0$11109$c3e8da3@news.astraweb.com> <87wqcvu20h.fsf@elektro.pacujo.net> <7b3543f6-6f62-49c5-abdc-e2783fd6d629@googlegroups.com> <87oay7tnxt.fsf@elektro.pacujo.net> <53908dd0$0$29978$c3e8da3$5496439d@news.astraweb.com> <87ha3zti2h.fsf@elektro.pacujo.net> Date: Fri, 6 Jun 2014 03:28:43 +1000 Subject: Re: Python 3.2 has some deadly infection From: Chris Angelico Cc: "python-list@python.org" Content-Type: text/plain; charset=UTF-8 X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 14 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1401989331 news.xs4all.nl 2852 [2001:888:2000:d::a6]:46573 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:72737 On Fri, Jun 6, 2014 at 2:52 AM, Marko Rauhamaa wrote: > That linux text is not the same thing as Python's text. Conceptually, > Python text is a sequence of 32-bit integers. Linux text is a sequence > of 8-bit integers. Point of terminology: Linux is the kernel, everything you say below here is talking about particular programs. From what I understand, bash (just another Unix program) treats strings as sequences of codepoints, just as Python does; though its string manipulation is not nearly as rich as Python's, so it's harder to prove. Python is itself a Unix program, so you can do the exact same proofs and demonstrate that Linux is clearly Unicode-aware. It's not Linux you're testing. ChrisA