Path: csiph.com!news.swapon.de!fu-berlin.de!uni-berlin.de!not-for-mail From: Michael Torrie Newsgroups: comp.lang.python Subject: Re: psss...I want to move from Perl to Python Date: Sun, 31 Jan 2016 16:45:24 -0700 Lines: 25 Message-ID: References: <2a8dc773-87a1-4ffd-8b8f-a77f2f6ff693@googlegroups.com> Mime-Version: 1.0 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit X-Trace: news.uni-berlin.de d/rvIf7aJvSk85sYc0qpGgfDBvdBI3DzbQ6DdqOSRADA== Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.011 X-Spam-Evidence: '*H*': 0.98; '*S*': 0.00; 'subject:Python': 0.05; 'file)': 0.07; 'unicode,': 0.09; 'python': 0.10; '(say': 0.16; '(unlike': 0.16; 'buggy': 0.16; 'from:addr:torriem': 0.16; 'from:name:michael torrie': 0.16; 'in-memory': 0.16; 'long-time': 0.16; 'points)': 0.16; 'received:io': 0.16; 'received:psf.io': 0.16; 'sad': 0.16; 'troll': 0.16; 'usenet': 0.16; 'wrote:': 0.16; 'byte': 0.18; 'bytes': 0.18; '>>>': 0.20; 'am,': 0.23; 'seems': 0.23; "python's": 0.23; 'header:In-Reply-To:1': 0.24; "i've": 0.25; 'header:User-Agent:1': 0.26; 'handling': 0.27; '3.0': 0.27; 'correct': 0.28; 'behaviour': 0.29; 'thinks': 0.29; 'code': 0.30; "can't": 0.32; 'language.': 0.32; 'builds': 0.33; "he's": 0.33; 'message-id:@gmail.com': 0.34; 'file': 0.34; 'handle': 0.34; 'add': 0.34; 'list': 0.34; 'unicode': 0.35; 'but': 0.36; 'list,': 0.36; 'to:addr:python-list': 0.36; 'pm,': 0.36; 'subject:: ': 0.37; 'say': 0.37; 'received:org': 0.37; 'desired': 0.37; 'list.': 0.37; 'anything': 0.38; 'subject:from': 0.39; 'received:192': 0.39; 'to:addr:python.org': 0.40; 'still': 0.40; 'him': 0.60; 'your': 0.60; 'side': 0.62; 'email addr:gmail.com': 0.62; 'charset:windows-1252': 0.62; 'integrated': 0.72; '3.5.1': 0.84; 'banned': 0.84; 'ucs-2': 0.84; 'subject:want': 0.93; 'suffering': 0.93 X-Virus-Scanned: amavisd-new at torriefamily.org User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0 In-Reply-To: X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.20+ Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Xref: csiph.com comp.lang.python:102371 On 01/31/2016 03:34 PM, Fillmore wrote: > On 01/30/2016 05:26 AM, wxjmfauth@gmail.com wrote: > >>> Python 2 vs python 3 is anything but "solved". >> >> >> Python 3.5.1 is still suffering from the same buggy >> behaviour as in Python 3.0 . > Can you elaborate? Sad to say jmf is a long-time troll on this list who seems to thinks he understands Unicode better than he does. We've been round and round with him on this list over his issues and he's since been banned from the email part of this list, but still spams the Usenet side from time to time. If you're using NNTP, just add him to your kill file or block list. There's nothing to elaborate on. Python 3.5's integrated handling of unicode is the best I've seen in any language. It just works and you don't need to worry about it (unlike most other languages), other than to remember that anytime you take bytes into the program (say from a file) they must be "decoded" into unicode, and whenever you write bytes out (say to a file or socket) unicode strings must be "encoded" to a desired byte encoding, such as UTF-8. Python's in-memory representation of unicode strings is correct (unlike Python 2.7, which had UCS-2 narrow builds that can't handle non-BLM code points) for all unicode codepoints.