Path: csiph.com!newsfeed.hal-mli.net!feeder3.hal-mli.net!newsfeed.hal-mli.net!feeder1.hal-mli.net!newsfeed.xs4all.nl!newsfeed3.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.030 X-Spam-Evidence: '*H*': 0.94; '*S*': 0.00; 'subject:Python': 0.06; 'binary': 0.07; 'ascii': 0.09; 'escape': 0.09; 'cc:addr:python- list': 0.11; 'cc:name:python list': 0.16; 'really?': 0.16; 'subject:More': 0.16; 'subject:Unicode': 0.16; 'cc:addr:python.org': 0.22; 'bytes': 0.24; 'guys': 0.24; 'cc:2**0': 0.24; 'long,': 0.26; 'defined': 0.27; 'header:In-Reply- To:1': 0.27; 'chris': 0.29; 'characters': 0.30; 'message- id:@mail.gmail.com': 0.30; 'apparently': 0.31; 'subject:About': 0.31; 'text': 0.33; 'community': 0.33; 'problem': 0.35; 'problem.': 0.35; 'something': 0.35; 'but': 0.35; 'received:google.com': 0.35; 'doing': 0.36; "didn't": 0.36; 'fact': 0.38; 'that,': 0.38; 'does': 0.39; 'bad': 0.39; 'subject:"': 0.60; "you'll": 0.62; 'refer': 0.63; 'different': 0.65; '(generic': 0.84; '*really*': 0.84; 'habit': 0.91 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=HC9hUAN7mKhebi4LGiB1zQQRxp8qN/3mHhqp2tUTav0=; b=mBTXVbcTOJQbwEHp68jvQc5t2UxvN8WULRYs56cyPFrRj9SSbSxjAwfdtYFqUmx5kP aasr3FwlPQyWMFkOtLTHhqy9P3vb+DeJa2dnEmsuJxZMu6jZYe8trrRJ/UFvklZqlPmK rfxfO/wNLj5IYuJqgxoF9JZ0QpV9lpfvmb8g4UH2pD4q7S27nqOaXHorZ+KXoWVyvq8h KypYTYUvHAwpeClkv8bPG89rOiMGTLjnV8qrSJjqfmJQ+mE9H1zOZlEx1bWTS9xyvmF3 qcUphW1RQoeoH/BHYAr32RjJONbfygW5fqqRZ8itzmCXlpJoxE3ZuXs5hQ0lmJ+c9KSD i3sw== MIME-Version: 1.0 X-Received: by 10.194.219.232 with SMTP id pr8mr13406961wjc.6.1389036655215; Mon, 06 Jan 2014 11:30:55 -0800 (PST) In-Reply-To: <52CAF722.6010301@stoneleaf.us> References: <52C9FD02.3080109@stoneleaf.us> <52cae78d$0$29971$c3e8da3$5496439d@news.astraweb.com> <52CAF722.6010301@stoneleaf.us> Date: Mon, 6 Jan 2014 13:30:55 -0600 Subject: Re: "More About Unicode in Python 2 and 3" From: Mark Janssen To: Ethan Furman Content-Type: text/plain; charset=ISO-8859-1 Cc: Python List X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 23 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1389036656 news.xs4all.nl 2867 [2001:888:2000:d::a6]:38136 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:63324 >> Chris didn't say "bytes and ascii data", he said "bytes and TEXT". >> Text != "ascii data", and the fact that some people apparently think it >> does is pretty much the heart of the problem. > > The heart of a different problem, not this one. The problem I refer to is > that many binary formats have well-defined ascii-encoded text tidbits. Really? If people are using binary with "well-defined ascii-encoded tidbits", they're doing something wrong. Perhaps you think escape characters "\n" are "well defined tidbits", but YOU WOULD BE WRONG. The purpose of binary is to keep things raw. WTF? You guys are so strange. > >> If you (generic you) don't get that, you'll have a bad time. I mean >> *really* >> get it, deep down in the bone. The long, bad habit of thinking as >> ASCII-encoded bytes as text is the problem here. I think the whole forking community is confused at because of your own arrogance. Foo(l)s. markj