Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.013 X-Spam-Evidence: '*H*': 0.97; '*S*': 0.00; 'subject:text': 0.05; 'character,': 0.09; 'subject:question': 0.10; 'url:blog': 0.10; 'cc:addr:python-list': 0.11; 'footnote': 0.16; 'from:addr:rosuav': 0.16; 'from:name:chris angelico': 0.16; 'vulgar': 0.16; 'wrote:': 0.18; 'wed,': 0.18; 'trying': 0.19; 'cc:addr:python.org': 0.22; 'expanded': 0.24; 'unicode': 0.24; 'cc:2**0': 0.24; 'header:In- Reply-To:1': 0.27; 'am,': 0.29; 'character': 0.29; "doesn't": 0.30; 'message-id:@mail.gmail.com': 0.30; 'received:google.com': 0.35; 'there': 0.35; 'url:org': 0.36; 'more': 0.64; 'here': 0.66; 'mar': 0.68; '2015': 0.84; 'url:03': 0.84; 'to:none': 0.92 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:cc :content-type:content-transfer-encoding; bh=Bs697IEjy0JOrUa4F4fdKcu0zZzJ5yG+zRyKrt3Bi3Q=; b=yH52Q9NbNPX/CCifrcm87vpN+6gIzXWaI8uvVpq1SRz7vZvlr7mUz3y/GOR34i85EW EaGp0vbPmbngo0EtxC39cv0oXnd97uwtw+eIoEeR/CvYAOUsgaPoFc5haI2jWaRrLkoo COFLBhQr47ZX7gUBHgjkYVKGWVMi0f3HPpOhVfw4561LHsWJV86Wljtqp1gNsuB/BCrF +59RaijpA77ybm7NN85xuFTDCF7T1YSNO7B5X5cCzpMeA7Mvks4uHWxe67eE4YAOLfCe 1sEGr78mNVUBIrzlwFaFl0ZSgITQaFp0bjJSJFu09dRR5pqm7bQCKx+bAfHuo1slxTl3 mpkQ== MIME-Version: 1.0 X-Received: by 10.50.4.40 with SMTP id h8mr30769868igh.34.1425408233638; Tue, 03 Mar 2015 10:43:53 -0800 (PST) In-Reply-To: <9169f3b1-2ac7-42a3-8033-584f84b88a1f@googlegroups.com> References: <54EC5FA4.6070703@davea.name> <201502241455.t1OEtffT016452@fido.openend.se> <201502241507.t1OF7aUm018883@fido.openend.se> <201502241524.t1OFO09k022270@fido.openend.se> <201502241620.t1OGKf4n002146@fido.openend.se> <54ECB134.5090304@davea.name> <201502241945.t1OJjshO013092@fido.openend.se> <201502241957.t1OJvrJS015604@fido.openend.se> <9169f3b1-2ac7-42a3-8033-584f84b88a1f@googlegroups.com> Date: Wed, 4 Mar 2015 05:43:53 +1100 Subject: Re: Newbie question about text encoding From: Chris Angelico Cc: "python-list@python.org" Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.19 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 13 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1425408236 news.xs4all.nl 2959 [2001:888:2000:d::a6]:56726 X-Complaints-To: abuse@xs4all.nl Path: csiph.com!usenet.pasdenom.info!bete-des-vosges.org!feed.ac-versailles.fr!nerim.net!novso.com!newsfeed.xs4all.nl!newsfeed3.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Xref: csiph.com comp.lang.python:86859 On Wed, Mar 4, 2015 at 5:03 AM, Rustom Mody wrote: > What I was trying to say expanded here > http://blog.languager.org/2015/03/whimsical-unicode.html > [Hope the word 'whimsical' is less jarring and more accurate than 'gibbe= rish'] Re footnote #4: =C2=BD is a single character for compatibility reasons. =E2=85=9F=E2=82=81=E2=82=80=E2=82=80 doesn't need to be a single character,= because there are countably infinite vulgar fractions and only 0x110000 Unicode characters. ChrisA