From: Chris Angelico
Newsgroups: comp.lang.python
Subject: Re: Unicode 7
Date: Fri, 2 May 2014 19:24:32 +1000

On Fri, May 2, 2014 at 7:16 PM, Ben Finney wrote:
> Chris Angelico writes:
>
>> On Fri, May 2, 2014 at 6:08 PM, Steven D'Aprano
>> wrote:
>> > ... even *Americans* cannot represent all their common characters in
>> > ASCII, let alone specialised characters from mathematics, science,
>> > the printing industry, and law.
>>
>> Aside: What additional characters does law use that aren't in ASCII?
>> Section § and paragraph ¶ are used frequently, but you already
>> mentioned the printing industry. Are there other symbols?
>
> ASCII does not contain “©” (U+00A9 COPYRIGHT SIGN) nor “®” (U+00AE
> REGISTERED SIGN), for instance.

Heh! I forgot about those. U+00A9 in particular has gone so mainstream
that it's easy to think of it not as "I'm going to switch to my 'British
English + Legal' dictionary now" and just as "This is a critical part of
the basic dictionary".

ChrisA