Re: Should stdlib files contain 'narrow non breaking space' U+202F?

From Chris Angelico <rosuav@gmail.com>
Newsgroups comp.lang.python
Subject Re: Should stdlib files contain 'narrow non breaking space' U+202F?
Date Fri, 18 Dec 2015 17:51:32 +1100
Message-ID <mailman.45.1450421501.30845.python-list@python.org> (permalink)
References <n4vf3d$rer$1@ger.gmane.org> <CAPTjJmoa5dBiDJ5zDu4jEy4MjOm6vp=J3zLqJw7ZfEUf9da=DQ@mail.gmail.com> <n509h0$p6h$1@ger.gmane.org>



On Fri, Dec 18, 2015 at 5:36 PM, Terry Reedy <tjreedy@udel.edu> wrote:
> Last I knew, Guido still wanted stdlib files to be all-ascii, except
> possibly in special cases. There is no good reason I can think of for there
> to be an invisible non-ascii space in a comment.  It strikes me as most
> likely an accident (typo) that should be fixed.  I suspect the same of most
> of the following.  Perhaps you should file an issue (and patch?) on the
> tracker.

You're probably right on that one. Here are the others, along with the
script I used to find them.

import os

# Walk the current directory and report every line containing non-ASCII
# bytes in non-test .py files.
for root, dirs, files in os.walk("."):
    if "test" in root: continue  # Skip test directories
    for fn in files:
        if not fn.endswith(".py"): continue
        if "test" in fn: continue  # Skip test modules
        with open(os.path.join(root, fn), "rb") as f:
            for l, line in enumerate(f):  # Line numbers are 0-based
                try:
                    line.decode("ascii")
                    continue  # Ignore the ASCII lines
                except UnicodeDecodeError:
                    line = line.rstrip(b"\n")
                    try:
                        line = line.decode("UTF-8")
                    except UnicodeDecodeError:
                        # If it's not UTF-8 either, show it as b'...'
                        line = repr(line)
                    print("%s:%d: %s" % (fn, l, line))
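
Because some of these characters are invisible in the output (the U+202F space in particular), it can help to print code points and names as well. The following is only a sketch and not part of the script above; `describe_non_ascii` is a hypothetical helper name, and it assumes the line has already been decoded to `str`.

```python
import unicodedata

def describe_non_ascii(text):
    """Return (column, 'U+XXXX', name) for each non-ASCII character in text.

    Hypothetical helper for illustration; columns are 0-based.
    """
    found = []
    for col, ch in enumerate(text):
        if ord(ch) > 127:
            # Fall back to a placeholder for characters without a Unicode name
            name = unicodedata.name(ch, "<unnamed>")
            found.append((col, "U+%04X" % ord(ch), name))
    return found

# The comment from connection.py, with the narrow no-break space written
# as an escape so it is visible here.
print(describe_non_ascii("# Issue #\u202f20540: concatenate before"))
# → [(9, 'U+202F', 'NARROW NO-BREAK SPACE')]
```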


shlex.py:37:             self.wordchars += ('ßàáâãäåæçèéêëìíîïðñòóôõöøùúûüýþÿ'
shlex.py:38:                                'ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖØÙÚÛÜÝÞ')
functools.py:7: # and Łukasz Langa <lukasz at langa.pl>.
heapq.py:34: [explanation by François Pinard]
getopt.py:21: # Peter Åstrand <astrand@lysator.liu.se> added gnu_getopt().
sre_compile.py:26:     (0x69, 0x131), # iı
sre_compile.py:28:     (0x73, 0x17f), # sſ
sre_compile.py:30:     (0xb5, 0x3bc), # µμ
sre_compile.py:32:     (0x345, 0x3b9, 0x1fbe), # \u0345ιι
sre_compile.py:34:     (0x390, 0x1fd3), # ΐΐ
sre_compile.py:36:     (0x3b0, 0x1fe3), # ΰΰ
sre_compile.py:38:     (0x3b2, 0x3d0), # βϐ
sre_compile.py:40:     (0x3b5, 0x3f5), # εϵ
sre_compile.py:42:     (0x3b8, 0x3d1), # θϑ
sre_compile.py:44:     (0x3ba, 0x3f0), # κϰ
sre_compile.py:46:     (0x3c0, 0x3d6), # πϖ
sre_compile.py:48:     (0x3c1, 0x3f1), # ρϱ
sre_compile.py:50:     (0x3c2, 0x3c3), # ςσ
sre_compile.py:52:     (0x3c6, 0x3d5), # φϕ
sre_compile.py:54:     (0x1e61, 0x1e9b), # ṡẛ
sre_compile.py:56:     (0xfb05, 0xfb06), # ﬅﬆ
punycode.py:2: Written by Martin v. Löwis.
koi8_t.py:2: # http://ru.wikipedia.org/wiki/КОИ-8
__init__.py:0: # Copyright (C) 2005 Martin v. Löwis
client.py:737:         a Date representing the file’s last-modified time, a
client.py:739:         containing a guess at the file’s type. See also the
bdist_msi.py:0: # Copyright (C) 2005, 2006 Martin von Löwis
connection.py:399:             # Issue # 20540: concatenate before sending, to avoid delays due
message.py:531:                        filename=('utf-8', '', Fußballer.ppt'))
message.py:533:                        filename='Fußballer.ppt'))
request.py:181:     * geturl() — return the URL of the resource retrieved, commonly used to
request.py:184:     * info() — return the meta-information of the page, such as headers, in the
request.py:188:     * getcode() – return the HTTP status code of the response.  Raises URLError
dbapi2.py:2: # Copyright (C) 2004-2005 Gerhard Häring <gh@ghaering.de>
__init__.py:2: # Copyright (C) 2005 Gerhard Häring <gh@ghaering.de>

They're nearly all in comments; a few are in string literals.
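
The comment-versus-string split was judged by eye from the output above. One way to check it mechanically, offered only as a sketch, is to run the source through the standard `tokenize` module and count which token kinds contain non-ASCII text. `non_ascii_token_kinds` is a hypothetical name, and docstrings are reported as STRING tokens.

```python
import io
import tokenize

def non_ascii_token_kinds(source):
    """Map token-kind name -> number of tokens containing non-ASCII text.

    Hypothetical helper for illustration; `source` is Python source as str.
    """
    counts = {}
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if any(ord(ch) > 127 for ch in tok.string):
            kind = tokenize.tok_name[tok.type]
            counts[kind] = counts.get(kind, 0) + 1
    return counts

# Two lines modelled on the listing: a comment and a string literal.
sample = "# Written by Martin v. L\u00f6wis.\nname = 'Fu\u00dfballer.ppt'\n"
print(non_ascii_token_kinds(sample))
```

A file that does not tokenize cleanly would raise an exception here, so a real run over a whole tree would need error handling around the call.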

I would be inclined to ASCIIfy the apostrophes, dashes, and the
connection.py space that started this thread. People's names, URLs,
and demonstrative characters I'm more inclined to leave. Agreed?
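
A minimal sketch of that selective clean-up, assuming the only characters to replace are the ones seen in the listing (U+2019, U+2013, U+2014 and U+202F). The table and the `asciify_punctuation` name are hypothetical, and this is not a patch anyone has proposed; names, URLs and demonstrative characters pass through unchanged.

```python
# Hypothetical mapping covering only the punctuation found in the listing.
ASCII_REPLACEMENTS = str.maketrans({
    "\u2019": "'",  # right single quotation mark (used as an apostrophe)
    "\u2013": "-",  # en dash
    "\u2014": "-",  # em dash
    "\u202f": " ",  # narrow no-break space
})

def asciify_punctuation(text):
    """Replace the listed punctuation with ASCII; leave everything else alone."""
    return text.translate(ASCII_REPLACEMENTS)

print(asciify_punctuation("containing a guess at the file\u2019s type"))
# → containing a guess at the file's type
print(asciify_punctuation("# Copyright (C) 2005 Martin v. L\u00f6wis"))
# → # Copyright (C) 2005 Martin v. Löwis
```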

ChrisA
