qMgQyM+vUiV7F3rYjKV2boQ9J/W2ssWqTWV2zywElDFsuxj8apnat5wYKly3k+UMhjiX z4ffO65a3TAaM6CIgkFyzmCnmTeDHpFYJ4sti+2mZf9UUO5YIvAyO/+QMGl+ljBQH8/p i/bQ== MIME-Version: 1.0 X-Received: by 10.58.85.65 with SMTP id f1mr24889950vez.20.1401616776873; Sun, 01 Jun 2014 02:59:36 -0700 (PDT) In-Reply-To: <538aed19$0$29978$c3e8da3$5496439d@news.astraweb.com> References: <8761kozg98.fsf@rudin.co.uk> <7deeb0f8-f4f4-4661-b0bb-f9481b6ef35a@googlegroups.com> <878upjwewt.fsf@elektro.pacujo.net> <7713af25-2ed4-49b3-9457-b352f9486c78@googlegroups.com> <87lhth9i38.fsf@elektro.pacujo.net> <538aed19$0$29978$c3e8da3$5496439d@news.astraweb.com> Date: Sun, 1 Jun 2014 19:59:36 +1000 Subject: Re: IDE for python From: Chris Angelico Cc: "python-list@python.org" Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 37 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1401616785 news.xs4all.nl 2965 [2001:888:2000:d::a6]:35500 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:72369 On Sun, Jun 1, 2014 at 7:06 PM, Steven D'Aprano wrote: > On Sun, 01 Jun 2014 18:31:09 +1000, Chris Angelico wrote: > >> the better solution is to permit the full Unicode alphabet in >> identifiers... > > I'm not entirely sure about that. Full Unicode support in identifiers > such as URLs doesn't create a brand new vulnerability, but it does > increase it from a fairly minor problem to something *much* harder to > deal with. It's bad enough when somebody manages to fool you into going > to (say) app1e.com instead of apple.com, without also being at risk from > =D0=B0=D1=80=D1=80l=D0=B5, =D0=B0p=D1=80l=D0=B5, =D0=B0=D1=80pl=D0=B5 and= =D0=B0=D1=80=D1=80le (to mention just a few). 
At least nobody > can fake .com with .=D1=81=D0=BEm. > > To put it another way: > > py> =D0=B0=D1=80=D1=80l=D0=B5 =3D 23 > py> apple =3D 42 > py> assert =D0=B0=D1=80=D1=80l=D0=B5 =3D=3D apple > Traceback (most recent call last): > File "", line 1, in > AssertionError Yeah, that is a concern. But as you say, it's already possible to confuse rn with m (in many fonts) and i/l/1, and (on a different level) Foo, foo, _foo, _Foo, and FOO, or movement_Direction and movement_direction. If you saw one of those in one part of a program and another in another, you'd have to consume an annoying amount of mindspace to keep them separate. Note, incidentally, that I said "alphabet" rather than the entire Unicode character set. I do *not* support the use of, for instance, U+200B 'ZERO WIDTH SPACE' in identifiers, that's just stupid :) ChrisA
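
For anyone who wants to poke at the lookalike problem, here is a rough sketch of how mixed-script names like the ones quoted above could be flagged. The helper names scripts_used and looks_mixed are made up for illustration, and guessing the script from the first word of the Unicode character name is only a heuristic (the unicodedata module does not expose the real Script property). The last line just checks that Python already refuses U+200B in an identifier.

```python
import unicodedata


def scripts_used(identifier):
    """Return a rough set of script names for the letters in identifier.

    Heuristic only: the "script" is taken to be the first word of the
    Unicode character name, e.g. 'LATIN' or 'CYRILLIC'.
    """
    return {
        unicodedata.name(ch, "UNKNOWN").split()[0]
        for ch in identifier
        if ch.isalpha()
    }


def looks_mixed(identifier):
    """True if the letters of identifier appear to span several scripts."""
    return len(scripts_used(identifier)) > 1


latin = "apple"
# Cyrillic a, er, er, then Latin l, then Cyrillic ie: renders much like "apple"
fake = "\u0430\u0440\u0440l\u0435"

print(latin == fake)            # the two names are distinct strings
print(sorted(scripts_used(fake)))
print(looks_mixed(fake), looks_mixed(latin))

# Both spellings are legal identifiers; a zero width space is not.
print(fake.isidentifier())
print("foo\u200bbar".isidentifier())
```

A check like this catches names that mix alphabets, but not a name written entirely in one lookalike alphabet, so it narrows the problem rather than solving it.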