Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!xlned.com!feeder3.xlned.com!newsfeed.xs4all.nl!newsfeed1.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.018 X-Spam-Evidence: '*H*': 0.96; '*S*': 0.00; 'mrab': 0.05; 'much!': 0.05; 'operator,': 0.09; 'pep': 0.09; 'url:unicode': 0.09; 'python': 0.11; 'identifier.': 0.16; 'identifiers': 0.16; 'subject:unicode': 0.16; 'underscore.': 0.16; 'wrote:': 0.18; 'number)': 0.24; 'unicode': 0.24; 'header:In-Reply-To:1': 0.27; '(this': 0.29; 'generally': 0.29; 'message-id:@mail.gmail.com': 0.30; '25,': 0.31; 'explained': 0.31; 'follows': 0.31; 'knows': 0.35; 'but': 0.35; 'received:google.com': 0.35; 'thanks': 0.36; "i'll": 0.36; 'url:org': 0.36; 'should': 0.36; 'to:addr:python- list': 0.38; 'pm,': 0.38; 'to:addr:python.org': 0.39; 'extended': 0.61; 'name': 0.63; 'happen': 0.63; '(that': 0.65; 'mar': 0.68; 'chinese': 0.74; 'characters,': 0.84 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; bh=y5U4I7dviIIqWSYm8bWhRwJ3Rd08zJqHQPan+NJNmxo=; b=PHSSiZ1VSFpnbBOWxlZ6jkeXAMB7ps65ZuJgPvtci7Rixf3k2gG7UHtUMERQDyV0hv 3QAM6YBGEHm42niAmqhBSCFwCGrsk24ITJAk42DA12v7McIVpaHZyX9t/dWHtd8LLSJp /iqu6f7l+et+CA0867h2i2HygsB86A56FKeb4mvPt/RmfTtRhPulG4OoBZopyWbOh58I 5SjMwFHxpKslkTzGBidfvR3tYDrzgwq6YVbV9B5v084TLfN39HIIA1gsE3JQmcn5NO/j 1Ej8QCyMgr+xElR2dGr/P757rayAS5mlYtz2m4E0Rd4aUqSQ/plU9h8GNTqj8iNNxQnp cc6A== X-Received: by 10.66.171.76 with SMTP id as12mr81013641pac.52.1395776989566; Tue, 25 Mar 2014 12:49:49 -0700 (PDT) MIME-Version: 1.0 In-Reply-To: <5331D902.3030902@gmail.com> References: <5331D902.3030902@gmail.com> From: Ian Kelly Date: Tue, 25 Mar 2014 13:49:09 -0600 Subject: Re: unicode as valid naming symbols To: Python Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 24 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1395776993 news.xs4all.nl 2839 [2001:888:2000:d::a6]:42730 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:69062 On Tue, Mar 25, 2014 at 1:29 PM, Mark H Harris wrot= e: > On 3/25/14 2:24 PM, MRAB wrote: >> It's explained in PEP 3131. >> >> Basically, a name should to start with a letter (this has been extended >> to include Chinese characters, etc) or an underscore. >> >> =CE=BB is a classified as Lowercase_Letter. >> >> =E2=88=9A is classified as Math_Symbol. > > Thanks much! I'll note that for improvements. Any unicode symbol (tha= t > is not a number) should be allowed as an identifier. =E2=88=9A cannot be used in identifiers for the same reasons that + and ~ cannot: identifiers are intended to be alphanumeric. =E2=88=9A is not currently the name of an operator, but who knows what may happen in the future? Python generally follows Annex 31 of the Unicode standard in this regard: http://www.unicode.org/reports/tr31/