Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!news.albasani.net!feeder.erje.net!eu.feeder.erje.net!newsfeed.xs4all.nl!newsfeed2.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
In-Reply-To: <CALwzidkTRYbpeBCY7gxSXa=ZNimaAe69mGbT0d9sbehHfHgvOw@mail.gmail.com>
References: <lgsi07$k1p$1@speranza.aioe.org> <mailman.8531.1395775491.18130.python-list@python.org> <5331D902.3030902@gmail.com> <53321819$0$29994$c3e8da3$5496439d@news.astraweb.com> <lh1g3h$meg$1@speranza.aioe.org> <CALwzidk+4diadosJ0bDFTj-OjU9ib712iPMBVxvqZfxAsY2cJg@mail.gmail.com> <53393BA4.2080305@rece.vub.ac.be> <CALwzidkWc+jQtyN7-nFd4wZCryqLNfNc2okVL_kSbCBkE6nuQQ@mail.gmail.com> <5339C281.7080300@rece.vub.ac.be> <CALwzidkTRYbpeBCY7gxSXa=ZNimaAe69mGbT0d9sbehHfHgvOw@mail.gmail.com>
Date: Mon, 31 Mar 2014 23:58:40 -0400
Subject: Re: unicode as valid naming symbols
From: David Hutto <dwightdhutto@gmail.com>
To: Ian Kelly <ian.g.kelly@gmail.com>
Content-Type: multipart/alternative; boundary=001a11c3cefcf401d904f5f32e57
Cc: Python <python-list@python.org>
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.8764.1396324729.18130.python-list@python.org>
Lines: 307
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:69463

--001a11c3cefcf401d904f5f32e57
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

I personally believe it is hard for even a programming language to
overcome cultural learning styles and programmatic differences, because
of nurture vs. nature.

We can all program something that produces a similar return value, but
overcoming the nurturing the internet provides becomes imperative.

Rather than risk explaining it badly myself, I'll offer a few references
on how programmers, computer scientists, and electrical engineers
approach their end results, and why those results may still differ with
the mentality of the individual or group developing A.I. systems:

http://en.wikipedia.org/wiki/Ethnolinguistics

http://en.wikipedia.org/wiki/Cognitive_anthropology

http://en.wikipedia.org/wiki/Cognitive_science

The last one probably explains what I mean in more depth than the first
two.


On Mon, Mar 31, 2014 at 8:47 PM, Ian Kelly <ian.g.kelly@gmail.com> wrote:

> On Mon, Mar 31, 2014 at 1:31 PM, Antoon Pardon
> <antoon.pardon@rece.vub.ac.be> wrote:
> > Op 31-03-14 19:40, Ian Kelly schreef:
> >> That was an exaggeration on my part.  It wouldn't affect my job, as I
> >> wouldn't expect to ever actually have to maintain anything like the
> >> above.  My greater point though is that it damages Python's
> >> readability for no actual gain in my view.  There is nothing useful
> >> you can do with a name that is the U+1F4A9 character that you can't do
> >> just as easily with alphanumeric identifiers like pile_of_poo (or
> >> куча_фекалий if one prefers; that's auto-translated, so don't blame me
> >> if it's a poor translation). The kinds of symbols that we're talking
> >> about here aren't part of any writing systems, and so to incorporate
> >> them in *names* as if they were is an abuse of Unicode.
> >
> > Your argument doesn't have much weight. First of all it can be used
> > for just restricting names to the ASCII range.
>
> I disagree.  Non-ASCII written names are useful to anybody who prefers
> not to do all their programming in English.
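
For reference, the distinction being argued here can be checked directly.
A minimal sketch, assuming Python 3 (where PEP 3131 non-ASCII identifiers
apply); the `candidates` list is only illustrative and reuses the names
from the quoted text:

```python
import unicodedata

# Illustrative names taken from the quoted discussion.
candidates = ["pile_of_poo", "куча_фекалий", "\U0001F4A9"]

# Pair each name with whether Python accepts it as an identifier.
results = [(name, name.isidentifier()) for name in candidates]

for name, ok in results:
    # Also show the Unicode general category of the first character.
    category = unicodedata.category(name[0])
    print(ascii(name), category, "identifier" if ok else "not an identifier")
```

The two word-like names are accepted. The symbol is rejected because its
general category is So (Symbol, other), which the identifier rules do not
cover.
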
>
> > Second of all I
> > think a well-chosen symbolic name can be more readable than a
> > name in a character set you are not familiar with. A well-chosen
> > symbol will evoke a meaning for a lot of people. A name in a
> > character set you are not familiar with is just gibberish to
> > you.
>
> Well, this is the path taken by APL.  It has its supporters.  It's not
> known for being readable.
>
> >> I don't think the comparisons to decorators and the if-else operator
> >> are apt.
> >
> > I didn't make such a comparison. I just noted the arguments against
> > were similar.
>
> That's the comparison to which I was referring.
>
> >> First, because while those may degrade readability, they do
> >> so in a constrained way.  A decorator application is just the @ symbol
> >> and an identifier.
> >
> > And if abused, it can totally change the working of your function.
> > There is no guarantee that the function returned has any relation to
> > the original function. If that can't be a nightmare for readability,
> > I don't know what is.
>
> As Terry Reedy noted, this has nothing to do with the decorator
> syntax, so it isn't much of an argument against having such syntax.
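
To make the disagreement concrete, here is a sketch of a decorator that
returns an unrelated function, next to the same rebinding written without
the @ syntax. The names `replace_with_unrelated`, `add`, and `add_plain`
are hypothetical and not from the thread:

```python
def replace_with_unrelated(func):
    """Hypothetical decorator that ignores the function it receives."""
    def unrelated(*args, **kwargs):
        return "something else entirely"
    return unrelated


@replace_with_unrelated
def add(a, b):
    return a + b


def add_plain(a, b):
    return a + b

# The same replacement without decorator syntax: a plain rebinding.
add_plain = replace_with_unrelated(add_plain)

print(add(1, 2))
print(add_plain(1, 2))
```

Both calls return the replacement's value, which suggests the surprise
comes from what the wrapping function does rather than from the @ line
itself.
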
>
> >> The if-else is just three expressions separated by
> >> keywords.
> >
> > Yes, but if used without restraint in arbitrary expressions it will
> > make those expressions hard to understand.
>
> I don't disagree.  I hardly ever use it myself, certainly only if it
> can fit comfortably into one line, which is rare.  But it's still
> quite limited in syntactic scope.
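
A small illustration of both sides, assuming nothing beyond the standard
conditional expression. The function names and the thresholds are made up
for the example:

```python
def classify_nested(n):
    # Chained conditional expressions: legal, but harder to scan.
    return "negative" if n < 0 else "zero" if n == 0 else "small" if n < 10 else "large"


def classify_plain(n):
    # The same logic as ordinary statements.
    if n < 0:
        return "negative"
    if n == 0:
        return "zero"
    if n < 10:
        return "small"
    return "large"


for value in (-5, 0, 3, 50):
    print(value, classify_nested(value), classify_plain(value))
```

Both versions agree on every input. The one-line form stays within a
single expression, which is the "limited in syntactic scope" point, while
the chained form shows how quickly it gets hard to read.
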
>
> >> In the case of arbitrary Unicode identifiers, we're talking
> >> about approximately doubling the number of different characters (out
> >> of a continuously growing set) that could be used, many of which are
> >> easily confused with other characters. Of course the potential for
> >> confusion already exists, but that's no justification for aggravating
> >> it.
> >
> > So what if we double the number of different characters? I don't care
> > about the number of them, I care about how meaningful they are. And
> > as you say, confusion is already possible. A good programmer knows
> > how to deal with such possible confusion; that the number of
> > cases increases need not be a problem for those who care
> > about this.
>
> So tell me then, how would you deal with it?  In the case of script
> identifiers, it's often not hard to discern from context whether a
> particular character is e.g. a Latin h or a Cyrillic һ.  Assuming the
> original author wasn't being intentionally obfuscatory, if the rest of
> the identifier is Cyrillic then the character is probably also
> Cyrillic.  If it's a one-character identifier, then hopefully the rest
> of the module is consistent and you can guess from that.  If the
> identifier in question is just one symbol though, then you have a lot
> less context.
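
The "guess from context" approach described above can be roughly
approximated in code. This is only a sketch: `scripts_in` is a
hypothetical helper, and it guesses the script from the first word of
each character's Unicode name, since the standard library does not expose
the Unicode Script property:

```python
import unicodedata


def scripts_in(identifier):
    """Heuristic script guess: first word of each letter's Unicode name."""
    return {
        unicodedata.name(ch, "UNKNOWN").split()[0]
        for ch in identifier
        if ch.isalpha()
    }


latin_h = "h"             # U+0068
cyrillic_shha = "\u04bb"  # U+04BB, visually close to Latin h

print(unicodedata.name(latin_h))
print(unicodedata.name(cyrillic_shha))

# A Cyrillic first letter followed by Latin "ello" shows up as mixed.
print(sorted(scripts_in(cyrillic_shha + "ello")))

# A one-character identifier reports a single script, so there is
# nothing to compare it against.
print(sorted(scripts_in(cyrillic_shha)))
```

A mixed-script name can be flagged this way, but a single-character name
gives the check nothing to work with, which matches the "less context"
concern.
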
>
> >
> >> Second, at least in the case of decorators, while I don't dispute that
> >> they can harm readability, I think that in the majority of cases they
> >> actually help it.
> >
> > But that is not a fair comparison now, is it? What you are doing here
> > is comparing actual use to a worst-case doom scenario.
>
> I contend that there is no scenario with arbitrary Unicode identifiers
> where readability is improved.
>



-- 
Best Regards,
David Hutto
*CEO:* *http://www.hitwebdevelopment.com <http://www.hitwebdevelopment.com>*


--001a11c3cefcf401d904f5f32e57--