
Re: Python NBSP DWIM

To Skip Montanaro <skip.montanaro@gmail.com>
cc Tim Chase <python.list@tim.thechases.com>, Python <python-list@python.org>, lac@openend.se
From Laura Creighton <lac@openend.se>
Subject Re: Python NBSP DWIM
In-Reply-To Message from Skip Montanaro <skip.montanaro@gmail.com> of "Wed, 10 Jun 2015 09:28:24 -0500." <CANc-5UxaGMbY4BCDjuui4XXq=g2NKYXnYJAzgp9aNzDjOWZjyg@mail.gmail.com>
References <20150610082812.2ce887c3@bigbox.christie.dr><CANc-5UxaGMbY4BCDjuui4XXq=g2NKYXnYJAzgp9aNzDjOWZjyg@mail.gmail.com>
Date Wed, 10 Jun 2015 17:03:58 +0200
Newsgroups comp.lang.python
Message-ID <mailman.346.1433948652.13271.python-list@python.org>


In a message of Wed, 10 Jun 2015 09:28:24 -0500, Skip Montanaro writes:
>On Wed, Jun 10, 2015 at 8:28 AM, Tim Chase
><python.list@tim.thechases.com> wrote:
>> Is this a bug?
>
>Looks like it's been reported a few times with slightly different context:
>
>https://bugs.python.org/issue6537
>https://bugs.python.org/issue16623
>https://bugs.python.org/issue20491
>https://bugs.python.org/issue1390608
>
>The couple times it's come up in the context of str.split, it's been
>rejected, since the purpose of that method is to split words.
>
>Skip

In these Unicode days, this thinking may need to be revisited.  There
are many languages where whitespace does not separate words: either
words aren't separated at all or, as in Vietnamese, spaces separate
syllables, so entire words have spaces in them.
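
To make the point concrete, here is a minimal sketch, assuming Python 3,
where str.split() with no arguments splits on Unicode whitespace and so
treats U+00A0 (NBSP) as a separator.  The sample text and the helper name
split_ascii_whitespace are made up for illustration; they are not from
the bug reports above.

```python
import re

NBSP = "\u00a0"  # U+00A0 NO-BREAK SPACE

# Hypothetical sample: an NBSP glues "10" to "km", and the Vietnamese
# name "Việt Nam" is written with an ordinary space between syllables.
text = "10" + NBSP + "km from Việt Nam"

# Python 3 counts NBSP as whitespace, so a bare split() breaks on it.
print(NBSP.isspace())  # True
print(text.split())    # ['10', 'km', 'from', 'Việt', 'Nam']


def split_ascii_whitespace(s):
    """Split on runs of ASCII whitespace only, leaving NBSP in place.

    Illustrative helper (an assumption, not a stdlib function).
    """
    return [part for part in re.split(r"[ \t\n\r\f\v]+", s) if part]


print(split_ascii_whitespace(text))  # ['10\xa0km', 'from', 'Việt', 'Nam']
```

Either way "Việt Nam" comes back as two pieces, which is the Vietnamese
case: no choice of whitespace characters turns split() into a word
splitter for such text.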

Laura
