Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #87201 > unrolled thread

Re: Letter class in re

Started byWolfgang Maier <wolfgang.maier@biologie.uni-freiburg.de>
First post2015-03-09 15:04 +0100
Last post2015-03-09 15:04 +0100
Articles 1 — 1 participant

Back to article view | Back to comp.lang.python

This discussion starts older than the indexed window; earlier articles aren't shown. The article labeled Started by below is the oldest one visible, not the original post.


Contents

  Re: Letter class in re Wolfgang Maier <wolfgang.maier@biologie.uni-freiburg.de> - 2015-03-09 15:04 +0100

#87201 — Re: Letter class in re

FromWolfgang Maier <wolfgang.maier@biologie.uni-freiburg.de>
Date2015-03-09 15:04 +0100
SubjectRe: Letter class in re
Message-ID<mailman.210.1425909917.21433.python-list@python.org>
On 03/09/2015 02:33 PM, Albert-Jan Roskam wrote:
> --------------------------------------------
> On Mon, 3/9/15, Tim Chase <python.list@tim.thechases.com> wrote:
>
> "[^\d\W_]+" means something like "one or more (+) of 'not (a digit, a non-word, an underscore)'.
>

interesting (using Python3.4 and
U+2188 	ROMAN NUMERAL ONE HUNDRED THOUSAND 	ↈ):

 >>> re.search('[^\d\W_]+', '\u2188', re.I | re.U)
<_sre.SRE_Match object; span=(0, 1), match='ↈ'>

ↈ and at least some other Nl (letter numbers) category characters seem 
to be part of \w (not part of \W).

Would that be considered a bug ?

Wolfgang

[toc] | [standalone]


Back to top | Article view | comp.lang.python


csiph-web