Path: csiph.com!usenet.pasdenom.info!aioe.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed3.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
Date: Thu, 10 Jul 2014 15:08:23 +0100
From: MRAB <python@mrabarnett.plus.com>
User-Agent: Mozilla/5.0 (Windows NT 6.3; WOW64; rv:24.0) Gecko/20100101 Thunderbird/24.6.0
MIME-Version: 1.0
To: python-list@python.org
Subject: Re: Why is it different about '\s' Matches whitespace and Equivalent to [\t\n\r\f]?
References: <1e8dbd65-bd19-4b9d-a7ec-961e8304ace0@googlegroups.com> <mailman.11722.1404991086.18130.python-list@python.org> <7593d956-f202-4d1c-9e35-1269ab3dda57@googlegroups.com>
In-Reply-To: <7593d956-f202-4d1c-9e35-1269ab3dda57@googlegroups.com>
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.11726.1405001309.18130.python-list@python.org>
Lines: 27
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:74304

On 2014-07-10 14:32, fl wrote:
> On Thursday, July 10, 2014 7:18:01 AM UTC-4, MRAB wrote:
>> It's equivalent to [ \t\n\r\f], i.e. it also includes a space, so
>> either the tutorial is wrong, or you didn't look closely enough. :-)
>>
>> The string starts with ' ', not '\t'.
>>
>> The string starts with ' ', which isn't in the character set.
>>
> The '\s' description is on link:
>
> http://www.tutorialspoint.com/python/python_reg_expressions.htm
>
I can see that the space is missing. It should say:

     \s    Matches whitespace. Equivalent to [ \t\n\r\f].

> Could you give me an example to use the equivalent pattern?
>
(I'm using Python 3.4, which is why the match object looks different.)

 >>> import re
 >>> re.match(r'\s*\d\d*$', '   111')
<_sre.SRE_Match object; span=(0, 6), match='   111'>
 >>> re.match(r'[ \t\n\r\f]*\d\d*$', '   111')
<_sre.SRE_Match object; span=(0, 6), match='   111'>