Path: csiph.com!usenet.pasdenom.info!aioe.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed6.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
Date: Mon, 19 Nov 2012 20:50:46 +0000
From: MRAB <python@mrabarnett.plus.com>
User-Agent: Mozilla/5.0 (Windows NT 5.1; rv:16.0) Gecko/20121026 Thunderbird/16.0.2
MIME-Version: 1.0
To: python-list@python.org
Subject: Re: Robust regex
References: <assp.06702d7ae4.92097A6A775D5147B1078E3F15430B92348EA96E@prato.activenetwerx.local>
In-Reply-To: <assp.06702d7ae4.92097A6A775D5147B1078E3F15430B92348EA96E@prato.activenetwerx.local>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Precedence: list
Reply-To: python-list@python.org
Newsgroups: comp.lang.python
Message-ID: <mailman.11.1353358434.29569.python-list@python.org>
Lines: 19
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:33556

On 2012-11-19 20:32, Joseph L. Casale wrote:
> Trying to robustly parse a string that will have key/value pairs separated
> by three pipes, where each additional key/value (if more than one exists)
> will be delineated by four more pipes.
>
>      string = 'key_1|||value_1||||key_2|||value_2'
>      regex = '((?:(?!\|\|\|).)+)(?:\|\|\|)((?:(?!\|\|\|).)+)(?:\|\|\|\|)?'
>
> I am not convinced this is the most effective or safest, any opinions would
> be greatly appreciated!
>
Do you need to use regex?

It would be simpler to use the .split method:

for pair in string.split("||||"):
     key, value = pair.split("|||")
     print("key is {!r}, value is {!r}".format(key, value))