Path: csiph.com!fu-berlin.de!uni-berlin.de!not-for-mail From: Stefan Schwarzer Newsgroups: de.comp.lang.python Subject: Re: [Python-de] Gruppen in sich wiederholenden Sub-Patterns Date: Fri, 16 Nov 2018 09:13:50 +0100 Lines: 28 Message-ID: References: <3baad237-6d51-8748-bf15-c9cd66ca5aff@sschwarzer.net> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: news.uni-berlin.de 1iYd2rVWoZmsDZkFUdvxbgwGRQEAgL7ck9CBrLzbJtiQ== Return-Path: X-Original-To: python-de@python.org Delivered-To: python-de@mail.python.org Openpgp: preference=signencrypt Autocrypt: addr=sschwarzer@sschwarzer.net; prefer-encrypt=mutual; keydata= xsFNBFW78IwBEADMkDDhNj/nvNT5J/uqPYGte7bB4N3w5F9uwvHR1Mnm7DTMtzNaheVv6wWF ZdHNkq+PWqYaJZQ0m12Nso31YKHdSm+Bw1ppYQGg5Jv5sJKzTE47FvzVylh63dHjOTXhLDHX HNLOssVvr8xl6LJOoC2pvtD+fdGUvGEeyso+WbgmgjlSh/2AhO8hg9FUcxCHzcCvIrC/c+ot 8Fnf/MXpI+4sytBQscEAGo8YSLqXqs8dOOd1NONghSlfyWWz32quXX1qyN4ptjVpMQxRgabm 4HDaUQqL/WbNde815n4I2jGvLZVV3+MVeZ+3pI1FduH7+HQOARO98b98AZ+kUXP7bn4TOgNT zRMdOO8lF26T2Og5ELZWm4yan4gNUdIFbE7a7UBv+Pf99zQEYcGFbpG6i1an4R+KWUFAQmwT gEvxGW4tTFLkYnJTuvptW4sCaO5JSDfKTrYBHVlIxUP6yY/EaRsjTalESy3YYFg36sKp4URu 6mrxdNs+dRxpkTTfDdnmQkz2DRks7qBwfXJZncD2U70+5q4cp2P9hLEGvCskpaMBy2D1zTdh +TZW0tHQAvCvQKYNlXKdtqsTHMWhGKTheTKfNOAR6NlJxhlHmWFT2KXE54E0dKZSWbZzAP4b pVeHQpB7b/5Qvrylq70KdPwh7q0IS3xGRPutekW9Dznm1VKOhQARAQABzSxTdGVmYW4gU2No d2FyemVyIDxzc2Nod2FyemVyQHNzY2h3YXJ6ZXIubmV0PsLBewQTAQIAJQIbAwYLCQgHAwIG FQgCCQoLBBYCAwECHgECF4AFAlW9ApICGQEACgkQrRxpUjCGWJ7rLA/8DFFU8aJTtbMDXm8o OTIc+1uuAeYhw/5VWcT4NxoztIN5UHY/07F/AXsCLkqydX4+/5VxmTl4U/e0U1pz8LTiZAaK zP9YPrqRfrB8v6ARh26yNPyABcUH+DDMNLNj4228xK0bwPfmWnPx0B39kGD6DVn9DfPz8sIg hkCKCRqNZhTbfUy307NWmZK0MZRMEe/H1oKGQsXUwWu5aSJcCezQsly7vfpVwne3RbDYrJRb xmDI9LfJ1wvg+vIfYeYEVR6/hKFrEqxhVdGDgAtqzSvZxy/ySfmnjFf4aMti2hktDvRN5D+3 z5mhl4oIFXJk1KgbUk84umumGXqks3PTOiDnWh4t8A9QC9hFpU3dcM0AGbYywE55Xtvquzoh Sy90nTaC94RCs8vlPyU4oYDrKRvqnv6zfREpsk9eccrsfWEqGXyj02dFSW48mdpFT+yXrKFp pAW2dv6C0TpTBRGr/WmGdbxvduJbuEak9bJGtb1sNOaRD7kmDdv+qJqisbmAAOv+pTnvmjuh aRdzzl0G/uv8ixbL8HX7cX143v9D9U4HIBRr/7sECMf1x9FBfqm7rInP5MYWk5a9QaMeL1an xMn80zkTP2V8QIleVfQT3kZ71IEVF7ouPGwepm6oPbKhW0NOU1mQcwHYClDKhoyc4sfCP9tl 3T/y+2Gsoo+05DWrBDLOwU0EVbvwjAEQAJsFeIY0q0IJuyJNIjsfitKVU12sMDn2hV043SRz 2S7ApNclcY8Y2xEh+q+GXFzMDLAgOijmWN93eOryzcswhgXKiMoVbQeRoSFa0WQv2ozR8Jzs X6KBETSF7PnMC8ymxP2JEbIbUZyyhcwXvtd2EZLSTHlQoQzmm6+MEQ09cpRJpkf/KYsc4S+J 2JXq7GgHYNDyBYCIR4QOPZVA3oPt6DBPVMwc7rRjnfhBlVvDwCsUlmrqJwY0ysTF7V2Ud079 JlsbNzIp49WAwCSvivgl3HdpqW1gv/UnK1PtG3Crsb1yvLIBCPGcb1SIifc5HUFGWgD1r0Y3 Q9e15scLMqHazFBUkvnsS4U2qrPqJJPwbxpGj16/d62rIwiHMHtGxHazaPlbb9hLRFj96Vx4 +v8/02mj+pbkXbKf8Kc4+i4MBKjZ4tYi8UFT2uC9GNYMH7Ajf3w17HRvIbBX7K/0QnI9GF/P o/NRKv66YpZGXwRqMwcC8sAfEHmNa9x3PTH6IvveBo6w/MyJ0KQhawRMSTSeKsKHSvnzqp7l 9u1cpKpi5FzFgumJKCxbR3fJgKV11Ia+PvXRoOkvt/7bROgc+E86HUdxVhOIk4qvt+ztbI4B jyho/u+C7OP/xWAdztyZNbA4sIdbwJ8LghT0CvB40n+dzPzwBu43SFaEcvabidO9PO9XABEB AAHCwV8EGAECAAkFAlW78IwCGwwACgkQrRxpUjCGWJ6Niw/8DQIQ7wxb9C0AdCAasxTOwRVG 0d/99vs1HhyR2s8OmGZjHqtXHCl08tkf5fcncIl/628ZjiugtZAvft4KkxGWNzmF97LgHZ+0 v005/t+6YnaRODS4IRM/mB6SDEiNW3Yraxwf2hcXwF9Hjy8zNgWve2UYzlJsrQqDYyO/ktxB ucu1bCfmuClY1VqipL0hQQVPjjC7BKhti3gdOKNNdsx3FAba6hqOgU5zbHAkLhFGY83ppn/t tBT5OULZ3JUmceELWGnNhO+REfokDmvQIWFQ1eo4eZjL27rPn/EXJFtf0o7woyfGj0/0htzr UNrEqoumSep/RmSSG29vyNQqS5kKmNCeBmSZXR39nkVVk+gOFsic78HoAZDMGblv3aIMbc85 DR+Q8bpdhmcvV049/CirqHkKaOYcblPvKhuZf7fmUbZSDjryvEaVSZmZO/TrNQXnbzNBHc10 PcZBnHb76fvGjXCnThgoXHXS38+SUGisGrpIRRZuFl9icE9J8L/FMvodycrkthbEKds777p0 dkgvDeD8Wvr+5orkt6fEC4gczOD0oK46KYqOSWGCim3FyAm0EDIMHtra+wLlyzv5MGHbnjzl BjkNbYpaPBfkwACkk5hclu9jAOx4aBHeXpHyRDbsoJJ5yoTx3qZKT14kPO8aytDwukobeW03 WdtUokwYRxk= User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.24) Gecko/20100411 Thunderbird/2.0.0.24 Mnenhy/0.7.6.666 In-Reply-To: Content-Language: de-DE X-Provags-ID: V03:K1:+P2uDxp5uOzEfQBMIgHAjXmufb21TaUvHahUPHPeomozhoEHQ3F SN4iJTESzcpYRZTdBa64e8RZGsoxQILmjn2vPwKVCCAuaLJNdVL1c9C1sci77xj//kgpY25 mtpZFy+CZV9BZDKp5t/p/Mn5shBCs/Fb9Dvyp1QL0NqltH5/12ss7WCYkQ+pHsp6YAJB2m1 9qEXMN+vHwPqLvOvC9UHQ== X-Spam-Flag: NO X-UI-Out-Filterresults: notjunk:1;V01:K0:4q9wSdja8hQ=:C45ZvEvlwIC8pH4MrltwHj qejq4OCcP3M4C2dL+RYj4a09cvoli5f35zmPvi1o7PbuzX3/Xvp4HwkLSwwkq+tGYDnA5MLF/ oJR3KWLGXv5NF5EnvuuuBrRnlo/4OLwAX/YCtQGLQe9GLpu8K0tZqXlHvMTvdFkrx56PxKwZb +BwhyW4kapCiX2sPFqG5sSFm7PpwJZG6TmdN7WMu6dk+yqxOm+0GBXxgm1+woK56fwpBusRvs UOgWJwKQoCMMyuqKekPCQu4f2TRhL14zv4FFjSCgzaSBRjdwRHaFmvXxbHr4BPWxy9LnTfwXo oPRCvaJNmH/kFEmTjEVYjzQTd+ohOd28SulU+D8mFGfjJyRpjIX5M+m9V5a4KKLQVDLSA87Hu 3ckL7VAncDT4t1NM0ekYQQV3SqHtQiyVmPu96uxO8CWixiZ//8V6VeWNugBNw+KWRVY+6DeF4 PlzHCKcludVJugJuUTlPhTczV+Ks4biGFFTlr75eps3NeRQJQPLcuAc4KovmOscEF86TuKMj8 AGa8W6imHU17/+ZycMgTRMH3m8NDydy+ofs+mO3tN2lg/9i4yBCRqzGVd8rQtbXHXNR3zNcPo laZn/dJk37VkNcawKldPIqXrnffpVEblys2yefOutLnWNouXsjVED1yr0LbtraL1JDGC+djib d0ppdcpd44HqW8HK8PMUKfJS9+IlsUCWZ6b8X2ej2qU/KlYgqLO+0vugyE6LzBVue+cDIDQO8 RLVAwZu2/q5pfjicHZBe8R90X6iluodL6B0CoudTR87jgu1l5iSIZWdKBG5sR4yAZvNkznXGG 1rXjRgLEcYlz1Y3XTpgkNfJlN9y2mBPceCLNDcB2aC3WhhIaW8= X-BeenThere: python-de@python.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Die Deutsche Python Mailingliste List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Mailman-Original-Message-ID: X-Mailman-Original-References: <3baad237-6d51-8748-bf15-c9cd66ca5aff@sschwarzer.net> Xref: csiph.com de.comp.lang.python:5334 On 16/11/2018 08.25, Dustin Vanidestine wrote: >> Gibt es eine Möglichkeit, den regulären Ausdruck so umzuschreiben, >> dass ich alle gewünschten Gruppen bekomme? >> >> Falls nicht, wie würdet ihr das Problem sonst lösen? Mir sind >> mögliche Ansätze eingefallen, aber die wirken alle ziemlich >> frickelig. > > So würde ich es machen (Python 3.7): >>>> import re >>>> regex = re.compile(r"""\(\w+,\w+\)""", > re.U).findall("""(ab,cd),(ef,gh),(ij,kl),(mn,op)""") >>>> regex > ['(ab,cd)', '(ef,gh)', '(ij,kl)', '(mn,op)'] >>>> regex = re.compile(r"""\(\w+,\w+\)""", > re.U).findall("""Anfang(ab,cd),(ef,gh),(ij,kl),(mn,op)(andereszeug)""") >>>> regex > ['(ab,cd)', '(ef,gh)', '(ij,kl)', '(mn,op)'] Das Problem hierbei ist, dass die Kommas zwischen den eingeklammerten Gruppen Bestandteil der Syntax sind. Es soll zwischen den Gruppen nicht irgendetwas beliebiges stehen können. Ansonsten wäre `findall` natürlich das Mittel der Wahl. Viele Grüße Stefan