Re: How to define repeated string when using the re module?

Started by: Chris Rebert <clp2@rebertia.com>
First post: 2011-08-02 10:22 -0700
Last post: 2011-08-02 10:22 -0700
Articles: 1 (1 participant)

This discussion starts older than the indexed window; earlier articles aren't shown. The article labeled Started by below is the oldest one visible, not the original post.


#10750 — Re: How to define repeated string when using the re module?

From: Chris Rebert <clp2@rebertia.com>
Date: 2011-08-02 10:22 -0700
Subject: Re: How to define repeated string when using the re module?
Message-ID: <mailman.1792.1312305752.1164.python-list@python.org>

On Tue, Aug 2, 2011 at 9:20 AM, smith jack <thinke365@gmail.com> wrote:
> if it's for a single character, this should be very easy, such as
> c{m,n}   the occurrence of c is between m and n,
>
> if i want to define the occurrence of (.*?)</div>  how should make it
> done?  ((.*?)</div>){1,3}  seems not work, any method to define repeat
> string using python regex?

Don't parse HTML using regexes; use an HTML parser!
http://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags

Here's a survey of Python HTML parsing libraries:
http://blog.ianbicking.org/2008/03/30/python-html-parser-performance/
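
As a rough illustration of the parser approach, here is a minimal sketch that pulls the text out of each top-level <div> using only the standard library. It assumes Python 3 (where the module is html.parser) and reasonably well-formed input; DivTextCollector and div_texts are hypothetical names made up for this example, and one of the third-party libraries from the survey above may well suit real-world HTML better.

```python
from html.parser import HTMLParser


class DivTextCollector(HTMLParser):
    """Collect the text inside each top-level <div> (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.depth = 0    # current <div> nesting depth
        self.divs = []    # text of each completed top-level <div>
        self._buf = []    # text fragments of the <div> being read

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "div" and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                # The outermost <div> just closed: store its text.
                self.divs.append("".join(self._buf).strip())
                self._buf = []

    def handle_data(self, data):
        # Keep text only while inside some <div>.
        if self.depth > 0:
            self._buf.append(data)


def div_texts(html):
    """Return the text of every top-level <div> in the given HTML string."""
    parser = DivTextCollector()
    parser.feed(html)
    parser.close()
    return parser.divs


sample = "<div>first</div><div>second <b>bold</b></div><p>ignored</p><div>third</div>"
print(div_texts(sample))  # ['first', 'second bold', 'third']
```

Counting the results (for example, checking that len(div_texts(page)) is between 1 and 3) then covers the "between m and n occurrences" requirement without any regex.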

Cheers,
Chris
--
http://rebertia.com
