extract data between two regex

From Rudra Banerjee <bnrj.rudra@gmail.com>
Newsgroups comp.os.linux.development.system
Subject extract data between two regex
Date Thu, 09 Jun 2011 15:49:21 +0200
Organization A noiseless patient Spider
Lines 21
Message-ID <1307627361.3593.4.camel@roddur>



Dear friends,
How can I extract the data sandwiched between two regexes? Take, for
example, a file like the one pasted below (snipped from a GCstar HTML export).
What I want to do is extract the title (sandwiched between <strong>
and </strong>) and export it to LaTeX (or some other format).
Hoping for your help.

<table border="0" cellspacing="10" cellpadding="0" width="100%">
	<tr><td colspan="3"><strong>Band Theory and Electronic Properties of
Solids (Oxford Master Series in Condensed Matter Physics)
(9780198506447)</strong></td></tr>
	<tr><td rowspan="5" width="80"><img
src="booklist_images/Band_Theory_and_Electronic_Properties_of_Solids__Oxford_Master_Series_in_Condensed_Matter_Physics___9780198506447__0.jpg" height="160" alt="Band Theory and Electronic Properties of Solids (Oxford Master Series in Condensed Matter Physics) (9780198506447)" title="Band Theory and Electronic Properties of Solids (Oxford Master Series in Condensed Matter Physics) (9780198506447)" border="0"/></td></tr>	
</table>
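
One possible approach, sketched in Python (the post does not name a language, so this is an assumption): match non-greedily between the two tags with a pattern that is allowed to span line breaks, since the title above wraps over three lines. The function names `extract_titles` and `to_latex_items`, and the choice of an `itemize` list as the LaTeX output, are illustrative only. For anything beyond a simple, regular export like this one, a real HTML parser is likely the safer tool.

```python
import html
import re

# Everything between <strong> and </strong>, matched non-greedily.
# re.DOTALL lets "." match newlines, because the title spans several lines.
STRONG_RE = re.compile(r"<strong>(.*?)</strong>", re.DOTALL | re.IGNORECASE)

# Characters that need a backslash in LaTeX text.
# This is a partial list, assumed to be enough for typical book titles.
LATEX_SPECIALS = re.compile(r"([&%$#_{}])")


def extract_titles(text):
    """Return the contents of every <strong>...</strong> span.

    Runs of whitespace (including line breaks) are collapsed to single
    spaces, and HTML entities such as &amp; are decoded.
    """
    return [html.unescape(" ".join(m.split())) for m in STRONG_RE.findall(text)]


def to_latex_items(titles):
    """Format the titles as a LaTeX itemize list (an assumed output format)."""
    lines = [r"\begin{itemize}"]
    lines += [r"  \item " + LATEX_SPECIALS.sub(r"\\\1", t) for t in titles]
    lines.append(r"\end{itemize}")
    return "\n".join(lines)
```

To use it, read the exported file into a string and pass it through both functions, for example `print(to_latex_items(extract_titles(open("booklist.html").read())))`, where `booklist.html` is a hypothetical file name.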



Thread

extract data between two regex Rudra Banerjee <bnrj.rudra@gmail.com> - 2011-06-09 15:49 +0200
  Re: extract data between two regex Jorgen Grahn <grahn+nntp@snipabacken.se> - 2011-06-09 14:26 +0000
    Re: extract data between two regex Richard Kettlewell <rjk@greenend.org.uk> - 2011-06-09 16:02 +0100
  Re: extract data between two regex Wil Taphoorn <wil@nogo.wtms.nl> - 2011-06-09 16:50 +0200
