Groups | Search | Server Info | Keyboard shortcuts | Login | Register
Groups > comp.os.linux.development.system > #162
| Path | csiph.com!x330-a1.tempe.blueboxinc.net!usenet.pasdenom.info!gegeweb.org!eternal-september.org!feeder.eternal-september.org!.POSTED!not-for-mail |
|---|---|
| From | Rudra Banerjee <bnrj.rudra@gmail.com> |
| Newsgroups | comp.os.linux.development.system |
| Subject | extract data between two regex |
| Date | Thu, 09 Jun 2011 15:49:21 +0200 |
| Organization | A noiseless patient Spider |
| Lines | 21 |
| Message-ID | <1307627361.3593.4.camel@roddur> (permalink) |
| Mime-Version | 1.0 |
| Content-Type | text/plain; charset="UTF-8" |
| Content-Transfer-Encoding | quoted-printable |
| Injection-Info | mx04.eternal-september.org; posting-host="yXvgrccv7m47YEm3bx6RQw"; logging-data="11985"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+EZ2Y+u9lKVom4KWLdjgsA" |
| X-Mailer | Evolution 2.32.2 (2.32.2-1.fc14) |
| Cancel-Lock | sha1:cRaoQfRfjNPg1kzZ44er6RFdXXU= |
| Xref | x330-a1.tempe.blueboxinc.net comp.os.linux.development.system:162 |
Show key headers only | View raw
Dear friends, How can I extract data sandwiched between two regex ? Say, for a file(snipped from gcstar export html) like this pasted below. What I want to do is to extract the titke (sandwidched between <srtong> and </strong>) and export it to latex(or other format). hoping for your help. <table border="0" cellspacing="10" cellpadding="0" width="100%"> <tr><td colspan="3"><strong>Band Theory and Electronic Properties of Solids (Oxford Master Series in Condensed Matter Physics) (9780198506447)</strong></td></tr> <tr><td rowspan="5" width="80"><img src="booklist_images/Band_Theory_and_Electronic_Properties_of_Solids__Oxford_Master_Series_in_Condensed_Matter_Physics___9780198506447__0.jpg" height="160" alt="Band Theory and Electronic Properties of Solids (Oxford Master Series in Condensed Matter Physics) (9780198506447)" title="Band Theory and Electronic Properties of Solids (Oxford Master Series in Condensed Matter Physics) (9780198506447)" border="0"/></td></tr> </table>
Back to comp.os.linux.development.system | Previous | Next — Next in thread | Find similar
extract data between two regex Rudra Banerjee <bnrj.rudra@gmail.com> - 2011-06-09 15:49 +0200
Re: extract data between two regex Jorgen Grahn <grahn+nntp@snipabacken.se> - 2011-06-09 14:26 +0000
Re: extract data between two regex Richard Kettlewell <rjk@greenend.org.uk> - 2011-06-09 16:02 +0100
Re: extract data between two regex Wil Taphoorn <wil@nogo.wtms.nl> - 2011-06-09 16:50 +0200
csiph-web