From: Felipe Espinoza
Newsgroups: comp.lang.ruby
Subject: Pdf Parsing Challenge
Date: Tue, 17 May 2011 16:04:19 -0500

Hi everyone,

I'm trying to use the pdf-reader gem, but I'm having trouble understanding how it works. If someone can help me with this, I'll be really grateful.

The problem: I have to extract some data from a paper in PDF format. I only need data from page 1: the title of the paper, the list of authors, the authors' universities, their e-mail addresses, the abstract, and the keywords.

How can I extract this data from this paper?

http://dl.dropbox.com/u/6928078/CLEI_2008_002.pdf

A simple string containing the full contents of each field (keywords, abstract, etc.) would be enough.

I don't have to use this gem, but I do need one string per field. How can I do that?

-- 
Posted via http://www.ruby-forum.com/.
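
One possible approach to the question above, as a rough sketch rather than a definitive answer: read the text of page 1, then cut it into fields with regular expressions. The sketch assumes pdf-reader 1.0 or later (which provides `PDF::Reader#page` and `Page#text`) and assumes the page contains the literal headings "Abstract" and "Keywords", with the title on the first non-blank line. The helper names `first_page_text` and `extract_fields` are made up for illustration, and the layout assumptions have not been checked against the linked paper.

```ruby
# Returns the plain text of page 1 of a PDF.
# Assumption: the pdf-reader gem, version 1.0 or later, is installed.
def first_page_text(path)
  require 'pdf/reader' # loaded lazily so extract_fields works without the gem
  PDF::Reader.new(path).page(1).text
end

# Splits first-page text into one string per field.
# Assumed layout (hypothetical; adjust the patterns to the real paper):
#   title on the first non-blank line, then author/affiliation/e-mail lines,
#   then a section starting with "Abstract", then one starting with "Keywords".
def extract_fields(text)
  head, tail = text.split(/^\s*Abstract\b[.:]?\s*/i, 2)
  abstract, keywords = tail.to_s.split(/^\s*Keywords\b[.:]?\s*/i, 2)
  head_lines = head.to_s.lines.map(&:strip).reject(&:empty?)
  {
    title: head_lines.first.to_s,
    # Everything between the title and the abstract that is not an e-mail line.
    authors_and_affiliations:
      head_lines.drop(1).reject { |l| l =~ /\S+@\S+/ }.join("; "),
    emails: head.to_s.scan(/[\w.+-]+@[\w.-]+\.\w+/).join(", "),
    # Collapse line breaks and runs of spaces left over from the PDF layout.
    abstract: abstract.to_s.split.join(" "),
    keywords: keywords.to_s.split.join(" ")
  }
end

# Usage (file name taken from the URL in the post):
#   fields = extract_fields(first_page_text("CLEI_2008_002.pdf"))
#   puts fields[:abstract]
```

Separating authors from universities reliably depends on how the paper lays them out, so this sketch returns them together in one string. If the paper uses headings in another language (for example "Resumen" or "Palabras clave"), the two patterns would need to be changed accordingly.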