Pdf Parsing Challenge

From	Felipe Espinoza <fespinozacast@gmail.com>
Newsgroups	comp.lang.ruby
Subject	Pdf Parsing Challenge
Date	Tue, 17 May 2011 16:04:19 -0500
Message-ID	<b3e54e146d346d393b16b935800076bb@ruby-forum.com>
X-Received-From	This message has been automatically forwarded from the ruby-talk mailing list by a gateway at comp.lang.ruby. If it is SPAM, it did not originate at comp.lang.ruby. Please report the original sender, and not us. Thanks! For more details about this gateway, please visit: http://blog.grayproductions.net/categories/the_gateway
X-Mail-Count	383403
X-Ml-Name	ruby-talk


Hi Everyone,

I'm trying to use the pdf-reader gem, but I'm having some trouble
understanding how the gem works.

If someone can help me with this, I'll be really grateful.

The Problem:

I have to extract some data from a paper in PDF format. I just need
some data from page 1: the title of the paper, the list of authors,
the authors' universities, their email addresses, the abstract, and
the keywords.

How can I extract this data from this paper?
http://dl.dropbox.com/u/6928078/CLEI_2008_002.pdf

A simple string containing the complete contents of each field
(keywords, abstract, etc.) would be enough.

It's not necessary to use this gem, but I need one string per field
with this info. How can I do that?

-- 
Posted via http://www.ruby-forum.com/.
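
One possible approach to the question above, as a rough sketch only: read
the text of page 1 and split it with a few heuristics. It assumes a
pdf-reader release that offers PDF::Reader#pages and
PDF::Reader::Page#text. The names first_page_text, extract_fields and
EMAIL_PATTERN are made up for illustration, and the layout rules (title
on the first line, sections labelled "Abstract" and "Keywords", body
starting at a numbered heading) are guesses that have not been checked
against the linked paper.

```ruby
# Heuristic sketch: split the text of a paper's first page into fields.
# The "Abstract" / "Keywords" labels and the title-on-first-line rule are
# assumptions about the layout, not facts taken from the actual PDF.

EMAIL_PATTERN = /[\w.+-]+@[\w-]+(?:\.[\w-]+)+/

# Returns the text of page 1. Needs the pdf-reader gem; the require is
# done lazily so extract_fields can be tried on plain strings without it.
def first_page_text(path)
  require 'pdf-reader'
  PDF::Reader.new(path).pages.first.text
end

# Takes the page text and returns a Hash with one entry per field.
def extract_fields(text)
  lines = text.lines.map(&:strip).reject(&:empty?)
  abstract_at = lines.index { |l| l =~ /\Aabstract\b/i } || lines.size
  keywords_at = lines.index { |l| l =~ /\Akeywords?\b/i } || lines.size

  # Everything between the title and the abstract: authors, affiliations, emails.
  header = lines[1...abstract_at] || []

  # The abstract runs up to the keywords label, or to the end of the page.
  abstract_end = keywords_at > abstract_at ? keywords_at : lines.size
  abstract = (lines[abstract_at...abstract_end] || []).join(' ')

  # Keywords: the labelled line plus wrapped lines, stopping at the first
  # line that starts with a digit (assumed to be a numbered section heading).
  tail = lines[keywords_at..-1] || []
  wrapped = tail.drop(1).take_while { |l| l !~ /\A\d/ }
  keyword_text = ([tail.first] + wrapped).compact.join(' ')

  {
    title: lines.first.to_s,
    authors_and_affiliations: header.reject { |l| l =~ EMAIL_PATTERN }.join("\n"),
    emails: header.join(' ').scan(EMAIL_PATTERN),
    abstract: abstract.sub(/\Aabstract\s*[:.\-]?\s*/i, ''),
    keywords: keyword_text.sub(/\Akeywords?\s*[:.\-]?\s*/i, '')
                          .split(/[,;]/).map(&:strip).reject(&:empty?)
  }
end

# Possible usage, assuming the PDF has been downloaded locally:
#   fields = extract_fields(first_page_text('CLEI_2008_002.pdf'))
#   puts fields[:abstract]
```

Separating authors from universities reliably would probably need rules
specific to this paper's layout, so the sketch returns them together as
one string. If the paper is not in English, the label patterns would have
to be adapted.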


Thread

Pdf Parsing Challenge Felipe Espinoza <fespinozacast@gmail.com> - 2011-05-17 16:04 -0500
  Re: Pdf Parsing Challenge Phillip Gawlowski <cmdjackryan@googlemail.com> - 2011-05-17 16:31 -0500
    Re: Pdf Parsing Challenge Felipe Espinoza <fespinozacast@gmail.com> - 2011-05-17 16:38 -0500
      Re: Pdf Parsing Challenge Phillip Gawlowski <cmdjackryan@googlemail.com> - 2011-05-17 16:45 -0500
  Re: Pdf Parsing Challenge Mark T <paradisaeidae@gmail.com> - 2011-05-17 19:42 -0500
  Re: Pdf Parsing Challenge Mark T <paradisaeidae@gmail.com> - 2011-05-17 19:37 -0500
  Re: Pdf Parsing Challenge Kouhei Sutou <kou@cozmixng.org> - 2011-05-18 08:23 -0500
  Re: Pdf Parsing Challenge Johannes Held <johannes.held@informatik.uni-erlangen.de> - 2011-05-19 10:21 +0200
