Path: csiph.com!usenet.pasdenom.info!aioe.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed4.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
In-Reply-To: <b26d653e-fd1a-4e27-a0f9-da432251ea7e@googlegroups.com>
References: <b26d653e-fd1a-4e27-a0f9-da432251ea7e@googlegroups.com>
Date: Tue, 19 Mar 2013 09:46:00 -0400
Subject: Re: How to extract certain set of lines from PDF
From: Joel Goldstick <joel.goldstick@gmail.com>
To: adamnizar01@gmail.com
Content-Type: multipart/alternative; boundary=bcaec54d4cfc660fe004d847532f
Cc: "python-list@python.org" <python-list@python.org>
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.3501.1363700763.2939.python-list@python.org>
Lines: 74
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:41492

--bcaec54d4cfc660fe004d847532f
Content-Type: text/plain; charset=UTF-8

On Tue, Mar 19, 2013 at 9:16 AM, <adamnizar01@gmail.com> wrote:

> Hello,
>
> I need to extract certain set of lines from PDF
> Ex:-
> IF(......)
> ..........
> ..........
>    IF(.....)
>    ...........
>    ...........
>    ENDIF
> ENDIF
>
> I need to copy entire lines from first "IF" till last "ENDIF".and extract
> it to seperate row of excel sheet.when ever a new occurrance of this kind
> of IF loops are found out.
> --
> http://mail.python.org/mailman/listinfo/python-list
>

You might start with this: http://knowah.github.com/PyPDF2/

I've never had to read pdf files, but it looks like there are several
libraries to choose from


-- 
Joel Goldstick
http://joelgoldstick.com

--bcaec54d4cfc660fe004d847532f
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><br><div class=3D"gmail_extra"><br><br><div class=3D"gmail=
_quote">On Tue, Mar 19, 2013 at 9:16 AM,  <span dir=3D"ltr">&lt;<a href=3D"=
mailto:adamnizar01@gmail.com" target=3D"_blank">adamnizar01@gmail.com</a>&g=
t;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-=
left:1px solid rgb(204,204,204);padding-left:1ex">Hello,<br>
<br>
I need to extract certain set of lines from PDF<br>
Ex:-<br>
IF(......)<br>
..........<br>
..........<br>
=C2=A0 =C2=A0IF(.....)<br>
=C2=A0 =C2=A0...........<br>
=C2=A0 =C2=A0...........<br>
=C2=A0 =C2=A0ENDIF<br>
ENDIF<br>
<br>
I need to copy entire lines from first &quot;IF&quot; till last &quot;ENDIF=
&quot;.and extract it to seperate row of excel sheet.when ever a new occurr=
ance of this kind of IF loops are found out.<br>
<span class=3D""><font color=3D"#888888">--<br>
<a href=3D"http://mail.python.org/mailman/listinfo/python-list" target=3D"_=
blank">http://mail.python.org/mailman/listinfo/python-list</a><br>
</font></span></blockquote></div><br></div><div class=3D"gmail_extra">You m=
ight start with this: <a href=3D"http://knowah.github.com/PyPDF2/">http://k=
nowah.github.com/PyPDF2/</a><br><br></div><div class=3D"gmail_extra">I&#39;=
ve never had to read pdf files, but it looks like there are several librari=
es to choose from<br>
</div><div class=3D"gmail_extra"><br clear=3D"all"><br>-- <br><div dir=3D"l=
tr"><div>Joel Goldstick<br></div><a href=3D"http://joelgoldstick.com" targe=
t=3D"_blank">http://joelgoldstick.com</a><br></div>
</div></div>

--bcaec54d4cfc660fe004d847532f--