Path: csiph.com!x330-a1.tempe.blueboxinc.net!usenet.pasdenom.info!gegeweb.org!de-l.enfer-du-nord.net!feeder2.enfer-du-nord.net!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail From: Helge Blischke Newsgroups: comp.lang.postscript Subject: Re: text messes up in ps2pdf converted pdf Followup-To: comp.lang.postscript Date: Tue, 26 Apr 2011 15:18:57 +0200 Lines: 25 Message-ID: <91nv22F5p3U1@mid.individual.net> References: <0ee8b9ed-146a-403a-8b34-f37497f2b5df@n10g2000yqf.googlegroups.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7Bit X-Trace: individual.net LQ/u/yH39ADZgmOfYSdGRQvU7JVdq+xV4KQdJj4fZg84FZb4LX Cancel-Lock: sha1:FdmCU8l1WoJIKckUUPPb4x0Og5Q= User-Agent: KNode/0.99.01 Xref: x330-a1.tempe.blueboxinc.net comp.lang.postscript:154 Peng Yu wrote: > Hi, > > I convert the ps file to a pdf file. But the text in the pdf is messed > up. I'm wondering how to convert a ps file to a pdf file so that the > text can be copied? Thanks and look forward to hearing from you! > > $ps2pdf pcfg-notes.ps > > http://www.cs.cmu.edu/~roni/11761-s01/PreviousYearsHandouts/pcfg-notes.ps > > -- > Regards, > Peng The PS file has been generated by dvips, the default TeX PostScript generation method. This method, by default, positions each character indifidually, therefore there is no trivial method to extract text from it; it requires more intelligent algorithms (e.g. to compare the distance between rendered characters with the normal interword spacing to determine word boundaries etc. Helge