From: Unknown
Newsgroups: comp.os.linux.misc
Subject: pdf & O.C.R ?
Date: Sat, 23 May 2015 07:49:37 +0000 (UTC)

I'm confused by how differently xpdf shows two PDFs.

http://www.inf.ethz.ch/personal/wirth/ProjectOberon/PO.Computer.pdf
is perfect to the pixel, even at maximum magnification [400%], which is
expected, since it's generated from computer fonts, whereas:
http://www.northernlaw.co.za/images/stories/files/actsandbills/COMPANY%20LAW%20ACT.pdf
looks blotchy and shows fibers, as if it's a photo of a paper copy.
And scanned copies of papers are apparently normal.

BUT!! How is it that xpdf allows me to extract the text, via mouse-copy,
from COMPANY%20LAW%20ACT.pdf? That would mean that the mouse driver is
doing O.C.R.?! And mc's viewer [which uses ] reads this text too.

Is this some new O.C.R. which I could use on jpg-ed pages of text?

==Thanks for any answers.
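
P.S. A guess on my part, not confirmed for this file: perhaps the scanned PDF carries a hidden text layer that was produced by O.C.R. when the file was made, and xpdf merely copies that layer. A minimal Python sketch to test the guess, assuming the pdftotext tool (shipped with xpdf and with poppler-utils) is installed. The function names and the 20-character threshold are my own inventions, not part of any tool:

```python
import shutil
import subprocess


def extract_text(pdf_path: str) -> str:
    """Return whatever text pdftotext finds stored in the PDF ("" if none)."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext not found on PATH")
    # Passing "-" as the output file makes pdftotext write to stdout.
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],
        capture_output=True,
        check=True,
    )
    return result.stdout.decode("utf-8", errors="replace")


def has_text_layer(extracted: str, min_chars: int = 20) -> bool:
    """Heuristic (threshold is arbitrary): enough letters or digits came out."""
    visible = [ch for ch in extracted if ch.isalnum()]
    return len(visible) >= min_chars
```

If has_text_layer(extract_text(path)) comes back True for a downloaded copy of the scanned file, then the text is stored in the PDF itself and nothing is being recognised at copy time. A bare jpg page has no such layer, so it would still need a separate O.C.R. program.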