Groups | Search | Server Info | Login | Register


Groups > alt.os.linux.debian > #10629

Re: Understanding pdfseparate error messages

From Richard Owlett <rowlett@access.net>
Newsgroups alt.os.linux.debian
Subject Re: Understanding pdfseparate error messages
Date 2025-07-25 08:33 -0500
Organization A noiseless patient Spider
Message-ID <1060147$1uu97$1@dont-email.me> (permalink)
References <105tiod$1goci$1@dont-email.me> <105v5mq$urjl$1@dont-email.me>

Show all headers | View raw


On 7/25/25 12:46 AM, William Unruh wrote:
> Why not use pdfseparate to extract the two pages you want, and then use
> pdftotext on each of the two pages?

That's what I'd done.

> 
> On 2025-07-24, Richard Owlett <rowlett@access.net> wrote:
>> I'm running Debian 12.8.
>> I have a 100+ page PDF document.
>> I wish to extract 2 of those pages, each to their own PDF file for later
>> editing.
>>
>> I'm focusing on poppler-utils as it appears to offer tools for current
>> and future goals.
>>
>> Doing "pdftotext -layout -f 116 -l 116 TFP2021.pdf jul24-a.txt" comes
>> very close to what I want.
>>
>> Having been surrounded by TECO-buffs in the 70's, comparing the output
>> of "pdftotext -f 116 -l 116 TFP2021.pdf jul24-b.txt" to the above
>> suggests an approach to resolving.
>>
>> It involves being able to edit a *SINGLE* rather than all 100+ companion
>> pages.
>>
>> I tried "pdfseparate -f 116 -l 116 TFP2021.pdf dianostic.pdf" and got
>>> Syntax Error (3868069): Missing 'endstream' or incorrect stream length
>>> Syntax Error (3557294): Missing 'endstream' or incorrect stream length
>>>      [multiple repetitions of those 2 lines
>>> Syntax Error (3556857): Bad FCHECK in flate stream
>>> Syntax Error (3868069): Missing 'endstream' or incorrect stream length
>>> Syntax Error (3866517): Bad FCHECK in flate stream
>>
>> How/where do I find interpretation of those?
>>
>> TIA
>>
>> *A postscript
>>
>> I had originally composed this message before discovering "pdfseparate"
>> had created output files that that appear to be what I intended.
>>
>> I'm still interested in the meaning of the error messages as it may hint
>> as why ""pdftotext" wasn't *exactly* what I hoped for.
>>
>>

Back to alt.os.linux.debian | Previous | NextPrevious in thread | Find similar


Thread

Understanding pdfseparate error messages Richard Owlett <rowlett@access.net> - 2025-07-24 10:16 -0500
  Re: Understanding pdfseparate error messages Richard Kettlewell <invalid@invalid.invalid> - 2025-07-24 21:55 +0100
    Re: Understanding pdfseparate error messages Richard Owlett <rowlett@access.net> - 2025-07-25 08:30 -0500
      Re: Understanding pdfseparate error messages Richard Owlett <rowlett@access.net> - 2025-07-25 09:48 -0500
        Re: Understanding pdfseparate error messages William Unruh <unruh@invalid.ca> - 2025-07-27 16:30 +0000
  Re: Understanding pdfseparate error messages William Unruh <unruh@invalid.ca> - 2025-07-25 05:46 +0000
    Re: Understanding pdfseparate error messages Richard Owlett <rowlett@access.net> - 2025-07-25 08:33 -0500

csiph-web