Re: Program to Split Single-Page HTML Documentation

From Helmut Richter <hr.usenet@email.de>
Newsgroups comp.infosystems.www.misc, comp.os.linux.misc
Subject Re: Program to Split Single-Page HTML Documentation
Date 2023-08-02 22:29 +0200
Message-ID <d9ddb265-e637-7a8-b6e6-b1d7a6281a3c@email.de>
References <64c73d4a@news.ausics.net> <ee6npjxig4.ln2@Telcontar.valinor> <423fa31-6dd-a7a8-7758-4eca1c952b4@email.de> <64c9941b@news.ausics.net>

Cross-posted to 2 groups.


On Wed, 2 Aug 2023, Computer Nerd Kev wrote:

> In comp.infosystems.www.misc Helmut Richter <hr.usenet@email.de> wrote:
> > Consequently, the proposed tool HTMLdoc excludes virtually everything:
> > 
> >  While it currently does not support many things in "the modern web"
> >  such as Cascading Style Sheets (CSS), forms, full Unicode, and Emoji
> >  characters, ...
> 
> Well no, it doesn't exclude most software documentation because for
> whatever reason the HTML in much of that has remained relatively
> sane. Some CSS is creeping in, but a tool that ignores it still
> produces clear text with some formatting. So far I've tested it
> with two documentation pages published in 2023 and it's understood
> the HTML fine (except for that out-of-page link problem in the RPi
> doc). Plus often I'm looking at docs published 10-20 years ago
> anyway.

Out-of-page links are trivial: replace each link "#xyz" with "subpage#xyz".
Which subpage each link target ends up on is known, at least if you go over
the text in two passes. I apply this procedure to all my web pages, which
are written as one document and split into pieces later.
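
A minimal sketch of the two-pass idea in Python, not the script I actually use. It assumes the document has already been cut into pieces held in a dict of file name to HTML text, and that anchors are written as id="..." or name="..." with double quotes. The function names and the regular expressions are illustrative only; a real tool would use a proper HTML parser. Links whose target is on the same subpage are left alone here, which is a design choice of the sketch.

```python
import re

# Anchor targets: id="..." or name="..." (double-quoted values only; assumption).
ANCHOR_RE = re.compile(r'\b(?:id|name)\s*=\s*"([^"]+)"')
# In-page links: href="#fragment".
LINK_RE = re.compile(r'href\s*=\s*"#([^"]+)"')


def build_anchor_index(pages):
    """Pass 1: map each anchor name to the file name of the subpage holding it."""
    index = {}
    for filename, html in pages.items():
        for anchor in ANCHOR_RE.findall(html):
            # Keep the first subpage that defines the anchor.
            index.setdefault(anchor, filename)
    return index


def rewrite_links(pages):
    """Pass 2: turn "#xyz" into "subpage#xyz" when xyz lives on another subpage."""
    index = build_anchor_index(pages)
    result = {}
    for filename, html in pages.items():
        def fix(match, current=filename):
            target = match.group(1)
            owner = index.get(target)
            if owner is None or owner == current:
                return match.group(0)  # unknown or same-page target: unchanged
            return f'href="{owner}#{target}"'
        result[filename] = LINK_RE.sub(fix, html)
    return result


pages = {
    "a.html": '<h1 id="intro">Intro</h1><a href="#setup">go</a>',
    "b.html": '<h1 id="setup">Setup</h1><a href="#intro">back</a>',
}
fixed = rewrite_links(pages)
print(fixed["a.html"])
print(fixed["b.html"])
```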

Another automatic change is to close and reopen each open tag:

<tag1> 
  a...
  <tag2>
    b...
---------------------------- cut here --------------
    c...
  </tag2>
  d...
</tag1>

becomes

<tag1> 
  a...
  <tag2>
    b...
  </tag2>
</tag1>
-------------------------- was cut here -------------
<tag1>
  <tag2>
    c...
  </tag2>
  d...
</tag1>

It is not obvious which attributes should be repeated in the second copy.
It is usually safer to repeat them, at least the "class" and "style"
attributes, and "id" attributes if style sheets refer to them.
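
The close-and-reopen step can be sketched in Python with the standard html.parser module. This is only an illustration under some assumptions: the cut position is a character offset that falls between tags (not inside one), void elements such as <br> and <img> are never reopened, and every attribute is repeated verbatim in the second copy, which is the "safer" choice described above. The names OpenTagTracker and split_at are made up for this example.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag and so are never "open".
VOID = {"area", "base", "br", "col", "embed", "hr", "img", "input",
        "link", "meta", "source", "track", "wbr"}


class OpenTagTracker(HTMLParser):
    """Keeps a stack of (tag name, raw opening tag text) for open elements."""

    def __init__(self):
        super().__init__()
        self.stack = []

    def handle_starttag(self, tag, attrs):
        if tag not in VOID:
            # get_starttag_text() returns the tag exactly as written,
            # attributes included, so it can be repeated verbatim.
            self.stack.append((tag, self.get_starttag_text()))

    def handle_startendtag(self, tag, attrs):
        pass  # self-closing form such as <br/>: nothing stays open

    def handle_endtag(self, tag):
        # Close the nearest matching open element and anything inside it.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break


def split_at(html, cut):
    """Split html at offset cut, closing open tags and reopening them."""
    tracker = OpenTagTracker()
    tracker.feed(html[:cut])
    tracker.close()
    closers = "".join(f"</{tag}>" for tag, _ in reversed(tracker.stack))
    openers = "".join(raw for _, raw in tracker.stack)
    return html[:cut] + closers, openers + html[cut:]


doc = '<div class="note"><p style="color:red">b...c...</p>d...</div>'
first, second = split_at(doc, doc.index("c..."))
print(first)
print(second)
```

Because "id" attributes are repeated too, an anchor can then appear on two subpages; a link-rewriting pass has to decide which copy is the real target, for instance the first one.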

-- 
Helmut Richter
