Program to Split Single-Page HTML Documentation

Message-ID <64c73d4a@news.ausics.net> (permalink)
From Computer Nerd Kev <not@telling.you.invalid>
Subject Program to Split Single-Page HTML Documentation
Newsgroups comp.infosystems.www.misc, comp.os.linux.misc
Date 2023-07-31 14:49 +1000
Organization Ausics - https://www.ausics.net

Cross-posted to 2 groups.


I don't like browsing huge single HTML pages of documentation. Does
anyone know of a program or script (preferably for Linux) that can
scan a big software manual's single HTML page and automatically
break it up according to the contents section and the corresponding
anchor links?

Basically I want something to turn this:
http://www.gnu.org/software/coreutils/manual/coreutils.html

into this:
http://www.gnu.org/software/coreutils/manual/html_node/index.html

But without needing the Texinfo source that GNU software (usually)
provides; it has to work from the HTML alone. I also want it to
output static HTML, so no solutions using JavaScript or browser
add-ons.

One option might be to use csplit to break it up at common section
separator patterns, then have a simple script rename the new files
according to their heading text. But I'd like to have HTML
navigation links too, ideally including conversion of the existing
anchor links inside the document so they point at the right file.
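
To make that idea concrete, here is a rough Python sketch of what I
have in mind. It is not an existing tool, just an illustration. The
names split_html and slugify are made up, and it assumes that every
section starts at an <h2> tag and that attributes use double quotes,
which won't hold for every manual. It uses regular expressions
rather than a real HTML parser, so treat it as a starting point
only.

```python
import re
from html import unescape

# Assumption: sections begin at <h2>; adjust for the manual in question.
HEADING = re.compile(r'<h2\b[^>]*>(.*?)</h2>', re.IGNORECASE | re.DOTALL)
# Crude: also catches name="..." on non-anchor elements.
ID_ATTR = re.compile(r'\b(?:id|name)="([^"]+)"', re.IGNORECASE)
LOCAL_HREF = re.compile(r'href="#([^"]+)"', re.IGNORECASE)


def slugify(heading_html):
    """Turn heading markup into a safe file name stem."""
    text = unescape(re.sub(r'<[^>]+>', '', heading_html))
    return re.sub(r'[^a-z0-9]+', '-', text.lower()).strip('-') or 'section'


def split_html(body):
    """Split body HTML at each <h2>; return {file name: page HTML}."""
    starts = [m.start() for m in HEADING.finditer(body)]
    if not starts:
        return {'index.html': body}
    bounds = [0] + starts + [len(body)]
    chunks = [body[a:b] for a, b in zip(bounds, bounds[1:])]

    # Everything before the first heading becomes index.html.
    names = ['index.html']
    for chunk in chunks[1:]:
        stem = slugify(HEADING.match(chunk).group(1))
        name, n = stem + '.html', 2
        while name in names:  # avoid clashes between equal headings
            name = f'{stem}-{n}.html'
            n += 1
        names.append(name)

    # Record which output file each anchor id ends up in.
    owner = {}
    for name, chunk in zip(names, chunks):
        for anchor in ID_ATTR.findall(chunk):
            owner.setdefault(anchor, name)

    pages = {}
    for i, (name, chunk) in enumerate(zip(names, chunks)):
        def relink(m, here=name):
            # Point "#id" links at the file that now holds that id.
            target = owner.get(m.group(1))
            if target is None or target == here:
                return m.group(0)
            return f'href="{target}#{m.group(1)}"'

        chunk = LOCAL_HREF.sub(relink, chunk)
        nav = []
        if i > 0:
            nav.append(f'<a href="{names[i - 1]}">Previous</a>')
        if i + 1 < len(names):
            nav.append(f'<a href="{names[i + 1]}">Next</a>')
        pages[name] = '<p>' + ' | '.join(nav) + '</p>\n' + chunk
    return pages
```

Each value in the returned dictionary would still need wrapping in
the original <head> and <body> boilerplate before being written out
under its file name, and the contents section that ends up in
index.html serves as the table of contents since its links get
rewritten along with the rest.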

A prime target would be the Raspberry Pi configuration
documentation, which has convinced me of the merit of multi-page
docs by how confusing it has become for me since they switched to a
single-page layout:
https://www.raspberrypi.com/documentation/computers/configuration.html

-- 
__          __
#_ < |\| |< _#


