Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.infosystems.www.misc > #200
| From | Helmut Richter <hr.usenet@email.de> |
|---|---|
| Newsgroups | comp.infosystems.www.misc, comp.os.linux.misc |
| Subject | Re: Program to Split Single-Page HTML Documentation |
| Date | 2023-08-02 22:29 +0200 |
| Message-ID | <d9ddb265-e637-7a8-b6e6-b1d7a6281a3c@email.de> (permalink) |
| References | <64c73d4a@news.ausics.net> <ee6npjxig4.ln2@Telcontar.valinor> <423fa31-6dd-a7a8-7758-4eca1c952b4@email.de> <64c9941b@news.ausics.net> |
Cross-posted to 2 groups.
On Wed, 2 Aug 2023, Computer Nerd Kev wrote:
> In comp.infosystems.www.misc Helmut Richter <hr.usenet@email.de> wrote:
> > Consequently, the proposed tool HTMLdoc excludes virtually everything:
> >
> > While it currently does not support many things in "the modern web"
> > such as Cascading Style Sheets (CSS), forms, full Unicode, and Emoji
> > characters, ...
>
> Well no, it doesn't exclude most software documentation because for
> whatever reason the HTML in much of that has remained relatively
> sane. Some CSS is creeping in, but a tool that ignores it still
> produces clear text with some formatting. So far I've tested it
> with two documentation pages published in 2023 and it's understood
> the HTML fine (except for that out-of-page link problem in the RPi
> doc). Plus often I'm looking at docs published 10-20 years ago
> anyway.
Out-of-page links are trivial: replace each link "#xyz" by "subpage#xyz".
It is known to which subpage each link belongs, at least if you go over
the text in two passes. This is a procedure which I apply to all my web
pages, which are written as one document, and split into pieces later.
Another automatic change is to close and reopen each open tag:
<tag1>
a...
<tag2>
b...
---------------------------- cut here --------------
c...
</tag2>
d...
</tag1>
becomes
<tag1>
a...
<tag2>
b...
</tag2>
</tag1>
--------------------------was cut here ---------------
<tag1>
<tag2>
c...
</tag2>
d...
</tag1>
It is not obvious which attributes should be repeated in the second copy.
Mostly it is safer to repeat them, at least "class" and "style" attributes
and "id" attributes if they are referred to in style sheets.
--
Helmut Richter
Back to comp.infosystems.www.misc | Previous | Next — Previous in thread | Next in thread | Find similar
Program to Split Single-Page HTML Documentation Computer Nerd Kev <not@telling.you.invalid> - 2023-07-31 14:49 +1000
Re: Program to Split Single-Page HTML Documentation Bud Frede <frede@mouse-potato.com> - 2023-07-31 08:21 -0400
Re: Program to Split Single-Page HTML Documentation Computer Nerd Kev <not@telling.you.invalid> - 2023-08-01 15:57 +1000
Re: Program to Split Single-Page HTML Documentation Bud Frede <frede@mouse-potato.com> - 2023-08-01 07:13 -0400
Re: Program to Split Single-Page HTML Documentation "Carlos E.R." <robin_listas@es.invalid> - 2023-08-01 13:21 +0200
Re: Program to Split Single-Page HTML Documentation Helmut Richter <hr.usenet@email.de> - 2023-08-01 14:31 +0200
Re: Program to Split Single-Page HTML Documentation not@telling.you.invalid (Computer Nerd Kev) - 2023-08-02 09:24 +1000
Re: Program to Split Single-Page HTML Documentation Helmut Richter <hr.usenet@email.de> - 2023-08-02 22:29 +0200
Re: Program to Split Single-Page HTML Documentation Helmut Richter <hr.usenet@email.de> - 2023-08-03 11:44 +0200
Re: Program to Split Single-Page HTML Documentation Computer Nerd Kev <not@telling.you.invalid> - 2023-08-04 22:07 +1000
Re: Program to Split Single-Page HTML Documentation "23k.304" <23k304@bfxw9.net> - 2023-08-05 00:45 -0400
Re: Program to Split Single-Page HTML Documentation Computer Nerd Kev <not@telling.you.invalid> - 2023-08-05 18:21 +1000
Re: Program to Split Single-Page HTML Documentation Helmut Richter <hr.usenet@email.de> - 2023-08-05 10:56 +0200
Re: Program to Split Single-Page HTML Documentation "23k.304" <23k304@bfxw9.net> - 2023-08-05 23:37 -0400
Re: Program to Split Single-Page HTML Documentation "23k.304" <23k304@bfxw9.net> - 2023-08-05 23:39 -0400
Re: Program to Split Single-Page HTML Documentation not@telling.you.invalid (Computer Nerd Kev) - 2023-08-02 09:39 +1000
Re: Program to Split Single-Page HTML Documentation Theo <theom+news@chiark.greenend.org.uk> - 2023-08-07 18:51 +0100
Re: Program to Split Single-Page HTML Documentation not@telling.you.invalid (Computer Nerd Kev) - 2023-08-08 09:06 +1000
csiph-web