Groups | Search | Server Info | Login | Register


Groups > comp.infosystems.www.authoring.html > #6564

Re: Numeric character references in W3C HTML 5.2?

From Eli the Bearded <*@eli.users.panix.com>
Newsgroups comp.infosystems.www.authoring.html
Subject Re: Numeric character references in W3C HTML 5.2?
Date 2018-04-06 20:59 +0000
Organization Some absurd concept
Message-ID <eli$1804061659@qaz.wtf> (permalink)
References <numeric-20180406135137@ram.dialup.fu-berlin.de> <pa82ha$2j8$1@news.albasani.net> <eli$1804061415@qaz.wtf> <pa8j6k$3po$1@news.albasani.net>

Show all headers | View raw


In comp.infosystems.www.authoring.html,
David E. Ross <nobody@nowhere.invalid> wrote:
> On 4/6/2018 11:15 AM, Eli the Bearded wrote [in part]:
> > 65 is a valid Unicode codepoint for LATIN LETTER CAPITAL A. Do you have
> > any reson for asking people not to use codepoints for encoind? I churn
> > out thousands per day, and until there is a way to name every character
> > in Unicode I'm likely to continue doing so.
> Too often, I see unreadable garbage because someone has composed HTML
> using Unicode code points instead of named character references.  Most
> often, this is a result of faulty or inconsistent <head> declarations.

Numerical encodings are ALWAYS to Unicode code points. If you have an
ISO-8859-1 document numerical encoding is the only way to include Farsi
(eg). Faulty or inconsistent declarations elsewhere are a separate issue
and not related to &#NUMBER; problems.

> Even with such faulty declarations, the amount of garbage may be reduced
> by using named character references, especially since they do not depend
> on any charset attribute.

Numerical encodings are ALWAYS to Unicode code points. They do not
depend on any charset, character set, or character encoding.

> Generally, the garbage appears in HTML-formatted E-mail and newsgroup
> messages.  However, I also see this on Web pages.

Ah, why no love for "newsgroup messages"? "E-mail" and "Web" get capitals.

(ten line sig snipped)

Elijah
------
does not use a .sig

Back to comp.infosystems.www.authoring.html | Previous | NextPrevious in thread | Next in thread | Find similar


Thread

Re: Numeric character references in W3C HTML 5.2? "David E. Ross" <nobody@nowhere.invalid> - 2018-04-06 08:08 -0700
  Re: Numeric character references in W3C HTML 5.2? "Jukka K. Korpela" <jukkakk@gmail.com> - 2018-04-06 18:52 +0300
    Re: Numeric character references in W3C HTML 5.2? Thomas 'PointedEars' Lahn <PointedEars@web.de> - 2018-04-06 21:37 +0200
      Re: Numeric character references in W3C HTML 5.2? John W Kennedy <john.w.kennedy@gmail.com> - 2018-04-06 23:00 -0400
        Re: Numeric character references in W3C HTML 5.2? Thomas 'PointedEars' Lahn <PointedEars@web.de> - 2018-04-08 01:43 +0200
    Re: Numeric character references in W3C HTML 5.2? "David E. Ross" <nobody@nowhere.invalid> - 2018-04-06 12:42 -0700
      Re: Numeric character references in W3C HTML 5.2? Gloops <gloops@zailes.invalid.org.invalid> - 2018-07-12 12:41 +0200
  Re: Numeric character references in W3C HTML 5.2? Eli the Bearded <*@eli.users.panix.com> - 2018-04-06 18:15 +0000
    Re: Numeric character references in W3C HTML 5.2? "David E. Ross" <nobody@nowhere.invalid> - 2018-04-06 12:52 -0700
      Re: Numeric character references in W3C HTML 5.2? Eli the Bearded <*@eli.users.panix.com> - 2018-04-06 20:59 +0000
  Re: Numeric character references in W3C HTML 5.2? HTML scrawler <html@nowhere.invalid> - 2018-04-15 13:51 -0400
    Re: Numeric character references in W3C HTML 5.2? "Jukka K. Korpela" <jukkakk@gmail.com> - 2018-04-15 23:17 +0300
      Re: Numeric character references in W3C HTML 5.2? Thomas 'PointedEars' Lahn <PointedEars@web.de> - 2018-04-15 23:50 +0200
      Re: Numeric character references in W3C HTML 5.2? HTML scrawler <html@nowhere.invalid> - 2018-04-17 19:12 -0400

csiph-web