| From | Johnny Billquist <bqt@softjar.se> |
|---|---|
| Newsgroups | comp.os.linux.misc, alt.folklore.computers |
| Subject | Re: Recent history of vi |
| Date | 2025-12-17 10:39 +0100 |
| Organization | MGT Consulting |
| Message-ID | <10httos$7mf$2@news.misty.com> (permalink) |
| References | (8 earlier) <slrn10ik3r5.2dppt.als@mordor.angband.thangorodrim.de> <10gpatq$jpt$3@news.misty.com> <10gpi2s$3cij2$2@dont-email.me> <10hoktv$o38$1@news.misty.com> <10hqfll$2g7ff$1@paganini.bofh.team> |
On 2025-12-16 03:20, Waldek Hebisch wrote:
> In alt.folklore.computers Johnny Billquist <bqt@softjar.se> wrote:
> <snip>
>>> The biggest problem I have with any Unicode representation except (I
>>> think) UTF-32 is that a program has no way of knowing how long a string
>>> is without encoding/decoding it. Given a string of characters in some
>>> codepage, how many bytes does it occupy when converted to UTF-8? Given a
>>> UTF-8 character string, how many character positions does it occupy,
>>> say, for example, when displayed on a screen?
>>
>> True. However, that has nothing to do with Unicode as such, but the
>> UTF-8 encoding of it.
>
> Unicode has combining "characters", so to know how many "real"
> characters you have you need to combine. IIUC for Korean Hangul a
> character can be built from 3 separate pieces, each taking one code
> point, but there are also "precomposed" combinations taking a
> single code point. My reading of the description is that the 3 pieces
> version and the precomposed one are supposed to display the same.
>
> There are also code points for ligatures; for most purposes the ligature
> 'fi' counts as two characters, but is a single code point. A terminal
> may display it in a single cell, but arguably for a nice monospaced
> display one should expand ligatures. For display we have single
> cell characters and double width ones, so to know the width one needs
> at least a table giving the width of each code point and to add the widths
> of all code points.

Excellent points.

  Johnny
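
To make the distinction between bytes, code points and screen cells concrete, here is a minimal Python sketch using only the standard `unicodedata` module. The helper names `utf8_len` and `display_width` are hypothetical, invented for this illustration. The width rule (two cells for East Asian Wide/Fullwidth, zero for combining marks, one otherwise, after NFC normalization) is a simplifying assumption and not what any particular terminal is guaranteed to do.

```python
import unicodedata


def utf8_len(s: str) -> int:
    """Number of bytes the string occupies when encoded as UTF-8."""
    return len(s.encode("utf-8"))


def display_width(s: str) -> int:
    """Rough count of terminal cells (assumed rule, see the text above).

    The string is composed with NFC first, so that a Hangul syllable
    written as three separate jamo counts the same as the precomposed form.
    """
    width = 0
    for ch in unicodedata.normalize("NFC", s):
        # Combining marks and format characters take no cell of their own.
        if unicodedata.combining(ch) or unicodedata.category(ch) in ("Mn", "Me", "Cf"):
            continue
        # Wide and Fullwidth characters take two cells, everything else one.
        width += 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
    return width


precomposed = "\ud55c"                                  # Hangul syllable, one code point
decomposed = unicodedata.normalize("NFD", precomposed)  # three jamo code points
ligature = "\ufb01"                                     # LATIN SMALL LIGATURE FI

for label, text in (("precomposed", precomposed),
                    ("decomposed", decomposed),
                    ("ligature", ligature)):
    print(label, len(text), utf8_len(text), display_width(text))

# Compatibility normalization (NFKC) expands the ligature into two letters.
print(unicodedata.normalize("NFKC", ligature))
```

The three numbers printed per line are code points, UTF-8 bytes and estimated cells; the two Hangul forms differ in the first two but agree in the last, which is the point made in the quoted text.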