Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.java.programmer > #25557
| Date | 2011-02-06 15:38 -0500 |
|---|---|
| From | Arne Vajhøj <arne@vajhoej.dk> |
| Newsgroups | comp.lang.java.programmer |
| Subject | Re: Why No Supplemental Characters In Character Literals? |
| References | (6 earlier) <iil0uf$niu$4@lust.ihug.co.nz> <iil226$e6k$1@news.eternal-september.org> <alpine.DEB.1.10.1102061540090.21657@urchin.earth.li> <4d4ec62c$0$23751$14726298@news.sunsite.dk> <alpine.DEB.1.10.1102061834030.21657@urchin.earth.li> |
| Message-ID | <4d4f06b6$0$23753$14726298@news.sunsite.dk> (permalink) |
| Organization | SunSITE.dk - Supporting Open source |
On 06-02-2011 13:34, Tom Anderson wrote: > On Sun, 6 Feb 2011, Arne Vajhøj wrote: >> On 06-02-2011 10:42, Tom Anderson wrote: >>> On Sat, 5 Feb 2011, Mike Schilling wrote: >>>> "Lawrence D'Oliveiro" <ldo@geek-central.gen.new_zealand> wrote in >>>> message news:iil0uf$niu$4@lust.ihug.co.nz... >>>>> In message <iiji77$4fe$1@localhost.localdomain>, Martin Gregorie >>>>> wrote: >>>>> >>>>>> Arabic Letters also have different glyphs depending on whether >>>>>> they are at the start, middle or end of a word or an isolated >>>>>> letter, though six letters only have isolated and end-of-word >>>>>> representations. Unicode supports this with a code point for each >>>>>> representation of each letter. >>>>> >>>>> But they are not different characters, they should not have >>>>> different code points. >>>>> >>>>> Assigning different code points greatly complicates basic >>>>> text-processing tasks like editing and searching. >>>> >>>> Different code point for capitals and lower-case letters is equally >>>> silly. >>> >>> Agreed. Uppercase should be a combiner, like an accent. There could be >>> composed forms of uppercase letters, but there should be a modifier too, >>> so that when normalised in the right direction, searching and sorting >>> are simplified. >> >> It would make some things a lot easier. But guess the idea >> is 40-50 years late. > > For interchange purposes, yes. But i don't see why you couldn't write a > string implementation that stored characters this way internally. 24 bit for values and 8 bit for modifiers. I guess it could be done. Arne
Back to comp.lang.java.programmer | Previous | Next — Previous in thread | Next in thread | Find similar
Re: Why No Supplemental Characters In Character Literals? Lawrence D'Oliveiro <ldo@geek-central.gen.new_zealand> - 2011-02-05 11:45 +1300
Re: Why No Supplemental Characters In Character Literals? Arne Vajhøj <arne@vajhoej.dk> - 2011-02-06 15:38 -0500
Re: Why No Supplemental Characters In Character Literals? "javax.swing.JSnarker" <gharriman@boojum.mit.edu> - 2011-02-05 12:59 -0500
Re: Why No Supplemental Characters In Character Literals? Tom Anderson <twic@urchin.earth.li> - 2011-02-06 15:42 +0000
Re: Why No Supplemental Characters In Character Literals? Tom Anderson <twic@urchin.earth.li> - 2011-02-06 18:34 +0000
Re: Why No Supplemental Characters In Character Literals? Arne Vajhøj <arne@vajhoej.dk> - 2011-02-04 18:17 -0500
Re: Why No Supplemental Characters In Character Literals? Arne Vajhøj <arne@vajhoej.dk> - 2011-02-09 20:12 -0500
Re: Why No Supplemental Characters In Character Literals? Lawrence D'Oliveiro <ldo@geek-central.gen.new_zealand> - 2011-02-05 12:54 +1300
csiph-web