Re: Why No Supplemental Characters In Character Literals?

Date 2011-02-06 15:38 -0500
From Arne Vajhøj <arne@vajhoej.dk>
Newsgroups comp.lang.java.programmer
Subject Re: Why No Supplemental Characters In Character Literals?
References (6 earlier) <iil0uf$niu$4@lust.ihug.co.nz> <iil226$e6k$1@news.eternal-september.org> <alpine.DEB.1.10.1102061540090.21657@urchin.earth.li> <4d4ec62c$0$23751$14726298@news.sunsite.dk> <alpine.DEB.1.10.1102061834030.21657@urchin.earth.li>
Message-ID <4d4f06b6$0$23753$14726298@news.sunsite.dk> (permalink)
Organization SunSITE.dk - Supporting Open source


On 06-02-2011 13:34, Tom Anderson wrote:
> On Sun, 6 Feb 2011, Arne Vajhøj wrote:
>> On 06-02-2011 10:42, Tom Anderson wrote:
>>> On Sat, 5 Feb 2011, Mike Schilling wrote:
>>>> "Lawrence D'Oliveiro" <ldo@geek-central.gen.new_zealand> wrote in
>>>> message news:iil0uf$niu$4@lust.ihug.co.nz...
>>>>> In message <iiji77$4fe$1@localhost.localdomain>, Martin Gregorie
>>>>> wrote:
>>>>>
>>>>>> Arabic Letters also have different glyphs depending on whether
>>>>>> they are at the start, middle or end of a word or an isolated
>>>>>> letter, though six letters only have isolated and end-of-word
>>>>>> representations. Unicode supports this with a code point for each
>>>>>> representation of each letter.
>>>>>
>>>>> But they are not different characters, they should not have
>>>>> different code points.
>>>>>
>>>>> Assigning different code points greatly complicates basic
>>>>> text-processing tasks like editing and searching.
>>>>
>>>> Different code point for capitals and lower-case letters is equally
>>>> silly.
>>>
>>> Agreed. Uppercase should be a combiner, like an accent. There could be
>>> composed forms of uppercase letters, but there should be a modifier too,
>>> so that when normalised in the right direction, searching and sorting
>>> are simplified.
>>
>> It would make some things a lot easier. But guess the idea
>> is 40-50 years late.
>
> For interchange purposes, yes. But i don't see why you couldn't write a
> string implementation that stored characters this way internally.

24 bits for the value and 8 bits for modifiers, packed into one 32-bit unit.

I guess it could be done.
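
A rough sketch of what that layout might look like, with the low 24 bits of an int holding the case-folded code point and the high 8 bits holding modifier flags. The class name PackedChar, the UPPERCASE flag and its bit position are made up for illustration; nothing like this exists in the JDK, and a real string implementation would also have to deal with titlecase and with characters whose case mapping does not round-trip.

```java
public final class PackedChar {
    // Hypothetical modifier flag: bit 0 of the 8-bit modifier field means "uppercase".
    public static final int UPPERCASE = 0x01;

    private static final int VALUE_MASK = 0x00FFFFFF;    // low 24 bits
    private static final int MODIFIER_MASK = 0xFF;       // 8 bits, stored in the high byte

    private PackedChar() {}

    // Combine a 24-bit value and 8 modifier bits into one int.
    public static int pack(int value, int modifiers) {
        if ((value & ~VALUE_MASK) != 0) {
            throw new IllegalArgumentException("value does not fit in 24 bits");
        }
        if ((modifiers & ~MODIFIER_MASK) != 0) {
            throw new IllegalArgumentException("modifiers do not fit in 8 bits");
        }
        return (modifiers << 24) | value;
    }

    public static int value(int packed) {
        return packed & VALUE_MASK;
    }

    public static int modifiers(int packed) {
        return packed >>> 24;   // unsigned shift, so the top bit is not sign-extended
    }

    // Store an uppercase letter as its lowercase form plus the UPPERCASE flag,
    // but only when the mapping round-trips; everything else is stored unchanged.
    public static int fromCodePoint(int codePoint) {
        int lower = Character.toLowerCase(codePoint);
        if (lower != codePoint && Character.toUpperCase(lower) == codePoint) {
            return pack(lower, UPPERCASE);
        }
        return pack(codePoint, 0);
    }

    // Rebuild the ordinary Unicode code point for interchange.
    public static int toCodePoint(int packed) {
        int v = value(packed);
        return (modifiers(packed) & UPPERCASE) != 0 ? Character.toUpperCase(v) : v;
    }

    // Case-insensitive comparison only has to look at the value bits.
    public static boolean equalsIgnoreCase(int a, int b) {
        return value(a) == value(b);
    }
}
```

Unicode code points go no higher than 0x10FFFF, so 24 bits leaves room to spare, and searching or sorting without regard to case reduces to masking off the top byte.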

Arne
