Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.java.programmer > #7844
| Path | csiph.com!x330-a1.tempe.blueboxinc.net!newsfeed.hal-mli.net!feeder3.hal-mli.net!nx01.iad01.newshosting.com!newshosting.com!news-out.readnews.com!transit3.readnews.com!postnews.google.com!fi7g2000vbb.googlegroups.com!not-for-mail |
|---|---|
| From | Retahiv Oopsiscame <roopsisc@gmail.com> |
| Newsgroups | comp.lang.java.programmer |
| Subject | Re: ascii char 26 |
| Date | Sun, 11 Sep 2011 16:53:33 -0700 (PDT) |
| Organization | http://groups.google.com |
| Lines | 29 |
| Message-ID | <178dad1e-e924-4563-9685-3261752e7c35@fi7g2000vbb.googlegroups.com> (permalink) |
| References | <16f8836c-27b9-483b-a71f-61d7d6cfd188@i2g2000yqm.googlegroups.com> <slrnj6qger.r1i.bcd@microbel.pvv.ntnu.no> |
| NNTP-Posting-Host | 74.12.180.15 |
| Mime-Version | 1.0 |
| Content-Type | text/plain; charset=ISO-8859-1 |
| Content-Transfer-Encoding | quoted-printable |
| X-Trace | posting.google.com 1315785298 10798 127.0.0.1 (11 Sep 2011 23:54:58 GMT) |
| X-Complaints-To | groups-abuse@google.com |
| NNTP-Posting-Date | Sun, 11 Sep 2011 23:54:58 +0000 (UTC) |
| Complaints-To | groups-abuse@google.com |
| Injection-Info | fi7g2000vbb.googlegroups.com; posting-host=74.12.180.15; posting-account=PUEIwAoAAAD1lZSxxf6T_cZH5_1bHifI |
| User-Agent | G2/1.0 |
| X-Google-Web-Client | true |
| X-Google-Header-Order | HUALESNKRC |
| X-HTTP-UserAgent | Mozilla/5.0 (Windows NT 6.0; rv:6.0.2) Gecko/20100101 Firefox/6.0.2,gzip(gfe) |
| Xref | x330-a1.tempe.blueboxinc.net comp.lang.java.programmer:7844 |
Show key headers only | View raw
On Sep 11, 7:18 pm, Bent C Dalager <b...@pvv.ntnu.no> wrote: > On 2011-09-11, bob <b...@coolgroups.com> wrote: > > > Anyone know why ASCII char 26 is used in place of a hyphen in UTF-8? > > Unicode has multiple different hyphens and hyphen-like characters. > > The traditional ASCII hyphen is the Unicode "hyphen-minus" which > encodes to 0x2d in utf-8. > > http://www.fileformat.info/info/unicode/char/2d/index.htmsuggests the > following additional hyphen-like characters that you may actually be > working with in your string, and that will probably be mapped to 26 in > your case: > > hyphen U+2010 > non-breaking hyphen U+2011 > figure dash U+2012 > en dash U+2013 > minus sign U+2212 > roman uncia sign U+10191 Wow, what a mess! > One would tend to think there ought to be a library function somewhere > to convert a unicode string to ASCII-supported variants of its various > characters where possible, Indeed.
Back to comp.lang.java.programmer | Previous | Next — Previous in thread | Next in thread | Find similar
ascii char 26 bob <bob@coolgroups.com> - 2011-09-11 14:33 -0700
Re: ascii char 26 Arne Vajhøj <arne@vajhoej.dk> - 2011-09-11 17:48 -0400
Re: ascii char 26 Joshua Cranmer <Pidgeot18@verizon.invalid> - 2011-09-11 16:52 -0500
Re: ascii char 26 Eric Sosman <esosman@ieee-dot-org.invalid> - 2011-09-11 18:28 -0400
Re: ascii char 26 bob <bob@coolgroups.com> - 2011-09-11 19:12 -0700
Re: ascii char 26 Joshua Cranmer <Pidgeot18@verizon.invalid> - 2011-09-11 21:25 -0500
Re: ascii char 26 bob <bob@coolgroups.com> - 2011-09-12 01:30 -0700
Re: ascii char 26 Roedy Green <see_website@mindprod.com.invalid> - 2011-09-11 15:25 -0700
Re: ascii char 26 Bent C Dalager <bcd@pvv.ntnu.no> - 2011-09-11 23:18 +0000
Re: ascii char 26 Joshua Cranmer <Pidgeot18@verizon.invalid> - 2011-09-11 18:37 -0500
Re: ascii char 26 Retahiv Oopsiscame <roopsisc@gmail.com> - 2011-09-11 16:53 -0700
Re: ascii char 26 Roedy Green <see_website@mindprod.com.invalid> - 2011-09-14 11:55 -0700
csiph-web