Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.java.programmer > #7853

Re: ascii char 26

From Joshua Cranmer <Pidgeot18@verizon.invalid>
Newsgroups comp.lang.java.programmer
Subject Re: ascii char 26
Date 2011-09-11 21:25 -0500
Organization A noiseless patient Spider
Message-ID <j4jqkk$5f0$1@dont-email.me> (permalink)
References <16f8836c-27b9-483b-a71f-61d7d6cfd188@i2g2000yqm.googlegroups.com> <j4jakd$dfl$1@dont-email.me> <63554bdb-dab4-43e7-b809-5128fd831f3c@m38g2000vbn.googlegroups.com>

Show all headers | View raw


On 9/11/2011 9:12 PM, bob wrote:
> You're right.  I messed up, and it was the em dash.  It turned into 26
> after going thru 'b = html.getBytes("US-ASCII");'
>
> Here's the new code:

Hardcoding a list of tables is generally not a good thing; in 
particular, I don't think it's going to solve your problems. I have seen 
sites that use the Unicode ff and fi ligatures instead of relying on 
fonts to automatically pick up on that as well.

If I may ask, why do you need to convert the string to US-ASCII as 
opposed to UTF-8? That is going to cause major issues for the ~90% of 
the world that doesn't speak English as their main language.

> Also, I'm on Android 2.1, so import java.text.Normalizer; doesn't
> work.

It shouldn't be that hard to find other Java Unicode normalization 
libraries out there.

-- 
Beware of bugs in the above code; I have only proved it correct, not 
tried it. -- Donald E. Knuth

Back to comp.lang.java.programmer | Previous | NextPrevious in thread | Next in thread | Find similar


Thread

ascii char 26 bob <bob@coolgroups.com> - 2011-09-11 14:33 -0700
  Re: ascii char 26 Arne Vajhøj <arne@vajhoej.dk> - 2011-09-11 17:48 -0400
  Re: ascii char 26 Joshua Cranmer <Pidgeot18@verizon.invalid> - 2011-09-11 16:52 -0500
    Re: ascii char 26 Eric Sosman <esosman@ieee-dot-org.invalid> - 2011-09-11 18:28 -0400
    Re: ascii char 26 bob <bob@coolgroups.com> - 2011-09-11 19:12 -0700
      Re: ascii char 26 Joshua Cranmer <Pidgeot18@verizon.invalid> - 2011-09-11 21:25 -0500
        Re: ascii char 26 bob <bob@coolgroups.com> - 2011-09-12 01:30 -0700
  Re: ascii char 26 Roedy Green <see_website@mindprod.com.invalid> - 2011-09-11 15:25 -0700
  Re: ascii char 26 Bent C Dalager <bcd@pvv.ntnu.no> - 2011-09-11 23:18 +0000
    Re: ascii char 26 Joshua Cranmer <Pidgeot18@verizon.invalid> - 2011-09-11 18:37 -0500
    Re: ascii char 26 Retahiv Oopsiscame <roopsisc@gmail.com> - 2011-09-11 16:53 -0700
      Re: ascii char 26 Roedy Green <see_website@mindprod.com.invalid> - 2011-09-14 11:55 -0700

csiph-web