Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.java.programmer > #17026
| From | qwertmonkey@syberianoutpost.ru |
|---|---|
| Newsgroups | comp.lang.java.programmer |
| Subject | retriving escape unicode sequences from files ... |
| Date | 2012-08-03 03:52 +0000 |
| Organization | Aioe.org NNTP Server |
| Message-ID | <jvfhtb$gh$1@speranza.aioe.org> (permalink) |
Why is it that if you save a unicode sequence in a file, say "français"
~
\u0066\u0072\u0061\u006e\u00e7\u0061\u0069\u0073
~
and then retrieve as a String you can't then convert it back to a UTF-8 String
~
As you can test with this piece of code, you can simply declare the String as
a literal one or give it in the command prompt, but retrieving what seems to be
the same sequence of characters (as they print to standard out) from a file
doesn't seem to work
~
import java.io.ByteArrayOutputStream;
import java.io.PrintStream;
import java.io.UnsupportedEncodingException;
import java.io.IOException;
// __
public class UniKdEnk00Test{
private static final String aNWLn = System.getProperty("line.separator");
// __
public static void main (String[] aArgs){
try{
// __
if((aArgs == null) || (aArgs.length != 1)){ throw new IOException(aNWLn +
"// __ usage:" + aNWLn + aNWLn +
" java UniKdEnk00Test \\u0066\\u0072\\u0061\\u006e\\u00e7\\u0061\\u0069\\u0073"
+ aNWLn); }
String aUniKdEnk = "\u0066\u0072\u0061\u006e\u00e7\u0061\u0069\u0073";
byte[] bAr = aUniKdEnk.getBytes("UTF-8");
ByteArrayOutputStream BOS = new ByteArrayOutputStream();
BOS.write(bAr, 0, bAr.length);
String aUTF8L = new String(BOS.toByteArray(), "UTF-8");
System.out.println(aUTF8L);
BOS.reset();
}catch(UnsupportedEncodingException UEncX){ UEncX.printStackTrace(); }
catch(IOException IOX) { IOX.printStackTrace(); }
// __
}
}
~
lbrtchx
comp.lang.java.programmer: escape unicode sequences in files ...
Back to comp.lang.java.programmer | Previous | Next — Next in thread | Find similar | Unroll thread
retriving escape unicode sequences from files ... qwertmonkey@syberianoutpost.ru - 2012-08-03 03:52 +0000
Re: retriving escape unicode sequences from files ... markspace <-@.> - 2012-08-02 21:00 -0700
Re: retriving escape unicode sequences from files ... Roedy Green <see_website@mindprod.com.invalid> - 2012-08-03 01:35 -0700
Re: retriving escape unicode sequences from files ... glen herrmannsfeldt <gah@ugcs.caltech.edu> - 2012-08-03 23:23 +0000
Re: retriving escape unicode sequences from files ... Arne Vajhøj <arne@vajhoej.dk> - 2012-08-03 20:37 -0400
Re: retriving escape unicode sequences from files ... Daniel Pitts <newsgroup.nospam@virtualinfinity.net> - 2012-08-03 20:49 -0700
Re: retriving escape unicode sequences from files ... markspace <-@.> - 2012-08-03 21:13 -0700
Re: retriving escape unicode sequences from files ... Lew <lewbloch@gmail.com> - 2012-08-03 23:45 -0700
Re: retriving escape unicode sequences from files ... Arne Vajhøj <arne@vajhoej.dk> - 2012-08-06 22:32 -0400
csiph-web