Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.java.programmer > #17083

retriving escape unicode sequences from files ...

From "qwertmonkey" <qwertmonkey@1:261/38.remove-yy0-this>
Subject retriving escape unicode sequences from files ...
Message-ID <501C1568.56042.calajapr@time.synchro.net> (permalink)
Newsgroups comp.lang.java.programmer
Date 2012-08-03 18:54 +0000
Organization tds.net

Show all headers | View raw


From: qwertmonkey@syberianoutpost.ru

 Why is it that if you save a unicode sequence in a file, say "frantais"
~
\u0066\u0072\u0061\u006e\u00e7\u0061\u0069\u0073
~
 and then retrieve as a String you can't then convert it back to a UTF-8 String
~
 As you can test with this piece of code, you can simply declare the String as
a literal one or give it in the command prompt, but retrieving what seems to be 
the same sequence of characters (as they print to standard out) from a file 
doesn't seem to work
~
import java.io.ByteArrayOutputStream; import java.io.PrintStream;
import java.io.UnsupportedEncodingException; import java.io.IOException;

// __
public class UniKdEnk00Test{
 private static final String aNWLn = System.getProperty("line.separator");
// __
 public static void main (String[] aArgs){
  try{
// __
   if((aArgs == null) ||  (aArgs.length != 1)){ throw new IOException(aNWLn +
"// __ usage:" + aNWLn + aNWLn +
" java UniKdEnk00Test \\u0066\\u0072\\u0061\\u006e\\u00e7\\u0061\\u0069\\u0073"
+ aNWLn);  }
   String aUniKdEnk = "\u0066\u0072\u0061\u006e\u00e7\u0061\u0069\u0073";
   byte[] bAr = aUniKdEnk.getBytes("UTF-8");
   ByteArrayOutputStream BOS = new ByteArrayOutputStream();
   BOS.write(bAr, 0, bAr.length);
   String aUTF8L = new String(BOS.toByteArray(), "UTF-8");
   System.out.println(aUTF8L);
   BOS.reset();
  }catch(UnsupportedEncodingException UEncX){ UEncX.printStackTrace(); }
    catch(IOException IOX) { IOX.printStackTrace(); }
// __
 }
}
~
 lbrtchx
 comp.lang.java.programmer: escape unicode sequences in files ...

--- BBBS/Li6 v4.10 Dada-1
 * Origin: Prism bbs (1:261/38)
--- Synchronet 3.16a-Win32 NewsLink 1.98
Time Warp of the Future BBS - telnet://time.synchro.net:24

Back to comp.lang.java.programmer | Previous | NextNext in thread | Find similar | Unroll thread


Thread

retriving escape unicode sequences from files ... "qwertmonkey" <qwertmonkey@1:261/38.remove-yy0-this> - 2012-08-03 18:54 +0000
  Re: retriving escape unicode sequences from files ... "markspace" <markspace@1:261/38.remove-yy0-this> - 2012-08-03 18:54 +0000
  Re: retriving escape unicode sequences from files ... "Roedy Green" <roedy.green@1:261/38.remove-yy0-this> - 2012-08-03 18:54 +0000
  Re: retriving escape unicode sequences from files ... "glen herrmannsfeldt" <glen.herrmannsfeldt@1:261/38.remove-5qr-this> - 2012-08-04 18:41 +0000
  Re: retriving escape unicode sequences from files ... "Arne Vajhøj" <arne.vajhøj@1:261/38.remove-5qr-this> - 2012-08-04 18:41 +0000
    Re: retriving escape unicode sequences from files ... "Daniel Pitts" <daniel.pitts@1:261/38.remove-5qr-this> - 2012-08-04 18:41 +0000
      Re: retriving escape unicode sequences from files ... "markspace" <markspace@1:261/38.remove-5qr-this> - 2012-08-04 18:41 +0000
        Re: retriving escape unicode sequences from files ... "Lew" <lew@1:261/38.remove-5qr-this> - 2012-08-04 18:41 +0000
      Re: retriving escape unicode sequences from files ... "Arne Vajhøj" <arne.vajhøj@1:261/38.remove-p82-this> - 2012-08-08 06:20 +0000

csiph-web