Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.java.programmer > #22466

Re: offsets in a FileChannel ...

From Robert Klemme <shortcutter@googlemail.com>
Newsgroups comp.lang.java.programmer
Subject Re: offsets in a FileChannel ...
Date 2013-02-23 15:39 +0100
Message-ID <aos2kgFqnonU1@mid.individual.net> (permalink)
References <kgain5$4pl$1@speranza.aioe.org>

Show all headers | View raw


On 23.02.2013 15:11, qwertmonkey@syberianoutpost.ru wrote:
>   What is missing in this code snippet to get the offsets in the underlying
> FileChannel on which the MappedByteBuffer and then the CharBuffer are built?
> ~
>   CharBuffer.position() gives you the position alright, but how about wanting
> to get the actual offset of certain characters in the actual data feed exposed
> through the FileInputStream?
> ~
>       char c;
>       long lPsx;
>       FIS = new FileInputStream(IFl);
>       FileChannel FlChnl = FIS.getChannel();
>       MappedByteBuffer MptbChnlBfr = FlChnl.map(FileChannel.MapMode.READ_ONLY,
> 0, FlChnl.size());
>       CharBuffer cBfrUTF8 = ChrStDkdr.decode(MptbChnlBfr);
> // __
>       while(cBfrUTF8.hasRemaining()){
>        c = cBfrUTF8.get();
>        lPsx = cBfrUTF8.position();
>        System.err.println("// __ |" + lPsx + "|" + c + "|" + (int)c + "|");
>       }
> // __
>       FlChnl.close();
>       FIS.close();
> ~
>   Or do you know of any other way to basically do the same thing?

UTF8 is not an encoding with a fixed width.  You would have to create 
more complex code if you want to align char position and byte position. 
  Basically you need to read the file from the beginning and observe the 
width of every char as it is being decoded.  You could of course apply 
heuristics if you have more knowledge about the file but I guess that 
soon gets messy.

Cheers

	robert

-- 
remember.guy do |as, often| as.you_can - without end
http://blog.rubybestpractices.com/

Back to comp.lang.java.programmer | Previous | NextPrevious in thread | Next in thread | Find similar | Unroll thread


Thread

offsets in a FileChannel ... qwertmonkey@syberianoutpost.ru - 2013-02-23 14:11 +0000
  Re: offsets in a FileChannel ... Robert Klemme <shortcutter@googlemail.com> - 2013-02-23 15:39 +0100
    Re: offsets in a FileChannel ... Roedy Green <see_website@mindprod.com.invalid> - 2013-02-25 04:09 -0800
      Re: offsets in a FileChannel ... Robert Klemme <shortcutter@googlemail.com> - 2013-02-25 21:50 +0100
        Re: offsets in a FileChannel ... Roedy Green <see_website@mindprod.com.invalid> - 2013-02-26 08:05 -0800

csiph-web