Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!news.glorb.com!npeer03.iad.highwinds-media.com!news.highwinds-media.com!feed-me.highwinds-media.com!post02.iad!not-for-mail From: lbrt chx _ gemale Newsgroups: comp.lang.java.programmer Subject: number of bytes for each (uni)code point while using utf-8 as encoding ... X-Newsreader: NetComponents Organization: Acecape, Inc. Organization: Newshosting.com - Highest quality at a great price! www.newshosting.com X-Complaints-To: abuse(at)newshosting.com Message-ID: <1341965282.664308@nntp.aceinnovative.com> Cache-Post-Path: nntp.aceinnovative.com!unknown@p70-44.acedsl.com X-Cache: nntpcache 3.0.1 (see http://www.nntpcache.org/) Date: 11 Jul 2012 00:08:02 GMT Lines: 8 X-Received-Bytes: 1074 Xref: csiph.com comp.lang.java.programmer:15933 ~ I obviously and I would say -very clearly- meant a -file's encoding- is either incorrectly set by authors or is corrupted in transit. (I never said anything about failing disks ...) ~ Sometimes we technical people sound like lawyers/politicians trying to correct peoples' minds and/or trying to prove something to one self ~ What I asked is an entirely technical question, namely; how to get the length of the sequence of bytes defining a code point ~ lbrtchx