Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.sys.atari.st > #1452

Re: ASCII files

From Arachide <houten.van@orange.fr>
Newsgroups comp.sys.atari.st
Subject Re: ASCII files
Date 2016-03-27 17:23 +0200
Organization Aioe.org NNTP Server
Message-ID <nd8tu5$14bk$1@gioia.aioe.org> (permalink)
References <ncu5th$308$1@dont-email.me>

Show all headers | View raw


Le 23/03/2016 14:35, Henk Robbers a écrit :
>
> LS
>
> I want to determine whether a file should not be
> considered ASCII text based on the percentage of non printable
> characters in the file.
>
> Which percentage is generally accepted?

It depends on the language I think.
In standard english, you don't use characters above 128. But in french 
(and others), there a a lot of letters with accents that appear above 128.

So, if you consider that the file is in english, you should have a 
maximum of:

- CR/LF (13 and 10)
- TAB (9)
- Space (32)
- characters from 33 to 127.

If the file is in another language, some characters above 128.

In my opinion, the marker for a text file would be the total lack of 
bytes under 32 except 9/10/13.

Guillaume.

Back to comp.sys.atari.st | Previous | NextPrevious in thread | Find similar | Unroll thread


Thread

ASCII files Henk Robbers <h.robbers@chello.nl> - 2016-03-23 14:35 +0100
  Re: ASCII files Miro Kropáček <miro.kropacek@gmail.com> - 2016-03-27 08:15 -0700
  Re: ASCII files Arachide <houten.van@orange.fr> - 2016-03-27 17:23 +0200

csiph-web