Path: csiph.com!x330-a1.tempe.blueboxinc.net!usenet.pasdenom.info!gegeweb.org!de-l.enfer-du-nord.net!feeder1.enfer-du-nord.net!tudelft.nl!txtfeed1.tudelft.nl!multikabel.net!newsfeed10.multikabel.net!xlned.com!feeder7.xlned.com!news2.euro.net!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.001 X-Spam-Evidence: '*H*': 1.00; '*S*': 0.00; 'situation.': 0.04; 'subject:Python': 0.05; 'ascii': 0.07; 'received:verizon.net': 0.07; 'terry': 0.07; 'length.': 0.09; 'received:80.91': 0.09; 'received:80.91.229': 0.09; 'received:gmane.org': 0.09; 'received:list': 0.09; 'transcoding': 0.09; 'utf-8': 0.09; 'am,': 0.12; '*before*': 0.16; '100-200': 0.16; '80s': 0.16; 'ascii,': 0.16; 'computation': 0.16; 'ebcdic': 0.16; 'ebcdic,': 0.16; 'labeled': 0.16; 'reedy': 0.16; 'roy': 0.16; 'set,': 0.16; 'subject:usage': 0.16; 'language': 0.16; 'wrote:': 0.18; 'jan': 0.19; 'header:In-Reply-To:1': 0.22; 'translation': 0.23; 'basis,': 0.23; 'interpreted': 0.23; 'subject:numbers': 0.23; 'byte': 0.24; 'code': 0.26; 'discussing': 0.29; 'unicode': 0.29; 'universal': 0.29; 'connected': 0.29; 'for,': 0.30; 'sets.': 0.30; 'translator': 0.30; 'least': 0.30; "didn't": 0.30; 'actually': 0.31; 'version': 0.32; 'opposed': 0.32; 'pure': 0.32; 'idea': 0.32; 'there': 0.33; 'header:User-Agent:1': 0.33; 'it.': 0.33; 'header:X-Complaints-To:1': 0.34; 'character': 0.34; 'realize': 0.34; 'rather': 0.34; 'was,': 0.34; 'to:addr:python-list': 0.35; 'however,': 0.35; 'something': 0.35; 'sets': 0.35; 'starting': 0.36; 'received:org': 0.36; 'else,': 0.37; 'encoding': 0.37; 'but': 0.37; 'machine': 0.37; 'replace': 0.38; 'could': 0.38; 'think': 0.38; 'should': 0.38; 'plain': 0.39; 'to:addr:python.org': 0.40; 'dedicated': 0.61; 'personal': 0.61; 'kind': 0.62; 'life,': 0.64; 'records,': 0.64; 'computers': 0.64; 'card': 0.65; 'importantly,': 0.67; 'nation': 0.67; 'worldwide,': 0.67; 'business': 0.70; 'american': 0.70; 'grew': 0.73; 'decades,': 0.84; 'packed': 0.84; 'repeat.': 0.84; 'pc.': 0.96 X-Injected-Via-Gmane: http://gmane.org/ To: python-list@python.org From: Terry Reedy Subject: Re: Python usage numbers Date: Sun, 12 Feb 2012 17:07:44 -0500 References: <4F36E2F5.9000505@gmail.com> <4f37229b$0$29986$c3e8da3$5496439d@news.astraweb.com> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Gmane-NNTP-Posting-Host: pool-74-109-121-73.phlapa.fios.verizon.net User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:8.0) Gecko/20111105 Thunderbird/8.0 In-Reply-To: X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.12 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 46 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1329084478 news.xs4all.nl 6939 [2001:888:2000:d::a6]:42994 X-Complaints-To: abuse@xs4all.nl Xref: x330-a1.tempe.blueboxinc.net comp.lang.python:20296 On 2/12/2012 10:13 AM, Roy Smith wrote: > Exactly.. ASCII was so successful > at becoming a universal standard which lasted for decades, I think you are overstating the universality and length. I used a machine in the 1970s with 60-bit words that could be interpreted as 10 6-bit characters. IBM used EBCDIC at least into the 1980s. The UCLA machine I used had a translator for ascii terminals that connected by modems. I remember discussing the translation table with the man in charge of it. Dedicated wordprocessing machines of the 70s and 80s *had* to use something other than plain ascii, as it is inadequate for business text, as opposed to pure computation and labeled number tables. Whether they used extended ascii or something else, I have no idea. Ascii was, however, as far as I know, the universal basis for the new personal computers starting about 1975, and most importantly, for the IBM PC. But even that actually used its version of extended ascii, as did each wordprocessing program. > people who > grew up with it don't realize there was once any other way. Not just > EBCDIC, but also SIXBIT, RAD-50, tilt/rotate, packed card records, > and so on. Transcoding was a way of life, and if you didn't know what > you were starting with and aiming for, it was hopeless. But because of the limitation of ascii on a worldwide, as opposed to American basis, we ended up with 100-200 codings for almost as many character sets. This is because the idea of ascii was applied by each nation or language group individually to their local situation. > Kind of like now where we are again with Unicode. The situation before ascii is like where we ended up *before* unicode. Unicode aims to replace all those byte encoding and character sets with *one* byte encoding for *one* character set, which will be a great simplification. It is the idea of ascii applied on a global rather that local basis. Let me repeat. Unicode and utf-8 is a solution to the mess, not the cause. Perhaps we should have a synonym for utf-8: escii, for Earthian Standard Code for Information Interchange. -- Terry Jan Reedy