Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!eternal-september.org!feeder.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: Marko Rauhamaa Newsgroups: comp.lang.python Subject: Re: Python 3.2 has some deadly infection Date: Fri, 06 Jun 2014 20:02:47 +0300 Organization: A noiseless patient Spider Lines: 29 Message-ID: <8761ke55u0.fsf@elektro.pacujo.net> References: <538C5BB8.1020702@chamonix.reportlab.co.uk> <538f1a61$0$29978$c3e8da3$5496439d@news.astraweb.com> <53902bb1$0$11109$c3e8da3@news.astraweb.com> <87wqcvu20h.fsf@elektro.pacujo.net> <7b3543f6-6f62-49c5-abdc-e2783fd6d629@googlegroups.com> <87oay7tnxt.fsf@elektro.pacujo.net> <87tx7z5hvw.fsf@elektro.pacujo.net> <87egz25dsd.fsf@elektro.pacujo.net> <87a99q5a08.fsf@elektro.pacujo.net> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Info: mx05.eternal-september.org; posting-host="ff5cf27ef3d5b31f034d3b72bdc27a41"; logging-data="5294"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/PBkk8wmYOd+d2WyEFzVen" User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (gnu/linux) Cancel-Lock: sha1:Z2nEXXKnzKhuH4eYHo3mu3bvtIo= sha1:tMgOi6XL8X/1T+zXBtf9eqVKQ8k= Xref: csiph.com comp.lang.python:72862 Chris Angelico : > "ASCII" means two things: Firstly, it's a mapping from the letter A to > the number 65, from the exclamation mark to 33, from the backslash to > 92, and so on. And secondly, it's an encoding of those numbers into > the lowest seven bits of a byte, with the high byte left clear. > Between those two, you get a means of representing the letter 'A' as > the byte 0x41, and one of them is an encoding. The American Standard Code for Information Interchange [...] is a character-encoding scheme [...] > "Unicode", on the other hand, is only the first part. It maps all the > same characters to the same numbers that ASCII does, and then adds a > few more... a few followed by a few, followed by... okay, quite a lot > more. Unicode specifies that the character OK HAND SIGN, which looks > like 👌 if you have the right font, is number 1F44C in hex (128076 > decimal). This is the "Universal Character Set" or UCS. Unicode is a computing industry standard for the consistent encoding, representation and handling of text [...] Each standard assigns numbers to letters and other symbols. In a word, each is a code. That's what their names say, too. Marko