From: Ben Bacarisse
Newsgroups: comp.lang.python
Subject: Re: How to waste computer memory?
Date: Sun, 20 Mar 2016 14:55:55 +0000
Message-ID: <874mc1mc5g.fsf@bsb.me.uk>

Rustom Mody writes:

> On Sunday, March 20, 2016 at 10:32:07 AM UTC+5:30, Steven D'Aprano wrote:
>> Unicode (the character set part of it) is a set of abstract 23-bit numbers,
>
> 23? Or 21?

It's 21.  The reason being (or at least part of the reason being) that
21 bits can be UTF-8 encoded in 4 bytes: 11110xxx 10xxxxxx 10xxxxxx
10xxxxxx (3 + 3*6).

-- 
Ben.
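
The bit count above can be checked with a few lines of Python. This is only a rough sketch: the helper name utf8_four_byte is made up for illustration, and it handles nothing but the 4-byte form (code points U+10000 to U+10FFFF), with no surrogate or error handling beyond a range check.

```python
def utf8_four_byte(cp):
    """Pack a code point into 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx.

    Hypothetical helper for illustration; covers only U+10000..U+10FFFF.
    """
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("code point is outside the 4-byte UTF-8 range")
    return bytes([
        0xF0 | (cp >> 18),            # 11110xxx: top 3 payload bits
        0x80 | ((cp >> 12) & 0x3F),   # 10xxxxxx: next 6 bits
        0x80 | ((cp >> 6) & 0x3F),    # 10xxxxxx: next 6 bits
        0x80 | (cp & 0x3F),           # 10xxxxxx: low 6 bits
    ])


# 3 payload bits in the lead byte plus 6 in each of 3 continuation bytes.
PAYLOAD_BITS = 3 + 3 * 6

# The largest code point, U+10FFFF, fits in exactly that many bits.
assert PAYLOAD_BITS == 21
assert (0x10FFFF).bit_length() == PAYLOAD_BITS

# The hand-packed bytes agree with Python's own encoder.
assert utf8_four_byte(0x1F600) == chr(0x1F600).encode("utf-8")
```

The last assertion compares the hand-rolled packing against str.encode, so any mistake in the shifts or masks would show up straight away.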