From: Chris Angelico
Newsgroups: comp.lang.python
Subject: Re: The Cost of Dynamism
Date: Sun, 13 Mar 2016 08:18:44 +1100

On Sun, Mar 13, 2016 at 6:24 AM, Thomas 'PointedEars' Lahn wrote:
> Marko Rauhamaa wrote:
>
>> […] HTML markup is all ASCII.
>
> Wrong. I am creating HTML documents whose source code contains Unicode
> characters every day.
>
> Also, the two of you fail to differentiate between US-ASCII, a 7-bit
> character encoding, and 8-bit or longer encodings which can *also* encode
> characters that can be *encoded with* US-ASCII.

Where are the non-ASCII characters in your HTML documents?
Are they in the *markup* of HTML, or in the *text*? This is the
difference. And I'm not conflating those two. When I say ASCII, I am
referring to the 128 characters that have Unicode codepoints U+0000
through U+007F.

ChrisA
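
To make the markup/text distinction concrete, here is a rough sketch using the standard library's html.parser. The names is_ascii and MarkupTextSplitter and the sample document are made up for illustration, and counting attribute values as "text" rather than "markup" is my own assumption, not something settled in this thread.

```python
from html.parser import HTMLParser


def is_ascii(s):
    """True if every character lies in U+0000 through U+007F."""
    return all(ord(ch) <= 0x7F for ch in s)


class MarkupTextSplitter(HTMLParser):
    """Hypothetical helper: sorts the pieces of a document into two bins."""

    def __init__(self):
        super().__init__()
        self.markup = []  # tag names and attribute names
        self.text = []    # character data and attribute values (assumption)

    def handle_starttag(self, tag, attrs):
        self.markup.append(tag)
        for name, value in attrs:
            self.markup.append(name)
            if value is not None:
                self.text.append(value)

    def handle_endtag(self, tag):
        self.markup.append(tag)

    def handle_data(self, data):
        self.text.append(data)


# Sample document: non-ASCII appears only in the content, not in the tags.
doc = '<p title="naïve">Grüße, <b>wörld</b> …</p>'

splitter = MarkupTextSplitter()
splitter.feed(doc)
splitter.close()

markup_is_ascii = all(is_ascii(m) for m in splitter.markup)
text_is_ascii = all(is_ascii(t) for t in splitter.text)

print("markup is ASCII:", markup_is_ascii)
print("text is ASCII:  ", text_is_ascii)
```

For this sample the tag and attribute names all fall within U+0000 through U+007F, while the text and the attribute value do not. It only checks one document, of course; it says nothing about what the HTML specification permits in element or attribute names.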