Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!xlned.com!feeder3.xlned.com!newsfeed.xs4all.nl!newsfeed1.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.005 X-Spam-Evidence: '*H*': 0.99; '*S*': 0.00; 'encoding': 0.05; 'allowed.': 0.07; 'utf-8': 0.07; 'cc:addr:python-list': 0.11; 'jan': 0.12; 'from:addr:rosuav': 0.16; 'from:name:chris angelico': 0.16; 'marker': 0.16; 'sat,': 0.16; 'wrote:': 0.18; 'cc:addr:python.org': 0.22; 'cc:2**0': 0.24; 'defined': 0.27; 'header:In-Reply-To:1': 0.27; 'am,': 0.29; 'message- id:@mail.gmail.com': 0.30; 'subject:the': 0.34; 'subject:from': 0.34; 'received:google.com': 0.35; 'there': 0.35; 'scheme': 0.36; 'skip:u 10': 0.60; 'to:none': 0.92 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:cc :content-type; bh=IcKA3fjNTmenUYJC4JLtj9bPnzQ0SzhCO1GkzUrAZ40=; b=uje9EN/UZMK/H7vfqsSRaq5eI9p2IVlDjK7y2NVaefJNi6JmbGbvYhvnbikMst8HOu nCI1oJZWs2ySuo5bjF5lkVUw83oNX0u8KXNvXCsJwo98Wz+ql6zvbtw56lEZx/pWnbAa tkYB6tswaSZSN4hKD2Tg5GsT6uU24ytn128J/7ulTbRcvVlG8HG0+RJUyNYwP+sXaXAc ijjw2iE3vgXw6wH2QVHYonRVaf6WwtnqlTlQWeVGe37aRUsW/ZYPgtjpi3cd8jQM6O4P G+pM/6L1dIAKldjHKmXXlA5Hbftt2WU8mwCvH2QfdpZjk1ONEoW9YM/tgr9YmKlRGuEn mzQg== MIME-Version: 1.0 X-Received: by 10.68.247.6 with SMTP id ya6mr3058152pbc.45.1389976424651; Fri, 17 Jan 2014 08:33:44 -0800 (PST) In-Reply-To: <86zjmufubv.fsf@gmail.com> References: <1389901049.40172.YahooMailBasic@web163804.mail.gq1.yahoo.com> <32c1b684-1ff7-48c0-af7a-cd15235ea531@googlegroups.com> <86zjmufubv.fsf@gmail.com> Date: Sat, 18 Jan 2014 03:33:44 +1100 Subject: Re: Guessing the encoding from a BOM From: Chris Angelico Cc: "python-list@python.org" Content-Type: text/plain; charset=UTF-8 X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 7 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1389976433 news.xs4all.nl 2977 [2001:888:2000:d::a6]:49125 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:64177 On Sat, Jan 18, 2014 at 3:26 AM, Pete Forman wrote: > It would have been nice if there was an eighth encoding scheme defined > there UTF-8NB which would be UTF-8 with BOM not allowed. Or call that one UTF-8, and the one with the marker can be UTF-8-MS-NOTEPAD. ChrisA