Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #4465

Re: codec for UTF-8 with BOM

References <cn6298-us4.ln1@satorlaser.homedns.org>
Date 2011-05-02 02:47 -0700
Subject Re: codec for UTF-8 with BOM
From Chris Rebert <clp2@rebertia.com>
Newsgroups comp.lang.python
Message-ID <mailman.1068.1304329675.9059.python-list@python.org> (permalink)

Show all headers | View raw


On Mon, May 2, 2011 at 1:34 AM, Ulrich Eckhardt
<ulrich.eckhardt@dominolaser.com> wrote:
> Hi!
>
> I want to write a file starting with the BOM and using UTF-8, and stumbled
> across some problems:
>
> 1. I would have expected one of the codecs to be 'UTF-8 with BOM' or
> something like that, but I can't find the correct name. Also, I can't find a
> way to get a list of the supported codecs at all, which strikes me as odd.

If nothing else, there's
http://docs.python.org/library/codecs.html#standard-encodings

The correct name, as you found below and as is corroborated by the
webpage, seems to be "utf_8_sig":
>>> u"FOøbar".encode('utf_8_sig')
'\xef\xbb\xbfFO\xc3\xb8bar'

This could definitely be documented more straightforwardly.

<snip>
> 3. The docs mention encodings.utf_8_sig, available since 2.5, but I can't
> locate that thing there either. What's going on here?

Works for me™:
Python 2.6.6 (r266:84292, Jan 12 2011, 13:35:00)
[GCC 4.2.1 (Apple Inc. build 5664)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from encodings import utf_8_sig
>>>

Cheers,
Chris
--
http://rebertia.com

Back to comp.lang.python | Previous | NextPrevious in thread | Next in thread | Find similar


Thread

codec for UTF-8 with BOM Ulrich Eckhardt <ulrich.eckhardt@dominolaser.com> - 2011-05-02 10:34 +0200
  Re: codec for UTF-8 with BOM Chris Rebert <clp2@rebertia.com> - 2011-05-02 02:47 -0700
    Re: codec for UTF-8 with BOM Ulrich Eckhardt <ulrich.eckhardt@dominolaser.com> - 2011-05-02 12:30 +0200
      Re: codec for UTF-8 with BOM Peter Otten <__peter__@web.de> - 2011-05-02 13:42 +0200

csiph-web