Re: question about csv.DictReader

Date	2013-04-03 21:52 -0500
From	Tim Chase <python.list@tim.thechases.com>
Subject	Re: question about csv.DictReader
References	<846339ea-366a-4bb2-b234-0e03bf87489e@googlegroups.com>
Newsgroups	comp.lang.python
Message-ID	<mailman.88.1365043854.3114.python-list@python.org>

On 2013-04-03 18:26, Norman Clerman wrote:
> Can anyone explain the presence of the characters "\xref\xbb\xbf"
> before the first field contents "Holdings" ?

(you mean "\xef", not "\xref")

This is a byte order mark (BOM), which you can read about at [1].  In
this case, it marks the file as UTF-8 encoded.  Certain programs
insert these, though a BOM matters more with UTF-16 or UTF-32
encodings, where it tells the reader whether the bytes are stored
big-endian or little-endian.  UTF-8 has only one byte order, so
there the BOM acts purely as an encoding signature.  I believe
Notepad and Visual Studio on Win32 were both offenders when it came
to inserting unbidden BOMs.
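
If you just want the BOM gone before csv.DictReader sees it, one
option is Python's "utf-8-sig" codec, which drops a leading BOM if
one is present.  What follows is only a minimal sketch, and it
assumes Python 3 (the thread doesn't say which version is in use).
The sample bytes and the "Value" column are made up for
illustration; only the "Holdings" header comes from the question.

```python
import codecs
import csv
import io

# Hypothetical sample data: a UTF-8 encoded CSV that starts with a BOM.
raw = b"\xef\xbb\xbfHoldings,Value\r\nACME,100\r\n"

# The three leading bytes are exactly the UTF-8 BOM.
has_bom = raw.startswith(codecs.BOM_UTF8)

# Decoding as plain "utf-8" keeps the BOM as U+FEFF, so it ends up
# glued to the front of the first field name.
plain = csv.DictReader(io.StringIO(raw.decode("utf-8"), newline=""))
plain_fields = plain.fieldnames

# Decoding as "utf-8-sig" strips the leading BOM first.
clean = csv.DictReader(io.StringIO(raw.decode("utf-8-sig"), newline=""))
clean_fields = clean.fieldnames
rows = list(clean)

print(has_bom)
print(plain_fields)
print(clean_fields)
print(rows)
```

When reading from a real file, the same idea would be
open(path, encoding="utf-8-sig", newline="") passed to
csv.DictReader, where "path" stands in for your own filename.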

-tkc

[1]
http://en.wikipedia.org/wiki/Byte_order_mark
