| Started by | "J. Bagg" <j.bagg@kent.ac.uk> |
|---|---|
| First post | 2013-09-24 14:01 +0100 |
| Last post | 2013-09-24 14:01 +0100 |
| Articles | 1 — 1 participant |
removing BOM prepended by codecs? "J. Bagg" <j.bagg@kent.ac.uk> - 2013-09-24 14:01 +0100
| From | "J. Bagg" <j.bagg@kent.ac.uk> |
|---|---|
| Date | 2013-09-24 14:01 +0100 |
| Subject | removing BOM prepended by codecs? |
| Message-ID | <mailman.294.1380027842.18130.python-list@python.org> |
I'm using:

outputfile = codecs.open(fn, 'w+', 'utf-8', errors='strict')

to write, as I know that the files are Unicode compliant. I run the raw files that are delivered through a Python script to check the Unicode and report problem characters, which are then edited. The files use a whole variety of languages, from Sanskrit to Cyrillic and more obscure ones too.

I'll probably have to remove the BOM in the servlet, as we have standardised on UTF-8. This was done some years ago when UTF-16 was rare (apart from Macs).

J
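For anyone checking where a BOM might come from, here is a minimal sketch (not part of the original post) that writes the same text with the 'utf-8' and 'utf-8-sig' codecs and inspects the raw bytes. The helper names strip_bom and write_and_read_raw and the sample string are made up for illustration; the sketch assumes the stray character is a single leading U+FEFF.

```python
import codecs
import os
import tempfile

BOM = u"\ufeff"  # the byte order mark as a decoded character


def strip_bom(text):
    """Return text without a single leading U+FEFF, if one is present."""
    return text[1:] if text.startswith(BOM) else text


def write_and_read_raw(encoding, text):
    """Write text via codecs.open with the given encoding; return the raw bytes."""
    fd, path = tempfile.mkstemp()
    os.close(fd)
    try:
        with codecs.open(path, "w+", encoding, errors="strict") as f:
            f.write(text)
        with open(path, "rb") as f:
            return f.read()
    finally:
        os.remove(path)


# Hypothetical sample: a Devanagari word and a Cyrillic word.
sample = u"\u0938\u0902\u0938\u094d\u0915\u0943\u0924 \u0440\u0443\u0441\u0441\u043a\u0438\u0439"

plain = write_and_read_raw("utf-8", sample)       # 'utf-8' codec
signed = write_and_read_raw("utf-8-sig", sample)  # 'utf-8-sig' codec

print(plain.startswith(codecs.BOM_UTF8))
print(signed.startswith(codecs.BOM_UTF8))
print(strip_bom(signed.decode("utf-8")) == sample)
```

In Python's standard codecs, 'utf-8' does not emit a BOM on write while 'utf-8-sig' does, so if a BOM shows up in files written with the call above it may already be present in the text being written. Decoding with 'utf-8-sig' on the reading side is another way to drop it.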