Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #54715

removing BOM prepended by codecs?

Path csiph.com!usenet.pasdenom.info!aioe.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed3.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
Return-Path <j.bagg@kent.ac.uk>
X-Original-To python-list@python.org
Delivered-To python-list@mail.python.org
X-Spam-Status OK 0.049
X-Spam-Evidence '*H*': 0.90; '*S*': 0.00; 'puts': 0.07; 'ascii': 0.09; 'python': 0.11; 'wrote': 0.14; 'creates': 0.14; 'stage.': 0.16; 'library': 0.18; 'normally': 0.19; 'machine': 0.22; 'header :User-Agent:1': 0.23; 'errors.': 0.24; 'unicode': 0.24; 'script': 0.25; 'updating': 0.26; 'characters': 0.30; "i'm": 0.30; 'getting': 0.31; 'cgi': 0.31; 'restricted': 0.31; 'file': 0.32; 'run': 0.32; 'problem': 0.35; 'editor': 0.35; 'but': 0.35; 'version': 0.36; 'done': 0.36; 'subject:?': 0.36; 'too': 0.37; 'checks': 0.38; 'to:addr:python-list': 0.38; 'files': 0.38; 'recent': 0.39; 'use.': 0.39; 'to:addr:python.org': 0.39; 'received:org': 0.40; 'new': 0.61; 'back': 0.62; 'more': 0.64; 'results': 0.69; 'computers': 0.72; 'exporter': 0.84
Date Tue, 24 Sep 2013 18:32:53 +0100
From "J. Bagg" <j.bagg@kent.ac.uk>
Organization Dept of Anthropology, University of Kent
User-Agent Mozilla/5.0 (Windows; U; Windows NT 6.1; en-GB; rv:1.9.1.16) Gecko/20101125 Thunderbird/3.0.11
MIME-Version 1.0
To python-list@python.org
Subject removing BOM prepended by codecs?
Content-Type text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding 7bit
X-Mailman-Approved-At Tue, 24 Sep 2013 21:00:26 +0200
X-BeenThere python-list@python.org
X-Mailman-Version 2.1.15
Precedence list
List-Id General discussion list for the Python programming language <python-list.python.org>
List-Unsubscribe <https://mail.python.org/mailman/options/python-list>, <mailto:python-list-request@python.org?subject=unsubscribe>
List-Archive <http://mail.python.org/pipermail/python-list/>
List-Post <mailto:python-list@python.org>
List-Help <mailto:python-list-request@python.org?subject=help>
List-Subscribe <https://mail.python.org/mailman/listinfo/python-list>, <mailto:python-list-request@python.org?subject=subscribe>
Newsgroups comp.lang.python
Message-ID <mailman.304.1380049229.18130.python-list@python.org> (permalink)
Lines 19
NNTP-Posting-Host 2001:888:2000:d::a6
X-Trace 1380049229 news.xs4all.nl 15980 [2001:888:2000:d::a6]:48103
X-Complaints-To abuse@xs4all.nl
Xref csiph.com comp.lang.python:54715

Show key headers only | View raw


My editor is JEdit. I use it on a Win 7 machine but have everything set 
up for *nix files as that is the machine I'm normally working on.

The files are mailed to me as updates. The library where the indexers 
work do use MS computers but this is restricted to EndNote with an 
exporter into the old Bib-Refer format which we use. I then run them 
through a Python program to check the unicode for new characters that 
also creates an ascii transliteration of the main fields and checks for 
errors.

The problem is occuring at the search stage. This stage creates a script 
with directives to search particular years and then puts the results 
into a file in /tmp. The process is left over from an old CGI version 
but is efficient and so has been kept. This has been done with a very 
old C program that a collegue wrote back in the 90s with more recent 
updates. I'm in the process of updating this to Python as it is getting 
too difficult to maintain.

J

Back to comp.lang.python | Previous | Next | Find similar | Unroll thread


Thread

removing BOM prepended by codecs? "J. Bagg" <j.bagg@kent.ac.uk> - 2013-09-24 18:32 +0100

csiph-web