
Re: how to detect the encoding used for a specific text data ?

Date Thu, 20 Dec 2012 15:19:37 +0100
From Christian Heimes <christian@python.org>
To iMath <redstone-cold@163.com>
Subject Re: how to detect the encoding used for a specific text data ?
Newsgroups comp.lang.python


On 20.12.2012 12:57, iMath wrote:
> How to detect the encoding used for specific text data?

You can't.

It's not possible unless the file format itself specifies the encoding,
e.g. like XML's header <?xml version="1.0" encoding="UTF-8"?>.
Sometimes you can make an educated guess, but it's just a guess
and it may give you wrong results.
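
A rough sketch of both cases, in case it helps. The helper names
(declared_xml_encoding, guess_encoding) and the candidate list are my own
assumptions for illustration, not a standard API. The first function reads
the encoding named in an XML declaration; the second tries a few candidates
and returns the first one that decodes without error, which is only a guess.

```python
import re

# Hypothetical helpers for illustration only.
# Matches an XML declaration at the start of the data and captures the
# value of its encoding attribute.
_XML_DECL = re.compile(rb'^<\?xml[^>]*?encoding=["\']([A-Za-z0-9._-]+)["\']')


def declared_xml_encoding(data: bytes):
    """Return the encoding named in an XML declaration, or None."""
    # Skip a UTF-8 byte order mark if one is present.
    if data.startswith(b'\xef\xbb\xbf'):
        data = data[3:]
    match = _XML_DECL.match(data)
    return match.group(1).decode('ascii') if match else None


def guess_encoding(data: bytes, candidates=('utf-8', 'cp1252', 'latin-1')):
    """Return the first candidate that decodes the data without error.

    This is only a guess: decoding without error does not prove the
    text was written in that encoding.
    """
    for name in candidates:
        try:
            data.decode(name)
        except UnicodeDecodeError:
            continue
        return name
    return None


xml_bytes = b'<?xml version="1.0" encoding="UTF-8"?><root/>'
print(declared_xml_encoding(xml_bytes))

# The same bytes decode "successfully" under more than one encoding,
# which is why a guess can be wrong.
raw = 'é'.encode('utf-8')
print(guess_encoding(raw), repr(raw.decode('latin-1')))
```

latin-1 accepts every possible byte sequence, so it only makes sense as the
last candidate; a match there tells you nothing about the real encoding.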

Christian



Thread

how to detect the encoding used for a specific text data ? iMath <redstone-cold@163.com> - 2012-12-20 03:57 -0800
  Re: how to detect the encoding used for a specific text data ? iMath <redstone-cold@163.com> - 2012-12-20 04:06 -0800
  Re: how to detect the encoding used for a specific text data ? Jussi Piitulainen <jpiitula@ling.helsinki.fi> - 2012-12-20 14:48 +0200
  Re: how to detect the encoding used for a specific text data ? "Stefan H. Holek" <stefan@epy.co.at> - 2012-12-20 14:17 +0100
    Re: how to detect the encoding used for a specific text data ? iMath <redstone-cold@163.com> - 2012-12-20 05:50 -0800
      Re: how to detect the encoding used for a specific text data ? Jussi Piitulainen <jpiitula@ling.helsinki.fi> - 2012-12-20 16:10 +0200
    Re: how to detect the encoding used for a specific text data ? iMath <redstone-cold@163.com> - 2012-12-20 05:50 -0800
  Re: how to detect the encoding used for a specific text data ? Christian Heimes <christian@python.org> - 2012-12-20 15:19 +0100
  Re: how to detect the encoding used for a specific text data ? rurpy@yahoo.com - 2012-12-20 09:48 -0800
  Re: how to detect the encoding used for a specific text data ? Oscar Benjamin <oscar.j.benjamin@gmail.com> - 2012-12-21 12:38 +0000
  Re: how to detect the encoding used for a specific text data ? Dave Angel <d@davea.name> - 2012-12-21 09:14 -0500
