Path: csiph.com!usenet.pasdenom.info!weretis.net!feeder4.news.weretis.net!newsfeed.fsmpi.rwth-aachen.de!npeer.de.kpn-eurorings.net!npeer-ng0.de.kpn-eurorings.net!news.tele.dk!news.tele.dk!small.news.tele.dk!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.004 X-Spam-Evidence: '*H*': 0.99; '*S*': 0.00; 'encoding': 0.05; 'mrab': 0.05; 'string': 0.09; 'english,': 0.09; 'received:80.91': 0.09; 'received:80.91.229': 0.09; 'received:gmane.org': 0.09; 'received:list': 0.09; 'assume': 0.14; '12:38,': 0.16; 'encoding.': 0.16; 'received:80.91.229.3': 0.16; 'received:plane.gmane.org': 0.16; 'recognise': 0.16; 'subject:issue': 0.16; 'language': 0.16; 'wrote:': 0.18; 'header :User-Agent:1': 0.23; 'recognize': 0.24; 'server.': 0.24; 'header:X-Complaints-To:1': 0.27; 'said,': 0.30; 'you?': 0.31; 'obscure': 0.31; 'linux': 0.33; 'sense': 0.34; 'maybe': 0.34; 'skip:d 20': 0.34; "can't": 0.35; 'but': 0.35; 'in.': 0.36; 'words,': 0.36; 'skip:\xcf 20': 0.38; 'whatever': 0.38; 'to:addr :python-list': 0.38; 'does': 0.39; 'to:addr:python.org': 0.39; 'received:org': 0.40; 'dave': 0.60; 'most': 0.60; 'default': 0.69; '8bit%:100': 0.72; 'characters,': 0.84; 'greek': 0.84; 'angel': 0.91 X-Injected-Via-Gmane: http://gmane.org/ To: python-list@python.org From: Dave Angel Subject: Re: UnicodeDecodeError issue Date: Mon, 2 Sep 2013 12:24:16 +0000 (UTC) References: <5222fc40$0$6599$c3e8da3$5496439d@news.astraweb.com> <52247CED.9050101@mrabarnett.plus.com> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-Gmane-NNTP-Posting-Host: 174.32.174.36 User-Agent: XPN/1.2.6 (Street Spirit ; Linux) X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 33 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1378124707 news.xs4all.nl 15901 [2001:888:2000:d::a6]:44398 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:53479 On 2/9/2013 07:56, MRAB wrote: > On 02/09/2013 12:38, Dave Angel wrote: >> ¶γνωστοόνομα συστήματος >> >> I don't have a clue what it might be; it's not English, and I don't >> know whatever language it may be in. >> > You don't recognise Greek? I recognize most of those as Greek characters, but as I said, I don't know Greek. And because I can't recognize words, I can't assume it might not be some other language that uses the same glyphs. > >> Does that string make any sense to you? You may want to try it on your >> own machine, since the email may obscure the encoding. Or you might >> want to do the decode using whatever the default encoding is for that >> server. >> >> The Linux 'file' utility thinks this string is in ISO-8859, so you might >> want to try a decode('ISO-8859-1') as well. (and maybe ISO-8859-2, -3, >> -4, and -5) >> > It's ISO-8859-7 (Greek). -- DaveA