Path: csiph.com!x330-a1.tempe.blueboxinc.net!usenet.pasdenom.info!aioe.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed6.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.010 X-Spam-Evidence: '*H*': 0.98; '*S*': 0.00; 'encoded': 0.05; 'ascii': 0.07; 'encoding.': 0.09; 'am,': 0.12; 'cc:addr:python-list': 0.15; 'codec': 0.16; 'encode': 0.16; 'encoded.': 0.16; 'ordinal': 0.16; 'received:192.168.1.104': 0.16; 'wrote:': 0.16; 'string,': 0.18; '(most': 0.21; 'header:In-Reply-To:1': 0.22; 'byte': 0.24; 'traceback': 0.24; 'cc:2**0': 0.25; "skip:' 30": 0.28; 'unicode': 0.28; "skip:' 10": 0.29; 'print': 0.29; 'cc:addr:python.org': 0.29; 'subject:?': 0.30; "can't": 0.32; 'header:User-Agent:1': 0.33; 'last):': 0.34; 'try:': 0.34; 'something': 0.35; 'file': 0.35; 'two': 0.37; 'but': 0.37; 'could': 0.37; 'received:192': 0.38; 'should': 0.38; 'subject:how': 0.39; 'sense': 0.39; 'received:192.168.1': 0.39; 'subject:: ': 0.39; 'your': 0.61; 'header:Reply-To:1': 0.71; 'reply-to:no real name:2**0': 0.72; 'charset:gb2312': 0.78 Date: Thu, 26 Jan 2012 08:17:49 -0500 From: Dave Angel User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.24) Gecko/20111109 Thunderbird/3.1.16 MIME-Version: 1.0 To: contro opinion Subject: Re: how to get my character? References: In-Reply-To: Content-Type: text/plain; charset=GB2312 Content-Transfer-Encoding: 8bit X-Provags-ID: V02:K0:n7WPPJ4rErvk+poVf97VUIJK/fG1zCUrLBtQALhiuNE EPO0QiyBfnYT/L6EXQHKopdyBZBlg8WVHKKKDbI7Qy1FX5fNfx ROkV4C/H7EcBgqHYoqhhhwk38fZai/a67bCJeVmAjSK4iRxupr yOVIUsH0afUW/meU1f2I+XQWcV6cZCBkqQ/yfLqatChp1G5pzD fW/zKRw4/39jKbY6jcg38i3hb9wCgfSIS1QvCJjGsr6LtqcHkY rQW6f+Vwsi8kmFBHMxXKipJ75ZR0PTWXM3883j5t3JrVhN6xFq /Taa+f0Z71rpSMNbMsYvhwdwpY5F4WfGwKAnnKy0HTI1+2vqA= = Cc: python-list X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.12 Precedence: list Reply-To: d@davea.name List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 34 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1327583906 news.xs4all.nl 6963 [2001:888:2000:d::a6]:52945 X-Complaints-To: abuse@xs4all.nl Xref: x330-a1.tempe.blueboxinc.net comp.lang.python:19487 On 01/26/2012 07:52 AM, contro opinion wrote: > my system:xp+python27 the codec, xp gbk;python 27 ascii > > a = 'ÄãºÃ' > a > '\xc4\xe3\xba\xc3' > print a > ÄãºÃ > '\xc4\xe3\xba\xc3'.decode('gbk') > u'\u4f60\u597d' > '\xc4\xe3\xba\xc3'.encode('gbk') > Traceback (most recent call last): File "", line 1, in UnicodeDecodeError: > 'ascii' codec can't decode byte 0xc4 in position 0: ordinal not in > range(128) > > how can i get "ÄãºÃ" from '\xc4\xe3\xba\xc3' ? > I don't have 'gbk' as my encoding. But on your system, if you simply print it, you should get the proper characters. Try: a = '\xc4\xe3\xba\xc3' print repr(a) print a And see if it now make sense. You're looking at the encoded form of the two characters. You could decode it to the two-character unicode string, as you showed above. But it makes no sense to try to encode something that's already encoded. -- DaveA