From: Ian Kelly
Date: Thu, 8 Nov 2012 14:07:15 -0700
Subject: Re: Right solution to unicode error?
Newsgroups: comp.lang.python

On Thu, Nov 8, 2012 at 1:54 PM, Prasad, Ramit wrote:
> Why would font not matter? Unicode is the abstract definition
> of all characters right? From that we map the abstract
> character to a code page/set, which gives real values for an
> abstract character. From that code page we then visually display
> the "real value" based on the font. If that font does
> not have a glyph for a specific character page (or a different
> glyph) then that is a problem and not related encoding.

Usually though when the font is missing a glyph for a Unicode
character, you just get a missing glyph symbol, such as an empty
rectangle. For some reason when using the default font, cmd seemingly
ignores the active code page, skips decoding the characters, and tries
to print the individual bytes as if using code page 437.
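
To make the "individual bytes as if using code page 437" symptom concrete, here is a minimal sketch. It assumes the program wrote UTF-8 (the thread does not say which encoding was actually in play), and both the helper name `misread_as_cp437` and the sample string are made up for illustration. It does not reproduce cmd's behaviour, it only shows what the same bytes look like when each one is treated as a cp437 character.

```python
def misread_as_cp437(text, actual_encoding="utf-8"):
    """Return what `text` looks like if its encoded bytes are shown one by one as cp437.

    Hypothetical helper: `actual_encoding` is an assumption about what the
    program really wrote to the console.
    """
    raw = text.encode(actual_encoding)  # the bytes that reach the console
    return raw.decode("cp437")          # every byte value maps to some cp437 character


if __name__ == "__main__":
    sample = "caf\u00e9"  # 'café', an arbitrary example string
    # ascii() keeps the output printable regardless of the console's own encoding.
    print(ascii(sample.encode("utf-8")))
    print(ascii(misread_as_cp437(sample)))
```

A missing glyph would show up as one placeholder per character. Here the single non-ASCII character turns into two unrelated characters, one per byte, which points at decoding rather than at the font.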