Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.004 X-Spam-Evidence: '*H*': 0.99; '*S*': 0.00; 'cpython': 0.05; 'subject:Python': 0.05; 'names.': 0.07; 'default.': 0.09; 'internally': 0.09; 'locale': 0.09; 'api': 0.09; 'am,': 0.12; 'filesystem': 0.16; 'subject:usage': 0.16; 'utf8': 0.16; 'what?': 0.16; 'wrote:': 0.18; 'stick': 0.18; '(i.e.,': 0.21; 'received:209.85.210.174': 0.21; 'received:mail- iy0-f174.google.com': 0.21; 'header:In-Reply-To:1': 0.22; 'versions': 0.23; 'default,': 0.23; 'subject:numbers': 0.23; 'expect': 0.25; 'windows': 0.26; 'code': 0.26; 'unicode': 0.29; 'message-id:@gmail.com': 0.31; 'germany': 0.31; "can't": 0.33; 'header:User-Agent:1': 0.33; 'file': 0.34; 'steven': 0.34; 'to:addr:python-list': 0.35; 'encoding': 0.37; 'page': 0.37; 'but': 0.37; 'received:google.com': 0.37; 'received:209.85': 0.38; 'uses': 0.38; 'received:192': 0.38; 'problems': 0.38; 'files': 0.39; 'received:192.168.1': 0.39; 'unless': 0.39; 'received:209': 0.39; 'application': 0.40; 'to:addr:python.org': 0.40; 'us,': 0.71; '12:10': 0.84; 'either,': 0.84; 'encodings.': 0.84; 'localized': 0.84 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=message-id:date:from:user-agent:mime-version:to:subject:references :in-reply-to:x-enigmail-version:content-type :content-transfer-encoding; bh=2aMZkRjNTCkqdUJ19C53SnEdGQhqkG4pe2UvOd89wV8=; b=RkGexrk35hSBUOfrua1t3LBh5b3d+nJQ7dBttTAIjA5+9xWTFRt5AxtrASFLkeew6q vCn87HBYRyrRJ71Vn/6RBznmSQl0pEO5nS8N5oHTZAkYJl/obbTtAwLUqd98jKvUUdJn I+B/GG6lRostF/oTiCC3DeKuMlf3XKj+2mNw4= Date: Sun, 12 Feb 2012 01:05:35 -0600 From: Andrew Berg User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:9.0) Gecko/20111220 Thunderbird/9.0 MIME-Version: 1.0 To: "comp.lang.python" Subject: Re: Python usage numbers References: <4F36E2F5.9000505@gmail.com> <4f37229b$0$29986$c3e8da3$5496439d@news.astraweb.com> <4f3757cc$0$29986$c3e8da3$5496439d@news.astraweb.com> In-Reply-To: <4f3757cc$0$29986$c3e8da3$5496439d@news.astraweb.com> X-Enigmail-Version: 1.3.5 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.12 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 16 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1329030346 news.xs4all.nl 6873 [2001:888:2000:d::a6]:52338 X-Complaints-To: abuse@xs4all.nl Path: csiph.com!x330-a1.tempe.blueboxinc.net!usenet.pasdenom.info!bete-des-vosges.org!feed.ac-versailles.fr!gegeweb.org!de-l.enfer-du-nord.net!feeder2.enfer-du-nord.net!newsfeed.eweka.nl!eweka.nl!feeder3.eweka.nl!newsfeed.xs4all.nl!newsfeed6.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail Xref: x330-a1.tempe.blueboxinc.net comp.lang.python:20254 On 2/12/2012 12:10 AM, Steven D'Aprano wrote: > It's not just UTF8 either, but nearly all encodings. You can't even > expect to avoid problems if you stick to nothing but Windows, because > Windows' default encoding is localised: a file generated in (say) Israel > or Japan or Germany will use a different code page (encoding) by default > than one generated in (say) the US, Canada or UK. Generated by what? Windows will store a locale value for programs to use, but programs use Unicode internally by default (i.e., API calls are Unicode unless they were built for old versions of Windows), and the default filesystem (NTFS) uses Unicode for file names. AFAIK, only the terminal has a localized code page by default. Perhaps Notepad will write text files with the localized code page by default, but that's an application choice... -- CPython 3.2.2 | Windows NT 6.1.7601.17640