Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!news.albasani.net!rt.uk.eu.org!newsfeed.xs4all.nl!newsfeed1.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: UNSURE 0.360 X-Spam-Level: *** X-Spam-Evidence: '*H*': 0.44; '*S*': 0.16; 'strings.': 0.09; 'base64': 0.16; 'base64,': 0.16; 'hint:': 0.16; 'received:192.168.1.4': 0.16; 'coding': 0.22; 'header:User-Agent:1': 0.23; 'byte': 0.24; 'unicode': 0.24; 'header:In-Reply-To:1': 0.27; 'knowledge': 0.35; 'but': 0.35; 'should': 0.36; 'to:addr:python-list': 0.38; 'to:addr:python.org': 0.39; 'how': 0.40; 'skip:u 10': 0.60; 'email addr:gmail.com': 0.63; 'information': 0.63; 'charset:windows-1252': 0.65; 'pardon': 0.84; 'received:195.238': 0.84; 'received:195.238.6': 0.84; 'received:195.238.6.173': 0.84; 'received:belgacom.be': 0.84; 'received:isp.belgacom.be': 0.84; 'received:mailrelay007.isp.belgacom.be': 0.84 X-Belgacom-Dynamic: yes X-Cloudmark-SP-Filtered: true X-Cloudmark-SP-Result: v=1.1 cv=K/4FL2PCLwLz0yUCLDM8BdkkXYERL8kpd1e6lZPJPhE= c=1 sm=2 a=a21qFd9_iNQA:10 a=N659UExz7-8A:10 a=pGLkceISAAAA:8 a=CpcmgzqU1GGNBkoho1gA:9 a=pILNOxqGKmIA:10 a=MSl-tDqOz04A:10 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: ApcBAMb271FtgEev/2dsb2JhbAANToZNvkCBLIMZAQEEOEARCyEWDwkDAgECAQ82EAMGAgKHegOmfYlfDVeIB40VgT+BMBaDagOVdoFpjCeIPIFn Date: Fri, 26 Jul 2013 22:38:59 +0200 From: Antoon Pardon User-Agent: Mozilla/5.0 (X11; Linux i686; rv:17.0) Gecko/20130704 Icedove/17.0.7 MIME-Version: 1.0 To: python-list@python.org Subject: Re: RE Module Performance References: <571a6dfe-fd66-42cf-92fc-8b97cbe6e9e4@googlegroups.com> <51DFDE65.5040001@Gmail.com> <4f1067f6-bc99-42ad-9166-37fb228b90e8@googlegroups.com> <51f14395$0$29971$c3e8da3$5496439d@news.astraweb.com> <51f15e03$0$29971$c3e8da3$5496439d@news.astraweb.com> <8203e802-9dc5-44c5-9547-6e1947ee224b@googlegroups.com> In-Reply-To: <8203e802-9dc5-44c5-9547-6e1947ee224b@googlegroups.com> Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 7bit X-Mailman-Approved-At: Mon, 29 Jul 2013 09:27:00 +0200 X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 17 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1375082820 news.xs4all.nl 15888 [2001:888:2000:d::a6]:50703 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:51424 Op 26-07-13 15:21, wxjmfauth@gmail.com schreef: > > Hint: To understand Unicode (and every coding scheme), you should > understand "utf". The how and the *why*. No you don't. You are mixing the information with how the information is coded. utf is like base64, a way of coding the information that is usefull for storage or transfer. But once you have decode the byte stream, you no longer need any understanding of base64 to process your information. Likewise, once you have decode the bytestream into uniocde information you don't need knowledge of utf to process unicode strings. -- Antoon Pardon