From: Νικόλαος Κούρας
Newsgroups: comp.lang.python
Subject: Encoding questions (continuation)
Date: Mon, 10 Jun 2013 14:13:00 +0300

On Monday, 10 June 2013 1:42:25 PM UTC+3, Andreas Perstinger wrote:

>  >>> s = b'\xce\xb1'
>
>  >>> s[0]
>
> 206

's' is a bytes object, so how can you treat it as a string and ask it to present its first character?

>
>  >>> s[1]
>
> 177

Again, 's' is a bytes object, so how can you treat it as a string and ask it to present its second character?

> A bytes object is a sequence of bytes (= integer values) and supports indexing

A sequence of bytes is a sequence of bits, which are zeros and ones, not integers.
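
To make the quoted session concrete, here is a minimal sketch, assuming Python 3. It suggests that "integer" and "bits" are two notations for the same byte: indexing a bytes object hands back the byte's value as an int between 0 and 255, and that int can be written out as eight bits. The variable names are only illustrative.

```python
s = b'\xce\xb1'          # the two bytes from the quoted session

first = s[0]             # indexing a bytes object yields an int in range 0..255
second = s[1]

# The same values written out as eight bits each: an integer and a bit
# pattern are two ways of writing one byte.
first_bits = format(first, '08b')
second_bits = format(second, '08b')

print(first, first_bits)     # 206 11001110
print(second, second_bits)   # 177 10110001

# Slicing (unlike indexing) gives back a bytes object of length 1.
print(s[0:1])                # b'\xce'

# Decoding the whole sequence as UTF-8 gives a one-character string.
text = s.decode('utf-8')
print(len(text), text)       # 1 α
```

So `s[0]` is not the "first character" of a string; it is the first byte of the sequence, shown as a number.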


> Because your method doesn't work.
> If you use all possible 256 bit-combinations to represent a valid
> character, how do you decide where to stop in a sequence of bytes?

How do you mean? Please provide an example so I can understand this.
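
One possible example, as a sketch assuming Python 3: in UTF-8 the high bits of a character's first byte say how many bytes belong to that character, which is how a decoder knows where to stop. That only works because not every byte value is a complete character by itself. The helpers `utf8_sequence_length` and `split_utf8` are hypothetical names written for this illustration; they do not validate continuation bytes or reject overlong forms.

```python
def utf8_sequence_length(lead_byte: int) -> int:
    """Return how many bytes a UTF-8 character occupies, judged by its first byte."""
    if lead_byte < 0x80:            # 0xxxxxxx: a single-byte (ASCII) character
        return 1
    if 0xC0 <= lead_byte < 0xE0:    # 110xxxxx: start of a 2-byte character
        return 2
    if 0xE0 <= lead_byte < 0xF0:    # 1110xxxx: start of a 3-byte character
        return 3
    if 0xF0 <= lead_byte < 0xF8:    # 11110xxx: start of a 4-byte character
        return 4
    raise ValueError("not a valid UTF-8 lead byte: %#x" % lead_byte)


def split_utf8(data: bytes) -> list:
    """Cut a UTF-8 byte sequence into one bytes chunk per character."""
    chunks = []
    i = 0
    while i < len(data):
        n = utf8_sequence_length(data[i])
        chunks.append(data[i:i + n])
        i += n
    return chunks


data = 'aα€'.encode('utf-8')
print(data)              # b'a\xce\xb1\xe2\x82\xac'
print(split_utf8(data))  # [b'a', b'\xce\xb1', b'\xe2\x82\xac']

# If every one of the 256 byte values were a complete character on its own,
# as in the single-byte encoding ISO-8859-7, the same two bytes would have
# to be read as two characters instead of one.
print(len(b'\xce\xb1'.decode('utf-8')))       # 1
print(len(b'\xce\xb1'.decode('iso-8859-7')))  # 2
```

A byte such as 0xB1 (binary 10110001) starts with the bits 10, which marks it as a continuation byte, so it can never be mistaken for the start of a new character.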

> > EBCDIC and ASCII and Unicode are character sets, correct?

> > iso-8859-1, iso-8859-7, utf-8, utf-16, utf-32 and so on are encoding methods, right?

> Look at http://www.unicode.org/glossary/ for an explanation of all the terms

I did, but the docs confuse me even more. Can you please put it simply?

Unicode, as I understand it, was created out of the need for a big character set that could hold all the world's symbols, while ASCII could only store the first 128 characters and extended ASCII 256.

ASCII and Unicode are character sets.

Everything else seems to be an encoding system that works upon the character set (though I have never seen them act on the ASCII charset).

But is iso-8859-7 both a charset and an encoding system?
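
A small sketch of the distinction being asked about, assuming Python 3: the character set gives a character a number (its code point), and an encoding turns that number into bytes. ISO-8859-7 can be seen as doing both jobs at once, since it covers a limited set of characters and stores each one as a single byte. The variable names are only illustrative.

```python
ch = 'α'

# The character set assigns the character a number (its code point).
code_point = ord(ch)
print(code_point, hex(code_point))   # 945 0x3b1

# An encoding turns the character into bytes; different encodings
# produce different bytes for the same character.
encoded = {name: ch.encode(name)
           for name in ('utf-8', 'utf-16-le', 'utf-32-le', 'iso-8859-7')}
for name, raw in encoded.items():
    print(name, raw)
# utf-8 b'\xce\xb1'
# utf-16-le b'\xb1\x03'
# utf-32-le b'\xb1\x03\x00\x00'
# iso-8859-7 b'\xe1'

# ASCII has no code for this character, so encoding to it fails.
try:
    ch.encode('ascii')
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False
print(ascii_ok)                      # False
```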



P.S. I tried to post a reply to the thread I opened via the Thunderbird mail client, not as a reply to some other reply but as a new mail sent to the Python list.
Because of that, a new thread will be opened.
How can I tell Thunderbird to reply to the original thread and not start a new one?
Sorry for that, but I can't even post via Google Groups any more; it says that an error occurred.
Sorry for this. Please explain how to reply properly to the correct thread via mail.
Thank you.