Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!newsfeed.xs4all.nl!newsfeed3.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
In-Reply-To: <kpf7hr$spl$16@news.ntua.gr>
References: <kpef1e$p37$3@news.ntua.gr> <mailman.3292.1371206432.3114.python-list@python.org> <kpf7hr$spl$16@news.ntua.gr>
Date: Fri, 14 Jun 2013 11:21:39 -0400
Subject: Re: A few questions about encoding
From: Joel Goldstick <joel.goldstick@gmail.com>
To: Nick the Gr33k <support@superhost.gr>
Content-Type: multipart/alternative; boundary=089e01184210b4d15d04df1ecd45
Cc: "python-list@python.org" <python-list@python.org>
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.3312.1371223308.3114.python-list@python.org>
Lines: 135
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:48163

--089e01184210b4d15d04df1ecd45
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Let's cut to the chase and start by telling us what you DO know, Nick.
That would take less typing.


On Fri, Jun 14, 2013 at 9:58 AM, Nick the Gr33k <support@superhost.gr> wrote:

> On 14/6/2013 1:14 μμ, Cameron Simpson wrote:
>
>> Normally a character in a b'...' item represents the byte value
>> matching the character's Unicode ordinal value.
>>
>
> The only thing that I didn't understand is this line.
> First, please tell me what a byte value is.
>
>
>> \x1b is a sequence you find inside strings (and "byte" strings, the
>> b'...' format).
>>
>
> \x1b is a character (ESC) represented in hex format.
>
> b'\x1b' is a bytes object that represents what?
>
>
> >>> chr(27).encode('utf-8')
> b'\x1b'
>
> >>> b'\x1b'.decode('utf-8')
> '\x1b'
>
> After decoding, it gives the char ESC in hex format.
> Shouldn't it result in the value 27, which is the ordinal of ESC?
>
>> No, I mean conceptually, there is no difference between a code-point
>> and its ordinal value. They are the same thing.
>
> Why doesn't the Unicode charset just contain characters, instead of
> a mapping of (characters <--> ordinals)?
>
> I mean, what we do is encode a character, like chr(65).encode('utf-8').
>
> What's the reason for the existence of its corresponding ordinal value,
> since it doesn't get involved in the encoding process?
>
> Thank you very much for taking the time to explain.
>
> --
> What is now proved was at first only imagined!
> --
> http://mail.python.org/mailman/listinfo/python-list
>
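
For what it's worth, the questions quoted above can be checked directly
in the interpreter. A minimal sketch, assuming Python 3 (the b'...'
output in the quoted session suggests it); the variable names are only
illustrative:

```python
# Assumes Python 3: a str holds code points, a bytes object holds integers 0-255.
esc = chr(27)                      # the ESC character; '\x1b' is only how repr() displays it
encoded = esc.encode('utf-8')      # a bytes object holding the single byte value 27
decoded = encoded.decode('utf-8')  # decoding returns a str, never an int

print(repr(decoded))   # '\x1b'  (a one-character string)
print(ord(decoded))    # 27      (ord() gives the code point / ordinal)
print(encoded[0])      # 27      (indexing bytes gives the byte value)

# Outside ASCII, the ordinal and the encoded bytes differ.
mu = '\u03bc'                      # GREEK SMALL LETTER MU
print(ord(mu))                     # 956
print(list(mu.encode('utf-8')))    # [206, 188], i.e. 0xCE 0xBC
```

So decode() gives back the character itself, and ord() is what turns it
into 27. The ordinal does take part in encoding: UTF-8 is a rule that
maps each code point number to a sequence of byte values.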



-- 
Joel Goldstick
http://joelgoldstick.com

--089e01184210b4d15d04df1ecd45--