X-Received: by 10.224.218.2 with SMTP id ho2mr25123018qab.8.1357115052375; Wed, 02 Jan 2013 00:24:12 -0800 (PST) Received: by 10.49.24.13 with SMTP id q13mr6934471qef.33.1357115052359; Wed, 02 Jan 2013 00:24:12 -0800 (PST) Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!news.glorb.com!ee4no11380215qab.0!news-out.google.com!k2ni4226qap.0!nntp.google.com!ee4no11495593qab.0!postnews.google.com!glegroupsg2000goo.googlegroups.com!not-for-mail Newsgroups: comp.lang.java.programmer Date: Wed, 2 Jan 2013 00:24:11 -0800 (PST) In-Reply-To: <0f28108e-6d35-43a1-a9df-b6c5636fb0ec@googlegroups.com> Complaints-To: groups-abuse@google.com Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=50.14.119.173; posting-account=lPVJQwoAAACjT2AlnY0YSj2LC4j2qtwQ NNTP-Posting-Host: 50.14.119.173 References: <0f28108e-6d35-43a1-a9df-b6c5636fb0ec@googlegroups.com> User-Agent: G2/1.0 MIME-Version: 1.0 Message-ID: <28e48a66-15c3-4ea8-99ca-b8addb95ef07@googlegroups.com> Subject: Re: question on java lang spec chapter 3.3 (unicode char lexing) From: "Aryeh M. Friedman" Injection-Date: Wed, 02 Jan 2013 08:24:12 +0000 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Xref: csiph.com comp.lang.java.programmer:20865 On Wednesday, January 2, 2013 3:20:12 AM UTC-5, Aryeh M. Friedman wrote: > If I am lexer for Java in a 100% unicode environment (it already uses uni= code for all internal representation of text) and 100% of the code that I w= ill be lexing is from that environment do I need still deal with unicode es= capes (\uXXXX) in real life [vs. theortically complete lexing]... assume th= at no code will be imported from non-unicode environments Just a follow up this is for a Java to native (x86) compiler written in Jav= a I am doing for fun (no practical purpose except for practice in compiler = writing [not for school or work])