[e-lang] Source code character sets and Unicode

Ka-Ping Yee e-lang at zesty.ca
Mon May 28 19:37:45 EDT 2007


Hi, all.

I've been thinking and arguing a lot about Unicode issues lately
because of a recently accepted proposal to allow non-ASCII
identifiers in the next version of Python (see references below).

So I thought I'd ask: what are E's policies concerning

  - the character encoding of E source files
  - non-ASCII in comments
  - non-ASCII in string or string-like literals
  - non-ASCII in identifiers

and how, if at all, are these policies likely to change?


-- ?!ng

[1] http://www.python.org/dev/peps/pep-3131/
    PEP 3131, Supporting Non-ASCII identifiers

[2] http://mail.python.org/pipermail/python-3000/2007-May/007744.html
    Guido's acceptance of PEP 3131

[3] http://mail.python.org/pipermail/python-3000/2007-May/thread.html#7744
    The thread ensuing from [2]

[4] http://interglacial.com/javascript_spec/a-7.html#a-7.6
    ECMAScript identifier syntax

[5] http://java.sun.com/docs/books/jls/third_edition/html/lexical.html
    Java identifier syntax

[6] http://msdn2.microsoft.com/en-us/library/aa664670(vs.71,d=printer).aspx
    C# identifier syntax


More information about the e-lang mailing list