identifiers & "unialpha"

Thomas Kuehne thomas-dloop at kuehne.cn
Fri Sep 22 07:23:05 PDT 2006


http://www.digitalmars.com/d/lex.html#identifier
# Identifiers start with a letter, _, or universal alpha, and are followed
# by any number of letters, _, digits, or universal alphas. Universal
# alphas are as defined in ISO/IEC 9899:1999(E) Appendix D. (This is the
# C99 Standard.)

Why does D reference "ISO/IEC 9899:1999 (E) Appendix D" to define
"universal alpha"? "ISO/IEC 9899:1999 (E) Appendix D" does not list
"universal alphas".

Sample:
\u00B7 (MIDDLE DOT, Other_Punctuation) is not a "universal alpha", yet
Appendix D allows it in identifiers.
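
The classification of the sample character can be checked against the
Unicode character database. The following is only a minimal sketch in
Python, not part of the D specification: the helper names
general_category and is_letter are made up for illustration, and the
result reflects whichever Unicode version the local Python interpreter
ships with, not ISO/IEC TR 10176.

```python
import unicodedata

# The sample character from the post: U+00B7 MIDDLE DOT.
MIDDLE_DOT = "\u00B7"


def general_category(ch):
    """Return the two-letter Unicode general category, e.g. 'Lu' or 'Po'."""
    return unicodedata.category(ch)


def is_letter(ch):
    """Hypothetical helper: True if the general category is a letter (L*)."""
    return unicodedata.category(ch).startswith("L")


if __name__ == "__main__":
    # Print the character name, its general category, and the letter test.
    print(unicodedata.name(MIDDLE_DOT),
          general_category(MIDDLE_DOT),
          is_letter(MIDDLE_DOT))
```

The category "Po" is the short form of Other_Punctuation, so a lexer
that took "universal alpha" literally as "alphabetic character" would
reject a character that Appendix D permits.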

"ISO/IEC 9899:1999 (E) Appendix D" itself references
"ISO/IEC TR 10176:1998" for the character data. I strongly suggest
dropping the indirection via "Appendix D" and referencing
"ISO/IEC TR 10176 (current)" instead of the dated
"ISO/IEC TR 10176:1998". The 1998 version lacks a sizable number of
CJK and math characters that are included in the current version.

Thomas




