[Issue 1357] Cannot use FFFF and FFFE in Unicode escape sequences.

d-bugmail at puremagic.com d-bugmail at puremagic.com
Tue Oct 2 10:48:18 PDT 2007


http://d.puremagic.com/issues/show_bug.cgi?id=1357





------- Comment #11 from smjg at iname.com  2007-10-02 12:48 -------
(In reply to comment #9)
> I guess Walter has to decide what a D lexer should do in case it 
> encounters a noncharacter in the source text.  My suggestion would 
> be to ignore noncharacters in favour of a faster lexer (although 
> probably not many people are going to stuff their source text with 
> unialpha identifiers and comments/strings with Unicode characters.)

That's a little off-topic to this issue.  Handling of actual non-characters in
the source code is a quite different matter from handling of escaped
representations of non-characters.

(In reply to comment #10)
> Testing for being under 0x10FFFF is redundant. dchar.max already is 0x10FFFF:

That doesn't follow.  It's perfectly possible for values greater than 0x10FFFF
to find their way into a file or a piece of memory intended to contain UTF-32
text.  .max doesn't constrain the contents of memory in any way.


-- 



More information about the Digitalmars-d-bugs mailing list