std.stringbuffer

Matti Niemenmaa see_signature at for.real.address
Wed Apr 30 06:22:39 PDT 2008


Janice Caron wrote:
> If any of you have plans to uppercase or lowercase UTF-8 in place,
> forget that now. It just ain't possible. (You can uppercase ASCII,
> UTF-16, or UTF-32 in place. But not UTF-8, and char[], by definition,
> is UTF-8).

It's possible that, in some obscure case, you can't uppercase UTF-16 in place 
either.

A code point in the BMP private use area (U+E000 to U+F8FF), which can be
represented with one UTF-16 code unit, may uppercase to something in the
supplementary private use areas (U+F0000 upwards), whose code points require two
UTF-16 code units each. Of course, this only matters if the toUpper function in
question is aware of such a private-use case mapping.
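
To make the size change concrete, here is a minimal sketch in Python (not D, purely for illustration). The mapping table PRIVATE_UPPER and the helper names are hypothetical; no real standard or library defines case mappings for private use code points.

```python
def utf16_units(cp: int) -> int:
    """Return how many UTF-16 code units are needed to encode code point cp."""
    # Code points below U+10000 fit in one unit; the rest need a surrogate pair.
    return 1 if cp < 0x10000 else 2

# Hypothetical, application-defined case mapping for private use code points.
# This is an assumption for illustration only.
PRIVATE_UPPER = {0xE000: 0xF0000}

def upper_grows(cp: int) -> bool:
    """True if uppercasing cp (under the hypothetical table) needs more code units."""
    upper = PRIVATE_UPPER.get(cp, cp)
    return utf16_units(upper) > utf16_units(cp)

# Cross-check the unit counts against Python's own UTF-16 encoder.
assert len(chr(0xE000).encode("utf-16-le")) // 2 == utf16_units(0xE000)
assert len(chr(0xF0000).encode("utf-16-le")) // 2 == utf16_units(0xF0000)
```

With such a table, upper_grows(0xE000) is true, so the result would not fit in the space the original occupied and an in-place conversion would have to grow the buffer.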

This is an extremely contrived case and I doubt it'll ever come up in practice, 
anywhere, but in theory it might. <g>

-- 
E-mail address: matti.niemenmaa+news, domain is iki (DOT) fi


