std.string and unicode
Thomas Kuehne
thomas-dloop at kuehne.cn
Sun Dec 17 22:33:48 PST 2006
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Todor Totev schrieb am 2006-12-17:
<snip>
> It is compeltelly unrelated but I have a vague memory that the german
> letter "ß" (U+DF) when upcased is replaced with "SS". I'm not sure
> if this is true but if it is then std.uni.toUniUpper() has a bug because
> I don't see its code to check for this case. Could someone speaking
> german check this, please?
The uppercase version of "ß" is "SS". (At least accroding to Unicode and
DIN, many Germans however treat "ß" as caseless ...). Unicode allows to
types of toUpper/toLower: complete and simplified. The simplified
version doesn't change the casing if the number of codepoints would
change. Phobos currently excludes all changes where the simplified
version would cause a change the length of the UTF-8 encoded string.
For an updated std.uni see
http://www.digitalmars.com/pnews/read.php?server=news.digitalmars.com&group=digitalmars.D&artnum=34218
Thomas
-----BEGIN PGP SIGNATURE-----
iD8DBQFFhkNNLK5blCcjpWoRAmMYAJ9Yjua974pcPmYzt+zP6NVsqixDVACaA6KM
/4dVwg+nCCQ9gOW6zyWr8A8=
=lKh2
-----END PGP SIGNATURE-----
More information about the Digitalmars-d
mailing list