The length of strings vs. # of chars vs. sizeof
Daniel Keep
daniel.keep.lists at gmail.com
Sun Nov 1 21:59:47 PST 2009
Rainer Deyke wrote:
> Jesse Phillips wrote:
>> I believe the documentation you are looking for is:
>>
>> http://www.prowiki.org/wiki4d/wiki.cgi?DanielKeep/TextInD
>>
>> It is more about understanding UTF than it is about learning strings.
>
> One thing that page fails to mention is that D has no awareness of
> anything higher-level than code points. In particular:
> - dchar contains a code point, not a logical character.
> - D has no awareness of canonical forms and precomposed/decomposed
> characters (at the language level). (Some characters can be represented
> as either one or two code points. D does not know that these are
> supposed to represent the same character.)
> - Although D stops you from outputting an incomplete code point, it
> does not stop you from outputting an incomplete logical character.
>
> Also, some D library functions only work on the ASCII subset of utf-8.
Well, it *is* on a Wiki.
More information about the Digitalmars-d-learn
mailing list