The Case Against Autodecode

ag0aep6g via Digitalmars-d digitalmars-d at puremagic.com
Tue May 31 12:34:47 PDT 2016


On 05/31/2016 07:21 PM, Andrei Alexandrescu wrote:
> Could you please substantiate that? My understanding is that code unit
> is a higher-level Unicode notion independent of encoding, whereas code
> point is an encoding-dependent representation detail. -- Andrei

You got the terms mixed up. Code unit is lower level. Code point is 
higher level.

One code point is encoded with one or more code units. char is a UTF-8 
code unit. wchar is a UTF-16 code unit. dchar is both a UTF-32 code unit 
and a code point, because in UTF-32 it's a 1-to-1 relation.


More information about the Digitalmars-d mailing list