Let's stop parser Hell

Philippe Sigaud philippe.sigaud at gmail.com
Wed Aug 1 13:47:47 PDT 2012


On Wed, Aug 1, 2012 at 8:24 PM, Jacob Carlborg <doob at me.com> wrote:

> Don't know if that makes it clearer.

It does! Particularly this:

> All these encodings can store *every* character in the Unicode standard. What's
> different is how the characters are stored and how many bytes a single
> character takes to store in the string.
(emphasis mine)

I somehow thought that with UTF-8 you were limited to a part of
Unicode, and to another, bigger part with UTF-16.
I equated Unicode with UTF-32.
This is what completely warped my vision. It's good to learn something
new everyday, I guess.

Thanks Jacob!


More information about the Digitalmars-d mailing list