Of possible interest: fast UTF8 validation

Walter Bright newshound2 at digitalmars.com
Wed May 16 20:42:11 UTC 2018


On 5/16/2018 1:11 PM, Andrei Alexandrescu wrote:
> If you could share some details on why you think UTF8 is badly designed and how 
> you believe it could be/have been better, I'd be in your debt!

Me too. I think UTF-8 is brilliant (and I suffered for years under the lash of 
other multibyte encodings prior to UTF-8). Shift-JIS: shudder!

Perhaps you're referring to the redundancy in UTF-8 - though illegal encodings 
are made possible by such redundancy.


More information about the Digitalmars-d mailing list