The Case For Autodecode

ag0aep6g via Digitalmars-d digitalmars-d at puremagic.com
Fri Jun 3 07:44:14 PDT 2016


On 06/03/2016 03:56 PM, Kagamin wrote:
> A lot of discussion is disagreement on understanding of correctness of
> unicode support. I see 4 possible meanings here:
> 1. Implemented according to spec.
> 2. Provides level 1 unicode support.
> 3. Provides level 2 unicode support.
> 4. Achieves the goal of unicode, i.e. text processing according to
> natural language rules.

Speaking of that, the document that Walter dug up [1], which talks about 
support levels, is about regular expression engines in particular. It's 
not about general language support.

The version he linked to is also pretty old. A more recent revision [2] 
calls level 1 (code points) the "minimally useful level of support", 
speaks warmly about level 2 (graphemes), and says that level 3 
(locale-dependent behavior) is "only useful for specific applications".
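
To make the difference between the levels concrete, here is a rough 
sketch in Python (standard library only). It is not taken from the 
report. The sample string and the helper approx_grapheme_count are made 
up for illustration, and the helper only skips combining marks; it is a 
crude stand-in for real grapheme segmentation, not an implementation of 
the Unicode rules.

```python
import re
import unicodedata

# 'e' followed by U+0301 COMBINING ACUTE ACCENT: one user-perceived character.
s = "e\u0301"

# Below level 1: UTF-8 code units (bytes).
code_units = len(s.encode("utf-8"))  # 3

# Level 1: code points. Python's str is indexed by code point,
# and '.' in the stdlib re module matches one code point at a time.
code_points = len(s)                 # 2
dot_matches = re.findall(".", s)     # ['e', '\u0301']

def approx_grapheme_count(text):
    """Hypothetical helper: count code points that are not combining marks.

    ASSUMPTION: this approximates level 2 (graphemes) for simple cases only.
    It ignores many cases that real grapheme segmentation handles.
    """
    return sum(1 for ch in text if unicodedata.combining(ch) == 0)

graphemes = approx_grapheme_count(s)  # 1
```

So a level 1 engine sees two things to match here, while something 
working at the grapheme level would see one.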


[1] http://unicode.org/reports/tr18/tr18-5.1.html
[2] http://www.unicode.org/reports/tr18/tr18-17.html

