The Case Against Autodecode
Marco Leise via Digitalmars-d
digitalmars-d at puremagic.com
Fri May 13 12:26:40 PDT 2016
Am Fri, 13 May 2016 10:49:24 +0000
schrieb Marc Schütz <schuetzm at gmx.net>:
> In fact, even most European languages are affected if NFD
> normalization is used, which is the default on MacOS X.
>
> And this is actually the main problem with it: It was introduced
> to make unicode string handling correct. Well, it doesn't,
> therefore it has no justification.
+1 for leaning back and contemplate exactly what auto-decode
was aiming for and how it missed that goal.
You'll see that an ö may still be cut between the o and the ¨.
Hangul symbols are composed of pieces that go in different
corners. Those would also be split up by auto-decode.
Can we handle real world text AT ALL? Are graphemes good
enough to find the column in a fixed width display of some
string (e.g. line+column or an error)? No, there my still be
full-width characters in there that take 2 columns. :p
--
Marco
More information about the Digitalmars-d
mailing list