Today's programming challenge - How's your Range-Fu ?
Chris via Digitalmars-d
digitalmars-d at puremagic.com
Mon Apr 20 02:30:39 PDT 2015
On Saturday, 18 April 2015 at 17:04:54 UTC, Tobias Pankrath wrote:
>> Isn't this solved commonly with a normalization pass? We
>> should have a normalizeUTF() that can be inserted in a
>> pipeline.
>
> Yes.
>
>> Then the rest of Phobos doesn't need to mind these combining
>> characters. -- Andrei
>
> I don't think so. The thing is, even after normalization we
> have to deal with combining characters because in all
> normalization forms there will be combining characters left
> after normalization.
Yes, again and again I encountered length related bugs with
Unicode characters. Normalization is not 100% reliable. I don't
know anyone who works with non English characters who doesn't
have problems with Unicode related issues sometimes.
More information about the Digitalmars-d
mailing list