State of the Unicode in D

Walter Bright newshound2 at digitalmars.com
Sat Jul 30 12:44:16 PDT 2011


On 7/30/2011 12:09 PM, Dmitry Olshansky wrote:
> Let me expand a bit my reply on bugzilla.
> There are other things I'd like to note, besides conforming to unicode regex
> standard, that is (going to be) fully supported in upcoming next-gen std.regex.
> Things I'd love to see in an upgrade of std.uni:
> - normalization (at least NFC)
> - unicode version 5.0 ---> 6.0
> - grapheme support, via a special range on top of string or at least plain
> "stride" function that tells the length of a cluster a-la the one that does
> UTF-8 decoding
> I had to (re)implement a lot of stuff, with the end result that the unicode
> support in regex is self-contained right now.
> Of course, I'd be willing to make arrangements to gradually shift some of this
> stuff back where it belongs, once I'm finished with regexes.
>

Sounds great!


More information about the Digitalmars-d mailing list