State of the Unicode in D

Dmitry Olshansky dmitry.olsh at gmail.com
Sat Jul 30 12:09:49 PDT 2011


On 30.07.2011 5:21, Walter Bright wrote:
> On 7/29/2011 4:24 PM, Walter Bright wrote:
>> http://training.perl.com/OSCON2011/index.html
>>
>> This is a good starting point for seeing where we are with Unicode 
>> support and
>> where we need to go.
>
> One problem: http://d.puremagic.com/issues/show_bug.cgi?id=6403

Let me expand a bit on my reply in Bugzilla.
Besides conformance to the Unicode regex standard, which is (going to be) 
fully supported in the upcoming next-gen std.regex, there are other 
things I'd like to note.
Things I'd love to see in an upgrade of std.uni:
     - normalization (at least NFC)
     - an update of the supported Unicode version from 5.0 to 6.0
     - grapheme support, via a special range on top of string or, at 
       least, a plain "stride" function that tells the length of a 
       cluster, like the one that does UTF-8 decoding
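
To make the normalization and "stride" items concrete, here is a minimal 
sketch in Python (not D, and not the code used in std.regex). The name 
grapheme_stride is hypothetical, and the cluster rule is a deliberate 
simplification: one code point plus any combining marks that follow it. 
Full grapheme segmentation has more rules than this.

```python
import unicodedata


def grapheme_stride(text: str, index: int) -> int:
    """Return how many code points the cluster starting at `index` spans.

    Simplified rule (an assumption for illustration): a cluster is one
    code point followed by zero or more combining marks, i.e. characters
    with a nonzero canonical combining class.
    """
    if index >= len(text):
        return 0
    end = index + 1
    while end < len(text) and unicodedata.combining(text[end]) != 0:
        end += 1
    return end - index


# 'e' followed by COMBINING ACUTE ACCENT, then a plain 'a'.
decomposed = "e\u0301a"

# NFC composes the pair into the single code point U+00E9.
composed = unicodedata.normalize("NFC", decomposed)

# Walk the decomposed string cluster by cluster using the stride.
clusters = []
i = 0
while i < len(decomposed):
    step = grapheme_stride(decomposed, i)
    clusters.append(decomposed[i:i + step])
    i += step
```

The stride is what lets a caller advance by user-perceived characters 
instead of code points, in the same way a UTF-8 stride lets it advance by 
code points instead of bytes.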
I had to (re)implement a lot of stuff, with the end result that the 
Unicode support in regex is self-contained right now.
Of course, I'd be willing to make arrangements to gradually shift some 
of this stuff back where it belongs, once I'm finished with regexes.

-- 
Dmitry Olshansky


