Formal Review of std.regex (FReD)

Dmitry Olshansky dmitry.olsh at gmail.com
Wed Oct 12 04:27:34 PDT 2011


On 12.10.2011 0:04, Jerry wrote:
> I have 2 thoughts.
>
> 1) Minor doc typo:
>
> Long form for hex notation should be \U00YYYYYY.

Yeah, \U it is.

>
> 2) Unicode set syntax
>
> If you're going to provide unicode set support, why not use ICU syntax
> rather than invent another one?

Looks like I was tricked by their technical standard then.
I can't immediately recall where this syntax was ever used but:
http://unicode.org/reports/tr18/#Subtraction_and_Intersection

The prime reason cited here is that e.g. '--' is (almost) unambigious 
with range notation '-' and also allows to skip [] where applicable
[\p{letter}--a-z] vs [[\p{letter}]-[a-z]].
Come to think of it '--' is cleaner in this case.
>
> Jerry


-- 
Dmitry Olshansky


More information about the Digitalmars-d mailing list