I have 2 thoughts. 1) Minor doc typo: Long form for hex notation should be \U00YYYYYY. 2) Unicode set syntax If you're going to provide unicode set support, why not use ICU syntax rather than invent another one? Jerry