Accented Characters and Counting Syllables
"Nordlöw" via Digitalmars-d-learn
digitalmars-d-learn at puremagic.com
Sun Dec 7 06:30:13 PST 2014
On Saturday, 6 December 2014 at 23:11:49 UTC, H. S. Teoh via
Digitalmars-d-learn wrote:
> This is a Unicode issue. What you want is neither byCodeUnit nor
> byCodePoint, but byGrapheme. A grapheme is the Unicode
> equivalent of
> what lay people would call a "character". A Unicode character
> (or more
> precisely, a "code point") is not necessarily a complete
> grapheme, as
> your example above shows; it's just a numerical value that
> uniquely
> identifies an entry in the Unicode character database.
>
>
> T
Ok, thanks.
I just noticed that byGrapheme() lacks bidirectional access.
Further it also lacks graphemeStrideBack() in complement to
graphemeStride()? Similar to stride() and strideBack(). Is this
difficult to implement?
More information about the Digitalmars-d-learn
mailing list