what's the correct way to handle unicode? - trying to print out graphemes here.

ag0aep6g anonymous at example.com
Tue Jul 3 14:37:42 UTC 2018


On Tuesday, 3 July 2018 at 13:32:52 UTC, aliak wrote:
> foreach (c; "👩‍👩‍👦‍👦🏳️‍🌈") {
>   writeln(c);
> }
>
> So basically the above just doesn't work. Prints gibberish.

Because you're printing one UTF-8 code unit (`char`) per line.

> So I figured, std.uni.byGrapheme would help, since that's what 
> they are, but I can't get it to print them back out? Is there a 
> way?
>
> foreach (c; "👩‍👩‍👦‍👦🏳️‍🌈".byGrapheme) {
>   writeln(c.<????>);
> }

You're looking for `c[]`. But that won't work, because std.uni 
apparently doesn't recognize those as grapheme clusters. The 
emojis may be too new. std.uni is based on Unicode version 6.2, 
which is a couple years old.


More information about the Digitalmars-d-learn mailing list