Regex and utf8
Roman Balitskiy
realis_toleroATtoleroDOTorg_fake at fake.com
Wed Aug 13 11:20:34 PDT 2008
Walter Bright Wrote:
>> When I try to parse cyrillic text I get "Error: 4invalid UTF-8
>> sequence". I use dmd 1.030 on Ubuntu 8.04 with utf8 locale. I have
>> tryed upcomming gdc 0.25 with the same results.
>> if (auto m = std.regexp.search(`abжdef`, `[ж]`))
>> // Here is cyrillic letter 'je'
>> writefln("%s[%s]%s", m.pre, m.match(0), m.post);
>>
> That's a bug with the regex engine, then. Who wants to put it in
> bugzilla? <g>
Is there any progress towards fix of that bug?
More information about the Digitalmars-d
mailing list