regex on binary data

ketmar via Digitalmars-d-learn digitalmars-d-learn at puremagic.com
Wed Dec 31 14:36:23 PST 2014


On Wed, 31 Dec 2014 15:36:16 +0000
Darrell via Digitalmars-d-learn <digitalmars-d-learn at puremagic.com>
wrote:

> So far, attempts to run regex on binary data cause
> "Invalid UTF-8 sequence".
> 
> Attempts to pass ubyte also didn't work out.

the current regex engine assumes that its input is UTF-8 encoded text,
which is why arbitrary binary data fails with "Invalid UTF-8 sequence".
i really want the regex engine to support user-supplied input ranges
instead, so that decoding can be done by the range (and the regex engine
can work on anything, not only on strings), but i'm not ready for that
challenge yet. maybe i'll try to do something with it in 2015. ;-)
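
until something like that exists, one possible workaround (a sketch, not
a feature of the regex engine itself) is to widen every byte to a dchar
before matching, so each byte value 0x00..0xFF becomes one code point and
no UTF-8 decoding is involved. the helper name bytesToDstring and the
sample bytes below are made up for illustration, and the copy costs four
times the memory of the original buffer.

```d
import std.algorithm : map;
import std.array : array;
import std.regex : matchFirst, regex;
import std.stdio : writeln;

// Hypothetical helper (not part of Phobos): widen each byte to one dchar,
// so byte N of the input is element N of the resulting dstring.
dstring bytesToDstring(const(ubyte)[] data)
{
    return data.map!(b => cast(dchar) b).array.idup;
}

void main()
{
    // Made-up binary blob; FF D8 FF is not valid UTF-8 on its own.
    immutable ubyte[] data = [0x00, 0x10, 0xFF, 0xD8, 0xFF, 0xE0, 0x42];

    auto text = bytesToDstring(data);

    // The pattern is a dstring too, so the Regex type matches the input type.
    // \xFF here means code point U+00FF, which stands for the byte 0xFF.
    auto m = matchFirst(text, regex(`\xFF\xD8\xFF`d));

    if (!m.empty)
    {
        // One dchar per byte, so the length of the prefix is the byte offset.
        writeln("match at byte offset ", m.pre.length);
    }
}
```

because the input is one dchar per byte, offsets and lengths reported by
the match map directly back to positions in the original ubyte[] buffer.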