Let's stop parser Hell
Jacob Carlborg
doob at me.com
Wed Aug 1 02:14:52 PDT 2012
On 2012-08-01 08:11, Jonathan M Davis wrote:
> I'm not using regexes at all. It's using string mixins to reduce code
> duplication, but it's effectively hand-written. If I do it right, it should be
> _very_ difficult to make it any faster than it's going to be. It even
> specifically avoids decoding unicode characters and operates on ASCII
> characters as much as possible.
That's good idea. Most code can be treated as ASCII (I assume most
people code in english). It would basically only be string literals
containing characters outside the ASCII table.
BTW, have you seen this:
http://woboq.com/blog/utf-8-processing-using-simd.html
--
/Jacob Carlborg
More information about the Digitalmars-d
mailing list