Improving CSV parsing performance, Episode 2 (Was: Re: Speed of csvReader)

H. S. Teoh via Digitalmars-d-learn digitalmars-d-learn at puremagic.com
Mon Jan 25 22:27:49 PST 2016


On Sun, Jan 24, 2016 at 06:07:41AM +0000, Jesse Phillips via Digitalmars-d-learn wrote:
[...]
> My suggestion is to take the unittests used in std.csv and try to get
> your code working with them. As fastcsv limitations would prevent
> replacing the std.csv implementation the API may not need to match,
> but keeping close to the same would be best.

My thought is to integrate the fastcsv code into std.csv, such that the
current std.csv code will serve as fallback in the cases where fastcsv's
limitations would prevent it from being used, with fastcsv being chosen
where possible.

It may be possible to lift some of fastcsv's limitations, now that a few
performance bottlenecks have been identified (validation, excessive
number of small allocations, being the main ones). The code could be
generalized a bit more while preserving the optimizations in these key
areas.


T

-- 
BREAKFAST.COM halted...Cereal Port Not Responding. -- YHL


More information about the Digitalmars-d-learn mailing list