Replacing tango.text.Ascii.isearch
bachmeier
no at spam.net
Fri Oct 7 12:19:59 UTC 2022
On Friday, 7 October 2022 at 07:16:19 UTC, Siarhei Siamashka
wrote:
> On Friday, 7 October 2022 at 06:34:50 UTC, Siarhei Siamashka
> wrote:
>> Also are we allowed to artificially construct needle and
>> haystack to blow up this test rather than only benchmarking it
>> on typical real data?
>
> Such as generating the input data via running:
>
> python -c "print(('a' * 49 + 'b') * 20000)" > test.lst
>
> And then using
> "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa" (the
> character 'a' replicated 50 times) as the needle to search for.
> Much longer needles work even better. In Linux the command line
> size is limited by 128K, so there's a huge room for improvement.
https://www.cs.utexas.edu/users/moore/best-ideas/string-searching/
"the longer the pattern is, the faster the algorithm goes"
More information about the Digitalmars-d-learn
mailing list