Replacing tango.text.Ascii.isearch

bachmeier no at spam.net
Fri Oct 7 12:19:59 UTC 2022


On Friday, 7 October 2022 at 07:16:19 UTC, Siarhei Siamashka 
wrote:
> On Friday, 7 October 2022 at 06:34:50 UTC, Siarhei Siamashka 
> wrote:
>> Also are we allowed to artificially construct needle and 
>> haystack to blow up this test rather than only benchmarking it 
>> on typical real data?
>
> Such as generating the input data via running:
>
>     python -c "print(('a' * 49 + 'b') * 20000)" > test.lst
>
> And then using 
> "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa" (the 
> character 'a' replicated 50 times) as the needle to search for. 
> Much longer needles work even better. In Linux the command line 
> size is limited by 128K, so there's a huge room for improvement.

https://www.cs.utexas.edu/users/moore/best-ideas/string-searching/

"the longer the pattern is, the faster the algorithm goes"


More information about the Digitalmars-d-learn mailing list