[Issue 16998] New: Provide a uniq & group range methods that doesn't rely on sortedness

via Digitalmars-d-bugs digitalmars-d-bugs at puremagic.com
Wed Dec 21 18:27:50 PST 2016


https://issues.dlang.org/show_bug.cgi?id=16998

          Issue ID: 16998
           Summary: Provide a uniq & group range methods that doesn't rely
                    on sortedness
           Product: D
           Version: D2
          Hardware: x86_64
                OS: Linux
            Status: NEW
          Severity: enhancement
          Priority: P1
         Component: phobos
          Assignee: nobody at puremagic.com
          Reporter: greeenify at gmail.com

Sorting is in O(n log n) and requires a random access range (=allocation!)
However plain iteration can be done in O(n) and groups can be created with a
simple hashmap and e.g. the simplest implementation of uniq is sth. like this:

auto uniq(R)(R r)
{
    import std.algorithm.iteration : each;
    bool[dchar] d;
    r.each!(key => d[key] = true);
    return d.byKey;
}

unittest
{
    import std.algorithm.iteration : walkLength;
    assert("cbbba"d.uniq.walkLength == 3);
}

The idea is that for the common use case (unsorted ranges), these non-lazy
range functions allocate less and should perform faster.

--


More information about the Digitalmars-d-bugs mailing list