random cover of a range

Bill Baxter wbaxter at gmail.com
Fri Feb 13 20:20:22 PST 2009


On Sat, Feb 14, 2009 at 1:17 PM, Bill Baxter <wbaxter at gmail.com> wrote:

> On Sat, Feb 14, 2009 at 1:03 PM, Andrei Alexandrescu <
> SeeWebsiteForEmail at erdani.org> wrote:
>
>> bearophile wrote:
>>
>>> Andrei Alexandrescu:
>>>
>>>> Say at some point there are k available (not taken) slots out of
>>>> "n". There is a k/n chance that a random selection finds an
>>>> unoccupied slot. The average number of random trials needed to find
>>>> an unoccupied slot is proportional to 1/(k/n) = n/k. So the total
>>>> number of random trials to span the entire array is quadratic.
>>>> Multiplying that by 0.9 leaves it quadratic.
>>>>
>>>
>>> It's like in hashing: if you want to fill 99% of the available space
>>> in a hash, then you take ages to find empty slots. But if you fill it
>>> only at 75-90%, then on average you need only one or two tries to
>>> find an empty slot. So your time is linear, with a small
>>> multiplicative constant. When the slots start to get mostly full, you
>>> change algorithm, copying the empty slots elsewhere.
>>>
>>
>> Well I don't buy it. If you make a point, you need to be more precise than
>> such hand-waving. It's not like in hashing. It's like in the algorithm we
>> discuss. If you make a clear point that your performance is better than
>> O(n*n) by stopping at 90% then make it. I didn't go through much formalism,
>> but my napkin says you're firmly in quadratic territory.
>
>
> Well he has a point that the number of trials required to find an empty
> depends not on the absolute number of empty items, but only the ratio of
> empties to fulls.   Even your own claim about average number of trials was
> n/k -- not sure how you got that though.  If he stops when that reaches a
> maximum of 9 then the algo can't be quadratic up to that point.  It's O(n)
> expected, with a possibly fairly high hidden constant.  It also has a fairly
> long tail, as in there's a small probability of it taking a very long time
> to complete.
>

... the real gotcha is that he's using O(n) extra memory.   10% of n is
still O(n) memory for shuffling the last 10% of the items.


--bb
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.puremagic.com/pipermail/digitalmars-d/attachments/20090214/3b7ab9ee/attachment.html>


More information about the Digitalmars-d mailing list