Faster Virtual Method Dispatch

Walter Bright newshound at digitalmars.com
Mon Apr 24 19:21:09 PDT 2006


Craig Black wrote:
> "BCS" <BCS_member at pathlink.com> wrote in message 
> news:e2isuc$2jmd$1 at digitaldaemon.com...
>> In article <e2inmo$2bjp$1 at digitaldaemon.com>, Craig Black says...
>>> "Dan" <Dan_member at pathlink.com> wrote in message
>>> news:e2im9o$29gi$1 at digitaldaemon.com...
>>>> Ultimately though, I think the GC causes the highest level of
>>>> performance troubles on the benchmarks.  I love it to death though
>>>> and wish someone would get their hands dirty making it smaller and
>>>> sexier.
>>> Or (truly) optional. :)  The notion that the GC is optional in D is
>>> really misleading.  Everything uses it.  While you could certainly
>>> make it faster at the expense of more complexity, I don't think a GC
>>> that scans the stack testing for non-null pointers will ever provide
>>> the performance of C's malloc and free.  GC is simply performing much
>>> more work.  I have heard many arguments to the contrary, but I still
>>> don't see how it is possible that GC could compete with malloc and
>>> free.
>>>
>>
>> A bad GC will definitely make for bad performance.  And any GC will be
>> slower than an optimal malloc/free solution.  However (and I'm still
>> undecided on this one) I think the argument is that in any non-trivial
>> project, manually freeing all of the memory without any memory leaks or
>> axing something too soon would require more overhead than using GC.
>> The trade-off is not in the freeing time but in the can-I-free-this
>> check.
> 
> I have heard similar: "The code required to manually free everything 
> outweighs the overhead of automatic GC."  Unless you are doing something 
> ridiculous in your code when freeing heap objects, this statement is false. 
> Manual freeing does not require gobs of code or eat up lots of processor 
> cycles.  Typically it involves testing a pointer to see if it is null, 
> calling free() if it is not, and then setting it to null.  How could this 
> ever be slower than scanning the entire stack for non-null pointers?  GC is 
> simply doing way more work.  IMO this statement is simply absurd.
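
A minimal sketch of the manual-freeing idiom described above (Node and
release() are illustrative names, not from the thread):

    #include <cstdlib>

    struct Node {
        int   value;
        Node* next;
    };

    void release(Node*& p)
    {
        if (p != NULL) {      // test the pointer (free(NULL) is also a legal no-op)
            std::free(p);     // release the block
            p = NULL;         // prevent dangling use and double frees
        }
    }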

There's more to it than that. One of the big benefits of GC is that 
fewer allocations are necessary, and a sure way to better allocation 
performance is to do fewer of them. The reason fewer allocations are 
necessary is that the normal behavior in C/C++, when faced with two 
pointers to the same object, is to copy the object. That way, there's no 
grief worrying about freeing an object that someone else still points to.

To see this effect at work, see www.digitalmars.com/d/cppstrings.html.
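
In code, the difference looks roughly like this (an illustrative sketch;
WidgetCopying and WidgetSharing are made-up names, not from that page):

    #include <string>

    // Without a GC, the safe default is to copy: the callee cannot know
    // how long the caller's buffer will stay alive.
    struct WidgetCopying {
        std::string name;                               // allocates and copies
        explicit WidgetCopying(const char* n) : name(n) {}
    };

    // Under a GC (as in D), storing the pointer is enough: the collector
    // keeps the characters alive as long as anyone refers to them.
    struct WidgetSharing {
        const char* name;                               // no allocation, no copy
        explicit WidgetSharing(const char* n) : name(n) {}
    };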

This problem is bad enough that the notion of shared_ptr<> has appeared. 
shared_ptr<> implements reference counting to eliminate the need for 
extra copies. But it has its own overhead problems (sketched in code 
after the list):

1) for every allocation, shared_ptr<> allocates an *extra* block of 
memory to hold its accounting information. So you've got, out of the 
box, twice as many allocations/deallocations happening.

2) for every copy of the shared_ptr<>, like passing one to a function, 
returning one, etc., the reference count has to be incremented, then 
decremented. The inc/dec must be done in an exception safe manner, i.e. 
they need to be unwindable. Even worse, for multithreaded apps these 
inc/dec's need to be synchronized.
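
To make those two costs concrete, here is a minimal sketch of a
reference-counted pointer (not the real shared_ptr<> implementation;
RefPtr and its members are invented names):

    #include <atomic>

    template <typename T>
    class RefPtr {
        T* object;
        std::atomic<long>* count;   // (1) a second heap allocation per managed object

    public:
        explicit RefPtr(T* p)
            : object(p), count(new std::atomic<long>(1)) {}   // extra allocation here

        RefPtr(const RefPtr& other)                 // (2) every copy, e.g. passing
            : object(other.object), count(other.count) {      // one to a function...
            count->fetch_add(1);                    // ...does a synchronized increment
        }

        ~RefPtr() {
            if (count->fetch_sub(1) == 1) {         // ...and a synchronized decrement
                delete object;
                delete count;                       // the bookkeeping block goes too
            }
        }

        T* get() const { return object; }

        RefPtr& operator=(const RefPtr&) = delete;  // omitted from this sketch
    };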

And even with all that falderol, shared_ptr<> still can't do slicing 
(referring to part of an array without copying it), which is an important 
efficiency improvement.
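
Under a GC, a slice alone keeps the underlying block alive; with
shared_ptr<> the count travels with the original allocation, so there is
no cheap way to hand out just a sub-range without copying it. A hedged
sketch of what a slice amounts to (Slice and firstWord are made-up names;
D writes the operation as a[i..j]):

    #include <cstddef>

    // A "slice" in the D sense: a pointer/length pair referring to part
    // of some other array, with no copy made.
    struct Slice {
        const char* ptr;
        std::size_t len;
    };

    // Return the first word of 'text' as a slice into the same buffer.
    Slice firstWord(Slice text)
    {
        std::size_t n = 0;
        while (n < text.len && text.ptr[n] != ' ')
            ++n;
        Slice head = { text.ptr, n };    // points into the original characters
        return head;                     // no allocation, no copying
    }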


