Disable GC entirely
Paulo Pinto
pjmlp at progtools.org
Mon Apr 8 02:36:27 PDT 2013
On Monday, 8 April 2013 at 08:21:06 UTC, Manu wrote:
> On 8 April 2013 16:35, Paulo Pinto <pjmlp at progtools.org> wrote:
>
>> On Monday, 8 April 2013 at 03:13:00 UTC, Manu wrote:
>>
>>> On 7 April 2013 20:59, Paulo Pinto <pjmlp at progtools.org>
>>> wrote:
>>>
>>> I am not giving up speed. It just happens that I have been
>>> coding since
>>>> 1986 and I am a polyglot programmer that started doing
>>>> system programming
>>>> in the Pascal family of languages, before moving into C and
>>>> C++ land.
>>>>
>>>> Except for some cases, it does not matter if you get an
>>>> answer in 1s or
>>>> 2ms, however most single language C and C++ developers care
>>>> about the 2ms
>>>> case even before starting to code, this is what I don't
>>>> approve.
>>>>
>>>>
>>> Bear in mind, most remaining C/C++ programmers are realtime
>>> programmers,
>>> and that 2ms is 12.5% of the ENTIRE AMOUNT OF TIME that you
>>> have to run
>>> realtime software.
>>> If I chose not to care about 2ms only 8 times, I'll have no
>>> time left. I
>>> would cut off my left nut for 2ms most working days!
>>> I typically measure execution times in 10s of microseconds,
>>> if something
>>> measures in milliseconds it's a catastrophe that needs to be
>>> urgently
>>> addressed... and you're correct, as a C/C++ programmer, I DO
>>> design with
>>> consideration for sub-ms execution times before I write a
>>> single line of
>>> code.
>>> Consequently, I have seen the GC burn well into the ms on
>>> occasion, and as
>>> such, it is completely unacceptable in realtime software.
>>>
>>
>>
>> I do understand that, the thing is that since I am coding in
>> 1986, I
>> remember people complaining that C and Turbo Pascal were too
>> slow, lets
>> code everything in Assembly. Then C became alright, but C++
>> and Ada were
>> too slow, god forbid to call virtual methods or do any
>> operator calls in
>> C++'s case.
>>
>
> The C++ state hasn't changed though. We still avoid virtual
> calls like the
> plague.
> One of my biggest design gripes with D, hands down, is that
> functions are
> virtual by default. I believe this is a critical mistake, and
> the biggest
> one in the language by far.
There I agree with you, I prefer the C++ and C# model as it is
better suited for languages with AOT compilation.
Virtual by default, in terms of implementation, is not an issue
if the code is JITed, but with AOT compilation you need PGO to be
able to inline virtual calls.
>
> Afterwards the same discussion came around with JVM and .NET
> environments,
>> which while making GC widespread, also had the sad side-effect
>> to make
>> younger generations think that safe languages require a VM
>> when that is not
>> true.
>>
>
> I agree with this sad trend. D can help address this issue if
> it breaks
> free.
Even Go and Rust are a help in that direction I would say.
>
> Nowadays template based code beats C, systems programming is
> moving to C++
>> in mainstream OS, leaving C behind, while some security
>> conscious areas are
>> adopting Ada and Spark.
>>
>
> I don't see a significant trend towards C++ in systems code?
> Where are you
> looking?
Mainly at the following examples.
Microsoft stating C90 is good enough in their tooling and C++ is
way forward as the Windows system programming language.
On BUILD 2012 there is a brief mention from Herb Sutter that
kernel team is making the code C++ compliant, on his presentation
about Modern C++. I can search that on the videos, or maybe if
someone was there can confirm it.
Oh, and the new Windows APIs since XP are mostly COM based, thus
C++, because no sane person should try to use COM from C.
Mac OS X driver subsystem uses a C++ subset.
Symbian and BeOS/Haiku are implemented in C++.
OS/400 is a mix of Assembly, Modula-2 and C++.
Both gcc and clang now use C++ as implementation language.
Sometimes I think UNIX is what keeps C alive in a way.
> The main reason people are leaving C is because they've had
> quite enough of
> the inconvenience... 40 years is plenty thank you!
> I think the main problem for the latency is that nothing
> compelling enough
> really stood in to take the helm.
>
> Liberal use of templates only beats C where memory and
> bandwidth are
> unlimited. Sadly, most computers in the world these days are
> getting
> smaller, not bigger, so this is not a trend that should be
> followed.
> Binary size is, as always, a critical factor in performance
> (mainly
> relating to the size of the targets icache). Small isolated
> templates
> produce some great wins, over-application of templates results
> in crippling
> (and very hard to track/isolate) performance issues.
> These performance issues are virtually impossible to fight;
> they tend not
> to appear on profilers, since they're evenly distributed
> throughout the
> code, making the whole program uniformly slow, instead of
> producing
> hot-spots, which are much easier to combat.
> They also have the effect of tricking their authors
> into erroneously thinking that their code is performing really
> well, since
> the profilers show no visible hot spots. Truth is, they didn't
> both writing
> a proper basis for comparison, and as such, they will happily
> continue to
> believe their program performs well, or even improves the
> situation
> (...most likely verified by testing a single template version
> of one
> function over a generalised one that was slower, and not
> factoring in the
> uniform slowless of the whole application they have introduced).
>
> I often fear that D promotes its powerful templates too much,
> and that
> junior programmers might go even more nuts than in C++. I
> foresee that
> strict discipline will be required in the future... :/
I agree there.
Since D makes meta-programming too easy when compared with C++, I
think some examples are just too clever for average developers.
>
> So for me when someone claims about the speed benefits of C and
> C++
>> currently have, I smile as I remember having this kind of
>> discussions with
>> C having the role of too slow language.
>
>
> C was mainly too slow due to the immaturity of compilers, and
> the fact that
> computers were not powerful enough, or had enough resources to
> perform
> decent optimisations.
> [...]
Yeah the main issue was immature compilers.
Which is still true when targeting 8 and 16 processors as they
still have a similar environment, I imagine.
> With a good suite of intrinsics available to express
> architecture-specific
> concepts outside the language, I haven't had any reason to
> write assembly
> for years, the compiler/optimiser produce perfect code (within
> the ABI,
> which sometimes has problems).
I am cleaning up a toy compiler done on my final year (1999) and
I wanted to remove the libc dependency on the runtime, which is
quite small anyway, only allowing for int, boolean and string IO.
After playing around some hours writing Assembly from scratch, I
decided to use the C compiler as high level assembler, disabling
the dependency on the C runtime and talking directly to the
kernel,
It is already good enough to get myself busy with Assembly in the
code generator.
>
> Also, 6502 and z80 processors don't lend themselves to generic
> workloads.
> It's hard to develop a good general ABI for those machines; you
> typically
> want the ABI to be application specific... decent ABI's only
> started
> appearing for the 68000 line which had enough registers to
> implement a
> reasonable one.
>
> In short, I don't think your point is entirely relevalt. It's
> not the
> nature of C that was slow in those days, it's mainly the
> immaturity of the
> implementation, combined with the fact that the hardware did
> not yet
> support the concepts.
> So the point is fallacious, you basically can't get better
> performance if
> you hand-write x86 assembly these days. It will probably be
> worse.
I do lack real life experience in the game and real time areas,
but sometimes the complains about new language features seem to
be a thing of old generations don't wanting to learn the new ways.
But I have been proven wrong a few times. specially when I tend
to assume stuff even without proper field experience.
>
>
> Walter's claim is that D's inefficient GC is mitigated by the
> fact that D
>>> produces less garbage than other languages, and this is true
>>> to an extent.
>>> But given that is the case, to be reliable, it is of critical
>>> importance
>>> that:
>>> a) the programmer is aware of every allocation they are
>>> making, they can't
>>> be hidden inside benign looking library calls like
>>> toUpperInPlace.
>>> b) all allocations should be deliberate.
>>> c) helpful messages/debugging features need to be available
>>> to track where
>>> allocations are coming from. standardised statistical output
>>> would be most
>>> helpful.
>>> d) alternatives need to be available for the functions that
>>> allocate by
>>> nature, or an option for user-supplied allocators, like STL,
>>> so one can
>>> allocate from a pool instead.
>>> e) D is not very good at reducing localised allocations to
>>> the stack, this
>>> needs some attention. (array initialisation is particularly
>>> dangerous)
>>> f) the GC could do with budgeting controls. I'd like to
>>> assign it 150us
>>> per
>>> 16ms, and it would defer excess workload to later frames.
>>>
>>
>>
>> No doubt D's GC needs to be improved, but I doubt making D a
>> manual memory
>> managed language will improve the language's adoption, given
>> that all new
>> system programming languages either use GC or reference
>> counting as default
>> memory management.
>>
>
> I don't advocate making D a manual managed language. I advocate
> making it a
> _possibility_. Tools need to be supplied, because it wastes a
> LOT of time
> trying to assert your code (or subsets of your code, ie, an
> frame execution
> loop), is good.
Sorry about the confusion.
>
>
> What you need is a way to do controlled allocations for the few
> cases that
>> there is no way around it, but this should be reserved for
>> modules with
>> system code and not scattered everywhere.
>>
>>
>>> Of course I think given time D compilers will be able to
>>> achieve C++ like
>>>
>>>> performance, even with GC or who knows, a reference counted
>>>> version.
>>>>
>>>> Nowadays the only place I do manual memory management is
>>>> when writing
>>>> Assembly code.
>>>>
>>>>
>>> Apparently you don't write realtime software. I get so
>>> frustrated on this
>>> forum by how few people care about realtime software, or any
>>> architecture
>>> other than x86 (no offense to you personally, it's a general
>>> observation).
>>> Have you ever noticed how smooth and slick the iPhone UI
>>> feels? It runs at
>>> 60hz and doesn't miss a beat. It wouldn't work in D.
>>> Video games can't stutter, audio/video processing can't
>>> stutter. ....
>>>
>>
>> I am well aware of that and actually I do follow the game
>> industry quite
>> closely, being my second interest after systems/distributed
>> computing. And
>> I used to be a IGDA member for quite a few years.
>>
>> However I do see a lot of games being pushed out the door in
>> Java, C# with
>> local optimizations done in C and C++.
>>
>
>> Yeah most of they are no AAA, but that does make them less
>> enjoyable.
>>
>
> This is certainly a prevaling trend. The key reason for this is
> productivity I think. Game devs are sick of C++. Like, REALLY
> sick of it.
> Just don't want to waste their time anymore.
> Swearing about C++ is a daily talk point. This is an industry
> basically
> screaming out for salvation, but you'll find no real consensus
> on where to
> go. People are basically dabbling at the moment.
> They are also lead by the platform holders to some extent, MS
> has a lot of
> influence (holder of 2 majorplatforms) and they push C#.
>
> But yes, also as you say, the move towards 'casual' games,
> where the
> performance requirements aren't really critical.
> In 'big games' though, it's still brutally competitive. If you
> don't raise
> the technology/performance bar, your competition will.
> D is remarkably close to offering salvation... this GC business
> is one of
> the final hurdles I think.
This is what I see with most system programming languages. The
only ones that succeed in the long run, where the ones pushed by
the platform holders.
That is what got me dragged from Turbo Pascal/Delphi land into C
and C++, as I wanted to use the default OS languages, even though
I preferred the former ones.
>
>
>> I also had the pleasure of being able to use the Native Oberon
>> and AOS
>> operating systems back in the late 90's at the university,
>> desktop
>> operating systems done in GC systems programming languages.
>> Sure you could
>> do manual memory management, but only via the SYSTEM pseudo
>> module.
>>
>> One of the applications was a video player, just the decoder
>> was written
>> in Assembly.
>>
>> http://ignorethecode.net/blog/**2009/04/22/oberon/<http://ignorethecode.net/blog/2009/04/22/oberon/>
>>
>>
>> In the end the question is what would a D version just with
>> manual memory
>> management have as compelling feature against C++1y and Ada,
>> already
>> established languages with industry standards?
>>
>> Then again my lack of experience in the embedded world
>> invalidates what I
>> think might be the right way.
>>
>
> C++11 is a joke. Too little, too late if you ask me.
> It barely addresses the problems it tries to tackle, and a lot
> of it is
> really lame library solutions. Also, C++ is too stuck. Bad
> language design
> that can never be changed.
> It's templates are a nightmare in particular, and it'll be
> stuck with
> headers forever. I doubt the compile times will ever be
> significantly
> improved.
I agree with you there, but the industry seems to be following
along anyway.
>
> But again, I'm not actually advocating a D without the GC like
> others in
> this thread. I'm a realtime programmer, and I don't find the
> concepts
> incompatible, they just need tight control, and good
> debug/analysis tools.
> If I can timeslice the GC, limit it to ~150us/frame, that would
> do the
> trick. I'd pay 1-2% of my frame time for the convenience it
> offers for sure.
> I'd also rather it didn't stop the world. If it could collect
> on one thread
> while another thread was still churning data, that would really
> help the
> situation. Complex though...
> It helps that there are basically no runtime allocations in
> realtime
> software. This theoretically means the GC should have basically
> nothing to
> do! The state of the heap really shouldn't change from frame to
> frame, and
> surely that temporal consistency could be used to improve a
> good GC
> implementation? (Note: I know nothing about writing a GC)
> The main source of realtime allocations in D code come from
> array concatenation, and about 95% of that, in my experience,
> are
> completely local and could be relaxed onto the stack! But D
> doesn't do this
> in most cases (to my constant frustration)... it allocates
> anyway, even
> thought it can easily determine the allocation is localised.
Agreed.
Thanks for the explanation, it is always quite interesting to
read your counterarguments.
--
Paulo
More information about the Digitalmars-d
mailing list