Recent changes to GDC.

Iain Buclaw ibuclaw at ubuntu.com
Mon Jan 16 12:49:40 PST 2012


On 16 January 2012 19:14, Artur Skawina <art.08.09 at gmail.com> wrote:
> On 01/16/12 17:43, Iain Buclaw wrote:
>> On 16 January 2012 14:41, Artur Skawina <art.08.09 at gmail.com> wrote:
>>> On 01/15/12 23:34, Iain Buclaw wrote:
>>>> * Merged in the work Walter has done for __vector type support.  There are now newly available GCC builtins for vector operations via gcc.builtins module.
>>>
>>> Allowing !=128 bits wide types would be just a matter of removing the frontend
>>> restrictions, right? (i didn't look at the latest commits, maybe you already did
>>> that)
>>> As the vector ops often will need to be wrapped, lack of cross module function
>>> inlining is a problem.
>>>
>>
>> If you want cross-module inlining, compile all sources in one command.
>>
>> gdc file1.d file2.d -o myapp
>
> Does this work for '-c' too? For building libs, like eg phobos?
>
> Hmm, '-fonly" seems like it could work, together with '-c' -- I have never
> used that option, will have to try it later - but if it works, shouldn't it
> be on by default (for '-[cS]' invocations)?
>

It won't.  For inlining to work, the function body (that is available
at compile-time in DMDFE AST) needs to be converted into GCC trees for
the backend to inline.  -fonly, whilst it processes the semantic
analysis on the other modules, does not generate any codegen for them.
 For cross module inlining to properly work in the one-file-at-a-time
compilation method, there needs to be an intermediate routine that
says 'build this body now, but don't add the function to the global to
output list'.


> It's not that "i want cross-module inlining" - in a language like D it's
> practically required, for anything bigger than a single module. There are
> even cases in runtime where "{return 1;}" functions are not inlined (eg the
> malloc/mmap wrappers).
>
> When i did the gcbits asms it was prompted by this profile:
>
> 37.62%  uint gc.gcx.Gcx.fullcollect(void*)
> 20.47%  uint gc.gcbits.GCBits.test(uint)
> 13.80%  uint gc.gcbits.GCBits.testSet(uint)
> 10.15%  pure nothrow @safe bool std.uni.isGraphical(dchar)
>  3.33%  _D3std5array17__T8AppenderTAyaZ8Appender10__T3putTwZ3putMF
>  2.78%  0x11025a
>  2.13%  _D3std6format65__T13formatElementTS3std5array17__T8Appende
>  1.64%  _D3std6format56__T10formatCharTS3std5array17__T8AppenderTA
>  1.37%  pure @safe uint std.utf.encode(ref char[4], dchar)
>  0.88%  void* gc.gcx.GC.malloc(uint, uint, uint*)
>  0.50%  pure nothrow @safe bool std.uni.binarySearch2(dchar, immutable(dchar[2][]))
>  0.45%  void gc.gcbits.GCBits.set(uint)
>  0.37%  void gc.gcbits.GCBits.clear(uint)
>  0.34%  __divdi3
>
> Clearly the garbage collector needs some love, but the 33%+ of the total
> app runtime (!) spent in those two trivial functions makes judging any
> GC improvements much harder. Even when inlined, the gcbits.* methods are
> still the hottest parts of the whole app, and while in this case it's the
> GC that needs fixing, that's not going to always be the case.
>
> A hack that can often be used to get cross module inlining working is
> adding a pair of parentheses - GDC will not throw away the function
> (well, now template) bodies and will happily inline them. But that's not
> really a good solution...
>

It's not a priority, but you will not be ignored.  A solution I'm
happy with needs to be drafted first though.


>>>> * GDC's default calling convention has now been switched back to that of the default for the target platform.
>>>
>>> Hmm. You just invented a new ABI. "extern(D)" will mean different things to
>>> different compilers on the same platform. Only makes sense if you're sure to
>>> win the ABI war, and the other party to practically disappear.
>>
>> I did not invent an ABI.  extern(D) is only defined for the x86
>> platform - and it is no secret that only DMD adheres to it.
>
> But what GDC emits for "extern(D)" is different both from what other D
> compilers do (what does LDC do on x86, BTW?) *and* it's different from
> the C/C++ convention...
>

I think LDC uses some fancy LLVM hook to control how structs are
returned, but that's about it.


>>> This new ABI still isn't compatible with the default "C/C++" one, because of
>>> name mangling. (could it be made, at least partially, C++ compatible?)
>>> As you still cannot easily call D code from C, without the equivalent of
>>> "extern(D)", why not default to a more sane calling conventions, such as
>>> regparm on 32-bit x86?
>>
>> The default calling convention is cdecl - this is directly ABI
>> compatible with C and C++ code with the exception to name mangling
>> requiring special treatment.
>>
>> eg:
>>
>> extern "C" int _Dmain(struct string args);
>
> That one exception makes a difference - it's still not directly
> callable from C/C++ w/o some kind of "extern(D)" declaration.
> (the _Dmain identifier is not the best example, other functions
> will not have similarly simple names...)
> And, yes, in theory, it could be done, but do you really expect
> C/C++ modules to directly use the demangled names?...
>
> Things like:
>
> pragma(attribute, ifunc("_D5ifunc6pick_fFZPFiZv")) void f(int i);
>
> is about the only time i had to do it (and this is actually a bug -
> ifunc should demangle the name).
>
> BTW, is the GDC exception handling compatible with C++?
>
>

It's practically rewrite of the C++ exception routine in D. But don't
quote me on it being directly compatible.


>>> GCC made a mistake years ago to not mangle the names when using a nonstd
>>> convention (eg by prepending '@' like some other compilers did) which
>>> resulted in this feature being unusable, other than for application-internal
>>> functions, as calling the wrong version will silently build and link, only to
>>> blow up at runtime; you also cannot place both versions in a library etc.
>>> Hence "fixing" this had to wait for a new architecture (x86_64) and whole
>>> program optimizations/LTO, which can figure out automatically when it's safe
>>> to switch to a more efficient convention.
>>>
>>
>> What makes you think changing the calling convention will turn off
>> name mangling?
>
> My point is that modifying the ABI has a significant cost and can't easily
> be undone. I don't see much gain from switching from one nonstd convention
> to a different nonstd convention, which is /similar/ to the C one, but still
> not directly accessible from C.
> What does your change mean? Programs now use a worse calling convention (not
> even one arg gets passed in a register), object files and libraries generated
> by different D compilers are incompatible (by design, not just because of some
> implementation quirk), worse, mixing libs produced by different toolchains may
> seem to work, only to fail at runtime. Where's the gain?
>

1.  My point is that modifying the ABI has a significant cost and
can't easily be undone. I don't see much gain from switching from one
nonstd convention to a different nonstd convention, which is /similar/
to the C one, but still not directly accessible from C.

A calling convention that is used by the target platform is better
than a calling convention that is used by no one but yourself.  Also,
I don't see how name mangling is part of the calling convention, but
maybe that's just me (I am aware that name mangling is part of various
Windows calling conventions, but not in *nix. :-)


2. What does your change mean? Programs now use a worse calling
convention (not even one arg gets passed in a register),

*cough* for x86 *cough*

I can't see how you can use this as an argument. Parameter passing
hasn't changed for x86, only the way certain values are returned. So
it is *still* as bad as it was *before* the change.


3. object files and libraries generated by different D compilers are
incompatible (by design, not just because of some implementation
quirk), worse, mixing libs produced by different toolchains may seem
to work, only to fail at runtime. Where's the gain?

It has always been this way. I've never known people to frequently mix
compilers when building code.


>
>>>> The D_InlineAsm family of version identifiers are now turned off by default as we no longer pretend to follow DMD's calling convention. For those who want to turn on D_InlineAsm(_X86/_x86_64) there is a -fd-inline-asm compiler switch.
>>>
>>> Why is it a compiler option? Ie is it safe to turn it on (why optional then?)
>>> or will it result in wrong code being generated (why have the option then?).
>>> D_InlineAsm allowed things like the cpuid code to work, will every asm{} now
>>> need to be rewritten, or could a safe subset of the "D" asms be reused?
>>>
>>
>> All Inline Asm code in Druntime and Phobos is written with the DMD
>> calling convention in mind, so ie: when one of the naked functions in
>> std.math pops the stack when returning a float, the caller won't clean
>> this in GDC generated code.  Other than that, it works perfectly well
>> (tm).
>
> Ah, ok, thank you for clarifying that. It would then be possible to keep
> D_InlineAsm working and only fail (or warn) when "naked" is used in "D" asm,
> right?
>
>

The only patch to the calling convention that I've needed to do to get
D Naked Functions working is to tell the caller that the callee will
pop the stack if is returning a float, double or real (or their
complex equivalents).  A better solution would be to implement this
"quirk" as another attribute, ie: callee_pop_float_return, which is
then handled in ix86_return_pops_args.  If that could be accepted in
GCC, then we'd be flying again.


>>>> * GDC will emit the GNU_InlineAsm version identifier to tell user code that we support GNU Extended Inline Assembly.
>>>
>>> BTW, a identifier, which is set for *both* x86 and x86_64 would be good - because
>>> of how D's version() works and the fact that on those platforms the asm code will
>>> often be identical.
>>>
>>>> * All patches to GCC proper have now been removed, GDC can now build applications without relying on changes to the backend, with the exception of naked functions.
>>>>
>>>> * "naked" has now been implemented now as a function attribute of the x86 platform.  It is applied to all naked functions, and implies noinline and noclone.
>>>
>>> I've never really understood the reason for "naked". Custom prologue and epilogue
>>> could be useful sometimes, but turning the compiler into an assembler?... You can
>>> already write "naked" asm functions in gcc - with asm()s outside of functions
>>> plus some as(1) section push/pop magic, iirc.
>>> Now you've removed the D builtin assembler, but decided to introduce "naked".
>>> Is it really necessary? There are no back-compat concerns...
>>> How does naked work will the various gcc features the modify prologue (profiling,
>>> stack usage checking, forced stack aligment etc)?
>>>
>>
>> I have not removed the D builtin assembler. I have just removed
>> D_InlineAsm version identifers.  GDC's "naked" function support was
>> originally a (pretty bad) hack in GCC.  It is now implemented cleanly
>> as a function attribute, and does not affect the code generation of
>> other frontends.
>
> I meant something like "practically removed the availability of D builtin
> assembler by removing the guarding version identifiers", sorry.
> But doesn't this mean that, right now, there are exactly zero users of this
> feature (gcc naked functions)?..
>
> artur

There is naked support for other architectures that have been accepted
in GCC (arm, avr, m68k, mcore, rx, spu) - yes I had to  look them up.
:-)


-- 
Iain Buclaw

*(p < e ? p++ : p) = (c & 0x0f) + '0';


More information about the D.gnu mailing list