The Future of D Runtime
Adam Wilson
flyboynw at gmail.com
Wed Jun 5 23:58:14 UTC 2024
History and Problem
-------------------
Now that the Editions DIP is up public comment it is time to have
a frank discussion about where the D Runtime (DRT for the
remainder) fits into the puzzle that Editions introduces into the
D ecosystem. The reason I bring this up is that it appears that,
at present, most people treat DRT as a particularly vexing
extension of the compiler that nobody wants to touch. As a
result, the presumption seems to be that, with the advent of
editions, we should just freeze DRT wherever it is when the first
edition drops and never touch it again because doing so might
break something across editions. While this position is certainly
expedient, I would argue that this position is going to severely
limit the future of Phobos and D.
Furthermore, I would argue that this position makes a presumption
that all DRT should ever be is a compiler extension library. I
propose that this view is both incorrect and a significant
limitation on what we can do with Phobos 3 and even future
editions.
I want to call back to DConf 2023 where I spent much of
conference working to undo the deprecation of the ODBC bindings
in Phobos because they were being replaced with `version
(Windows)` bindings in DRT, which is utter nonsense, ODBC can be
used on POSIX quite easily (and I do every day). The explanation
I was given as to why this was done is that the bindings have
nothing to do with system resources, but because the bindings
were provided by DRT they should be removed from Phobos because
Phobos is not in the business of providing library bindings.
This creates a rule for Phobos. Phobos does not expose system
resources or bindings. IMO this is a good rule. Phobos is
bespoke, and that is the proper situation. The standard library
really should not be in the business of exposing system level
constructs.
Then why not use the DRT bindings? Because the bindings in DRT
are automatically generated and are fundamentally broken because
of it. At the time it was noted to me that those bindings really
should not be in DRT *either* because DRT isn’t supposed to
provide access to non-System resources like ODBC and they were
only there by accident because the ODBC headers got swept into
the auto-generation process.
This creates a rule for DRT. DRT is only to expose system
resources, nothing more. And indeed, this is a good rule as well.
The purpose of a runtime is to expose system resources in a
uniform manner, and to the extent that DRT does this, it does a
fantastic job.
Currently we call DRT a runtime, but when you catalog what it
does, it’s really a “compiler support library that provides some
runtime services.” There is no argument from me that the GC is a
runtime service, the same for threads, but the rest of the listed
items that DRT provides are very definitely aimed at enabling the
compiler to do things like AA’s for example. AA’s are a feature
that is only accessible from the language hence “compiler
support”.
The most often given reason for keeping DRT as-is is that “we
only need to port DRT to a new platform, Phobos is platform
independent.” I submit that this is factually incorrect. There
are currently 31 files in Phobos with `version (WinXXXX)`
statements in them. 14 with `version (linux)`. And IIRC everybody
who has tried a port D to a new platform recently has just not
bothered to port Phobos at all. To be sure, some of these version
statements are benign, but take a look at `std.stdio`, I wish
whoever wants to port that much luck in their endeavors. It has
32 instances `version (WinXXXX)` alone. And that is our basic I/O
module.
Runtimes are not compiler support libraries, they are
application-system interfaces. The .NET runtime does much more
than threads and the GC. The Java runtime does much more than
threads and the GC. The C runtime does much more than memory
management. I could go on, but you get the point.
The purpose of a runtime is to provide a system resource access
layer for the standard library. Consider the C Runtime and the C
Standard Library. The C Runtime provides the system specific
implementation, and the Standard Library provides the
standardized application interface. No matter what system you are
on, calling `printf` will print a string to the terminal via the
runtime. If you get a `FILE*` you will work with a file on disk
no matter what filesystem the system is using. These are not
compiler supports. They are a universal system-application
interface.
An example of why this matters is found in `std.stdio`. We
currently use the C Runtime to do basic I/O operations. While
that works and is expedient, it limits us. For example, doing
colored terminal output is so difficult nobody does it, or, we
are limited to the C file interface, etc. To gain access to more
advanced capabilities, we need to use the system API’s, but these
are diverse, and putting them in Phobos would significantly
expand the code required and make maintenance a nightmare. What
is interesting about this is that, in fact, D has two runtimes.
The C Runtime, which we use as a universal system interface, and
the DRT which is a bunch of compiler support tools that the CRT
doesn’t have.
Proposed Design
---------------
My proposal is that we expand DRT into a universal system
interface for the standard library. DRT would contain low-level
interfaces for Terminal and File I/O, threads, event-loops,
sockets, cryptography, anything that would normally access a
system API. Why go to all this effort? Simple, so that we can
move beyond the limitations imposed by the C Runtime. Using the
CRT was certainly expedient in the early days of D, but we’ve
grown beyond that and it’s time to upgrade our capabilities.
However, this will significantly expand the scope of DRT and will
make porting harder. I think the answer to this problem came out
of left field in a conversation I had last Thursday with an
ex-Google engineer who has no prior knowledge of D. What he said
was: “you need to shard your runtime.” This statement caused
several things to “click” and got me started writing this missive.
Before I get into the design of the DRT I want to propose rules
that will allow us to continue to evolve DRT in the future
without breaking past editions. From an ABI standpoint, there are
only three actions that can be taken: Add, Rename, Remove.
1. Remove. Banned. We cannot remove symbols from the DRT API. But
we can rename them provided we follow the rename rules.
2. Rename. Renames include changing the parameters in any way. We
can support this by offering either a shim that redirects to the
new method or leaving the original method alone and building a
new one next to it.
3. Add. The is probably the easiest. New symbols can be added at
any time.
If we follow these rules, then we can safely use the *latest* DRT
version with all prior editions.
For DRT I propose a sharded design with a split between the
compiler support modules and the universal system interface
shards. What this means is that there will be a “Core-DRT” that
only contains things that the compiler needs to function. Then,
for each subsystem in Phobos, there would be a corresponding DRT
subsystem that can be built and linked (statically or
dynamically). This means that the only mandatory component that
would have to be ported is the “Core-DRT” component for the
compiler. The porting engineer would then be free to choose to
port as much or as little of what remains of DRT and know that
the corresponding Phobos subsystem is guaranteed to work once the
port is complete. Another to view these components is as an
“onion” where each one builds on the one above it. Core must
exist for System, System exists for Cryptography, etc. Below is
a, likely incomplete, list of potential runtime shards.
Core:
1. Compiler Hooks
2. GC
3. Threads
4. Event Loop (in later iterations)
System:
1. Console/Terminal I/O
2. File I/O
Cryptography:
1. System Cryptography Primitives
2. Optional OpenSSL extended Cryptography Primitives (to support
what the OS does not).
Network:
1. Sockets
2. SSL/TLS
I’m not married to the above so please destroy. One thing that I
think would be beneficial for ease-of-porting is researching if
it would be possible to move Threads and the Event Loop out of
Core and into the System component then provide a mechanism for
notifying the GC of thread creation/deletion without actually
requiring the Threads (and thus the Event Loop) implementation to
be in the Core DRT so that people porting to systems that don’t
have Threads (or simply don’t want to support them) don’t have to
port them to get basic D code running.
The goal of Core is to be the most reduced component of the
runtime that can still produce a functional program in D. The
larger goal of this DRT expansion project is to eventually drop
the requirement for the C Runtime. By doing so we free ourselves
from the limitations of the CRT, such as being unable to handle
Console Colors on systems that support them, and we remove a
layer of abstraction by calling directly into the system API’s.
This does mean more work for porting in terms of needing to call
the system API’s directly as opposed to using the CRT, however,
one option could be to provide an opt-in CRT-enabled build of DRT
that uses the CRT for a limited set of features.
I want to point out that these are ideas intended to address the
limitations of the current situation and the proposal to make
changes is a result of looking at the work we’re going to have to
do with Phobos 3. I fully understand that I can be wrong on
things here, but the present situation with DRT is not going to
remain tenable as we move forward with expanding Phobos.
Another angle that might be worth pursuing is building DRT up
into a Runtime that can be used by many languages, similar to how
the CRT is used. It would also increase the appeal of D to folks
looking for a platform to start building a language on, like how
an ARM backend will help the hobbyist language community. I
understand that LLVM and GCC are there, but they are not simple
to work with. If we can provide an ecosystem that is simple to
work with for language experimentation and development, we can
capture people looking for a tool to start designing languages
quickly and still provide them a path to integrate with LLVM or
GCC via our existing integrations with those tools.
These are ideas, not directions, so please destroy.
More information about the Digitalmars-d
mailing list