nothrow by default
Johannes Pfau
nospam at example.com
Sun Jan 5 10:32:23 UTC 2020
Am Sat, 04 Jan 2020 13:38:53 -0800 schrieb Walter Bright:
>
> The short rationale is that exceptions being a "pay only if you use
> them" is a complete fraud. They're expensive to support, meaning
> performance programs use other ways of signalling errors and use
> nothrow.
I totally agree to that (memory overhead, TypeInfo, support code). In
addition, this overhead means that exceptions are unlikely to be supported
in embedded systems, which will lead to a language ecosystem split (this
effectively happened to C++).
But I'm not sure if that's a good rationale for nothrow by default. We
already tell users to use exceptions only for "exceptional cases" and use
other error handling mechanism for common error paths. With this change,
we'd force users even stronger to use other error handling methods. But
those don't have language support and we're back to C times, where users
forget to check error codes, ...
Herb Sutter came to the same conclusion recently and proposed an
alternative exception implementation for C++. If you didn't see this
already Walter, it well worth to have a look:
https://www.youtube.com/watch?v=os7cqJ5qlzo
https://www.youtube.com/watch?v=ARYP83yNAWk
I'll summarize the idea here, as far as I remember and adapted to D
terminology / context:
Rationale: There are two common ways to handle errors: Backtrace/
exception based and error code based. Both have drawbacks (exceptions:
performance, memory / implementation overhead, not supported in embedded
systems) (error codes: not "bubbling up" the call stack automatically, no
way to force user to check, so might accidentially ignore errors).
Solution:
1) Implement exceptions like return codes: Each function which can throw
returns a union ValueOrError{T result; Error err}. The discriminator
bit is stored as a CPU flag register which is not used across
function calls (e.g. the OVERFLOW status bit). Error is a two-word
error code: Error {size_t code; size_t context}. This explains the
ABI, for now nothing of this is user-visible.
Whenever a call to a function which might throw occurs and there's a
try-catch handler, insert this code after the call:
if (OVERFLOW == 1)
goto catch_handler;
If there's no catch handler installed and a function might throw:
if (OVERFLOW == 1)
return;
The return value is already in the return registers. Therefore
propagating error codes upwards is cheap.
The benefits now are simple: Implementation code / memory overhead is
almost zero, no TypeInfo required, trivial to implement on embedded
systems, .... At the same time, exceptions bubble up properly so they
can't be accidentially unhandled. The runtime overhead in the success
case is obviously higher than for some exception systems, but it's
only a simple conditional jump. It is cheaper than manual error codes,
as we reuse the same registers for return value and error code,
benefitting register allocation. Error propagation also uses the same
registers in all functions and is therefore very efficient.
For D / legacy exception support, you just store
code=LEGACY_EXCEPTION, context=Exception pointer. To allocate the
error codes in a distributed way and to check
for different error types, we can simply store dummy values in the
data segment to get unique addresses:
ubyte OutOfMemoryError; ... ; if (ret.code == &OutOfMemoryError)...
2) (Optional): Herb arguest that because of throw ... it is easy to spot
where an exception originates, but it's more difficult to find where
an exception was propagated. As a solution,
whenever calling a throws function, the calls should be preceeded by
throw:
auto value = throw myThrowingFunction();
Here throw does essentially this: If myThrowingFunction threw, rethrow
the exception. Otherwise return the return value.
3) (C++ specific): Error codes instead of exceptions, "Type based Errors":
I don't really know what is meant by this "Type-Based Errors"
terminology, seems to be a C++ marketing thing ("we do everything with
types now")... The important takeaway is to not allocate complex
exception objects. Use the error code and context value. If really
more context than one word is necessary, it's still possible to stuff
a pointer into context.
4) (D-specific): Not mentioned in the C++ proposal, but some interesting
extension: A more local error handling solution:
In D we have try/catch if we want to handle errors from multiple
functions calls and scope(failure) to handle all errors in a function.
But fine-grain error handling is annoying:
-----------------
try
foo();
catch(Exception e)
{
switch(e)
{
case DNS_ERROR:
writeln("Dns erorr");
retry();
case ...
}
}
// success code here
-----------------
quite some overhead here. Traditional error codes are better here:
-----------------
T ret;
switch(ret = foo())
{
case ...:
return;
}
// Success here
-----------------
Maybe we could formalize this: We can expose the ValueOrError union to
the user. foo() then returns ValueOrError!T and the 'throw' in 2)
could simply check for errors, otherwise return ValueOrError!(T).value.
We could then add an opCast(bool) overload to ValueOrError to make
if (value) work, add a function to check/abort and convert
ValueOrError.getValue and overload the switch statement on this:
-----------------
auto res = foo();
switch(res.error)
{
case DNS_ERROR:
return;
default: // Other error
return;
}
// With flow-typing we'd know that ValueOrError is no error here.
Without, we have to explicitly convert the type somehow:
T value;
// Switch magically overloaded on error / value tuple
switch ((value, error) = foo())
{
writeln(error);
return;
}
writeln(value);
-----------------
However, with these ideas in 4), we'd have to force the user somehow
to check the error bit and make the value only accessible after the
check.
Caveats:
* I have not thought about how exception chaining fits into all this.
* These exceptions do not naturally propagate through foreign language
interfaces, although I think we don't guarantee this in D right now
either.
If there's any real interest in this, I'd be happy to write a proper
DIP(s) for it (I think 1, 2 and 4 are all independent DIPs). If C++
really gets this, we might have to support it anyway for C++ interop.
PS: Back to the original question: nothrow by default: With 2) every
function which might throw and where errors are not handled locally needs
to have throw in front of the function call. Obviously, we don't want to
have too many of these, so nothrow functions should be the common case.
And the common case should be the default, so nothrow by default also
makes sense in this context.
--
Johannes
More information about the Digitalmars-d
mailing list