The state of string interpolation...one year later

Nick Treleaven nick at geany.org
Sun Mar 17 10:40:28 UTC 2019


On Sunday, 17 March 2019 at 06:01:35 UTC, Jonathan Marler wrote:
>     text("a is", a, ", b is ", b, " and the sum is: ", a + b)
>
> Ironically, his example had a mistake, but it was hard to
> notice.

Maybe he wrote that on the forum. In a text editor syntax 
highlighting would make the mistake clearer, but it can still 
happen. The interpolated syntax is definitely clearer.

I think interpolated strings as a language feature is 
justifiable, but that a mixin solution can work with some small 
more general changes to the language, and that these changes are 
both simpler to support and more useful and flexible.

> CON 1. The syntax is "not nice".  This defeats the entire point
> of interpolated strings.

What about `$ident!targs(args)` being sugar for 
`mixin(ident!targs(args))`? Then we can have a function template 
in object.d invoked as:

$iStr!"v = $v"
becomes:
mixin(iStr!"v = $v")

> With library solutions,
> you can't point syntax errors inside interpolated strings to
> source locations.  That information is not available to the
> language.  When you get a string, you don't know where each
> character inside that string originated from, only the compiler
> knows that.

Yes, but the language could support a way to get the file and 
line from template alias arguments inside the template. So at 
least the starting line of the string literal could be known and 
given in an error message.

> CON 3. Performance.  No matter what we do, any library solution
> will never be as fast as a language solution.

True, but what matters is whether a library solution is fast 
enough. If dmd isn't currently then perhaps the CTFE rewrite will 
be.

> CON 4. IDE/Editor Support.  A library solution won't be able to
> have IDE/Editor support for syntax highlighting, auto-complete,
> etc.  When the editor sees an interpolated string, it will be
> able to highlight the code inside it just like normal code.

The editor has to be updated for interpolated syntax, so it could 
just as easily be updated to recognise invocation of $iStr if 
iStr was in object.d.

> The library solution needs to parse that interpolated string
> but needs to know that the right paren at `")"` is actually
> just a string literal inside the expression and not a right
> paren to delimit the end of the expression.  This is a
> contrived example, but if you have anything less than a full
> lexer/parser then developers are going to have a hard time
> being able to know what can and can't go inside an interpolated
> expression.

Good point, but I think banning all string delimiters `'" is a 
reasonable and easy to understand restriction. It's good to 
discourage people from putting complex expressions inside 
strings, and this also makes iStr editor highlighting easier to 
implement vs the unrestricted language solution (which could also 
be restricted).

> That being said, I consider the implementation
> and complexity it adds to be quite minimal (see the PR for more
> details). As for the usefullness, I can say personally I would
> use this feature to replace almost all my usages of
> writefln/format and writeln which would be a big shift for my
> projects.  Instead of:
>
> writefln("My name is %s and my age is %s and my favorite hex is
> %s", name, age, favnum);

Actually this should be `writefln!"My name is..."(name, ...)`. 
Formatting can be more efficient if the format string is known at 
compile-time.

> I will be writing:
>
> writeln(i"My name is $name and my age is $age and my favorite
> hex $(favnum.formatHex)");

Does formatHex exist?

>      return text(iq{
>         $returnType $name($type left, $type right)
>         {
>             return cast($returnType)(left $op right);
>         }
>     });

Here there are advantages to the $iStr solution vs your 
implementation. First I would have two mixin functions, iSeq to 
do what you want, and iStr which includes the call to 
std.conv.text:

1. iStr doesn't need to import std.conv.text, which is a very 
common case. Having to import `text` explicitly would often make 
me avoid using the interpolated string feature and use existing 
string syntax instead. (A different language implementation could 
expose text as a property of the interpolated string though).

2. The iStr template gets the string literal passed at compile 
time,
so the length is available for buffer pre-allocation without 
summing the string fragment lengths at runtime. (A different 
language implementation could expose the original string length 
as an enum, then text, writef and format can take advantage of 
it).




More information about the Digitalmars-d mailing list