This is a good point. One of the things I have to balance with the book is teach...

wool_gather · on March 18, 2020

Probably sufficient to add it as a chapter footnote/challenge like you did with flexible array members for the string implementation. Maybe with a pointer to an example if you have time to find one.

_8ljf · on March 19, 2020

This. Compromises are fine: real-world solutions make them all the time. What’s valuable to readers is displaying the reasoning behind them.

kazinator · on March 18, 2020

> technique of compiling language constructs to runtime function calls.

That can be as basic as generating a function call AST node directly from a construct, or doing a node-to-node transformation.

The compiler doesn't see anything but a function call node (though any source code tracking info and whatnot still references the original construct).

If you're doing things with the AST other than just generating code, you may want to have a node for that original construct. (For instance, feeding cross-referencing info to a language server or whatever.)

munificent · on March 18, 2020

The implementation is actually a single-pass compiler so it goes from token stream straight to bytecode generation with no intermediate AST or other IR in between. Even so, it would be straightforward to emit the bytecode instruction sequence for a call to a special runtime function.

joe_the_user · on March 18, 2020

Well, the thing about this approach, I think, is that your code itself essentially becomes something like an image of your syntax tree (a la recursive descent parsing). That's fine if you want a sort of "toy" interpreter or interpreter that doesn't care about speed much.

But you're also using bytecode, which involves more caring about speed, so you've got a bit of a mismatch - as your code tries to match the multiple layers you're skipping, your code will get more complicated or your VM gets more arbitrary, so of the situation you have here.

munificent · on March 19, 2020

> or interpreter that doesn't care about speed much.

This is the same approach taken by Lua and many of the original Pascal compilers, which had to run on very slow hardware, so there's pretty good precedent that you can get adequate performance.

> But you're also using bytecode, which involves more caring about speed, so you've got a bit of a mismatch

I wouldn't describe it as a mismatch as much as it is a trade-off. Because the compiler only has a peephole view into the source when it needs to generate code, many optimizations are off the table. However, because we have full control over the instruction set, we can sometimes tweak the bytecode format in order to more naturally align with the compiler.

In return for not needing an intermediate representation or AST, we get a much simpler compiler, especially in C where you have to worry about memory management.

joe_the_user · on March 19, 2020

It is reasonable to say this is a tradeoff. However, one of the costs is having the VM structure depend on the language structure, which kind of a software engineering no-no. But yes, tradeoffs always happen.

munificent · on March 19, 2020

> which kind of a software engineering no-no.

Optimization along any axis (code size, simplicity, compile-time performance, runtime performance, memory size, etc.) always involves some level of violating software engineering norms.

One way to think of software engineering is that it's the practice of optimizing long-term developer velocity. Any practice that optimizes other factors generally does so at the expense of that. That's OK. Meta-software engineering is about knowing when long-term maintainability is the right factor to optimize for. :)

kazinator · on March 19, 2020

That is really only a political no-no. The issue is that if your VM takes off in popularity, some people get upset that it's a poor fit for their favorite language.

Technically, it's perfectly fine for a VM to be designed so that the semantic gap is small between it and the intended language family.

withinboredom · on March 18, 2020

Languages evolve, and changes need to be made.

Many languages didn't support "arrow functions" AKA lambdas until fairly recently. Seeing how one would go about making large changes to the language is a good lesson to learn. One could argue that we should have known superclasses were going to be supported from the beginning, and I'd agree, hindsight is always 20:20.

enedil · on March 18, 2020

Off-point. Lambdas are different, in the sense that with such functions you 1) need to think about the closure of the function 2) might need to reconsider what part of memory will be executable (you certainly don't want an executable heap). The change here is in the other direction - you remove a feature that is easily swapped with a builtin, which is implemented in terms of other already existing language features.

vmsp · on March 18, 2020

Hi Bob! I just want to tell you that I've just started reading your book and I'm enjoying it very much.

Maybe you could add a design note in the chapter with the mentioned content?

Keep it up!

munificent · on March 18, 2020

> Maybe you could add a design note in the chapter with the mentioned content?

Great idea!

Made a note of it: https://github.com/munificent/craftinginterpreters/issues/62...

tus88 · on March 19, 2020

[flagged]

doteka · on March 19, 2020

Yeah, that’s rude and uncalled for. Have you done anything comparable to Bob’s work on programming languages? Otherwise I’d suggest to shush.

rs23296008n1 · on March 19, 2020

Especially as I've seen plenty of goats on their hind legs. Excellent climbers.

This book is a great idea.