
Possible optimization: tree of closures #32

Open
cosmos72 opened this issue May 10, 2022 · 3 comments
Labels
enhancement New feature or request

Comments

@cosmos72

cosmos72 commented May 10, 2022

Directly interpreting the abstract syntax tree is slow, as you point out.
And Go is not really well-suited to write bytecode interpreters.

An approach that works quite well in Go, as I found out for my Go interpreter gomacro, is to convert the abstract syntax tree to a tree of closures (lambdas) and then execute it.

@eigenhombre
Owner

@cosmos72 thank you for the idea! gomacro looks cool.

Why is Go ... not really well-suited to write bytecode interpreters?

@cosmos72
Author

cosmos72 commented May 11, 2022

Because it's too high-level a language:

  • there's no way to suggest to the compiler which variables should be kept in registers,
  • there's no way to declare global register variables. See for example https://gcc.gnu.org/onlinedocs/gcc/Global-Register-Variables.html for the C/C++ counterpart.
  • it does not have computed gotos. See for example https://eli.thegreenplace.net/2012/07/12/computed-goto-for-efficient-dispatch-tables
  • it does not have tail-call optimization. There is a known technique where the VM state is kept in registers that are passed around via tail calls. It requires that the language ABI passes function arguments in registers (which Go added recently) and tail-call optimization (which the Go core team does not want to implement). Unfortunately, I cannot find a detailed description of it at the moment.

So the core loop of any bytecode interpreter will either:

  1. call a lot of small functions (up to one function call per bytecode executed),
  2. or contain a huge switch, which is very difficult for any compiler to optimize, and register allocation suffers badly.

Either way, a lot of state ends up kept in memory and loaded/stored very often, instead of staying in registers.

Anyway, if you start optimizing l1 heavily, you will soon notice that keeping everything wrapped in reflect.Value introduces a large overhead too.

@eigenhombre
Owner

Interesting, thanks. However, for the (short-running) toy problems I've looked at so far, l1 has been surprisingly fast. I will probably hold off on any optimizations until the language gets a little farther along, at least through macros. But it's nice to start thinking about how one might optimize things if/when that makes sense.

@eigenhombre eigenhombre added the enhancement New feature or request label Aug 17, 2022