Compiling Ruby. Part 4: progress update

Published on Nov 30, 2023

This article is part of the series "Compiling Ruby," in which I'm documenting my journey of building an ahead-of-time (AOT) compiler for DragonRuby, which is based on mruby and heavily utilizes MLIR and LLVM infrastructure.

This series is mostly a brain dump, though sometimes I'm trying to make things easy to understand. Please, let me know if some specific part is unclear and you'd want me to elaborate on it.

Here is what you can expect from the series:

Motivation: some background reading on what and why
Compilers vs Interpreters: a high level overview of the chosen approach
RiteVM: a high-level overview of the mruby Virtual Machine
MLIR and compilation: covers what is MLIR and how it fits into the whole picture
Progress update: short progress update with what's done and what's next
Exceptions: an overview of how exceptions work in Ruby
Garbage Collection (TBD): an overview of how mruby manages memory
Fibers (TBD): what are fibers in Ruby, and how mruby makes them work

Note: the list of TBD articles may change as I may want to split some parts into smaller chunks.

It’s been a while since I wrote the last blog post. One of the reasons is that so far, I had to change a lot of things in the implementation due to the exception support.

I’m writing a short progress update on where we are and what’s coming next.

What Happened

During this year, I gave two short talks related to this project:

a high-level overview of the project (EuroLLVM dev meeting)
intro into exception handling in LLVM (LLVM Social Berlin)

The state as of EuroLLVM (May 2023) was as follows:

compiler supported 104 out of 107 bytecode operations
it could compile ~150 out of ~180 files
it could compile ~15KLoC out of ~20KLOC
~72% of tests were passing (1033 out of 1416 it could compile)

Current Status

The three missing opcodes were all about exception handling, and this is what (so far) took the most time to implement. I have some drafts on the details, and I plan to publish them before the end of the year.

With the proper exception handling in place, things are finally starting to take the right shape. There is still much work to do, but it’s more predictable now.

Some new stats:

all bytecode operations are implemented 🎉
all the ruby code in the repo is now compiled (stdlib, gems, tests) 🎉
~95% of the tests are passing (1378 out of 1450) 🎉

Next Steps

The test suite now drives the next steps:

the majority of the failing tests (42 out of 71) are due to the missing fibers implementation
the second biggest group is various proc/methods metadata for runtime reflection
the next big part is related to JIT/runtime evaluation (i.e., when you can execute arbitrary Ruby code not known/visible at compile time)
and there is a long tail of more minor things

Besides that, I need to figure out a better build system for all of it. Currently, It’s a mess glued together by CMake scripts and CMake templates. It works perfectly for development and testing, but I’d hate to use such a system as an end user.

Ideally, I want a one-click solution that would take Ruby files as input and produce a native executable.

What is the state of the art when it comes to build systems/orchestration of compilation? Please let me know if you have any pointers 🙌

Thank you so much for reaching this far!

The next article is about exceptions - Exceptions

Compiling Ruby. Part 4: progress update

What Happened

Current Status

Next Steps

Favorite Categories

Recent Posts