Visualizing persistent vectors with Rust and WebAssembly

Cloning a vector means copying every element. For small collections this is fine, but as vectors grow, so does the cost. In functional languages, where collections are immutable, cheap copies are vital for performance.

In Rust, this comes up too. Ownership means you must clone a vector if you want to keep using it after passing it somewhere else. As Niko Matsakis pointed out, Rust collections already behave like values; they just have an expensive clone. What if that clone could be nearly free?

That post got me interested in the problem. After some research, I ended up building pvec-rs, a Rust library implementing persistent vectors based on RRB trees, a variant designed for efficient splitting and concatenation. This post is a visual introduction to how it all works.

A quick intro to persistent vectors

The idea is to represent a vector as a tree internally. Instead of copying every element, you reuse most of the existing structure. To produce a modified version, you only need to copy the path from the root to the changed leaf. This is called path copying. The unchanged nodes are shared between the old and new versions of the tree. This is known as structural sharing, and a data structure that preserves previous versions of itself after modification is called persistent.

A flat vector of 10 elements and the same data represented as a tree with branching factor 4.

The diagrams in this post use a branching factor of 4 to keep things readable, but in practice 32 or 64 is used for better performance. Wide nodes are CPU cache friendly, and with a branching factor of 32, even ~4 billion elements (the limit of a 32-bit index) fit in a tree just 7 levels deep. That means at most 7 hops to reach any value, making random access effectively O(1).

This data structure is called a Radix Balanced Tree (RB tree). It also keeps a small tail buffer for recent pushes, avoiding a tree traversal on every append. New elements are first collected in the tail and only flushed into the tree when it fills up.

Path copying: Vector A shares its left subtree with Vector B. Only the root and right branch were copied.

Path copying in practice. Vector B is a clone of Vector A with 4 extra elements pushed. Red nodes are shared between both vectors. Orange nodes were created when pushing to the clone: only the root and the right branch needed to be copied.

The data structure has its roots in Phil Bagwell’s Ideal Hash Trees, which Rich Hickey adapted to build Clojure’s persistent vector. If you want to learn how it works in more detail, I would highly recommend Jean Niklas L’orange’s posts: part 1, part 2.

RRB trees

RB tree based vectors have a key weakness: splits and concatenations are O(n). RRB trees (Relaxed Radix Balanced) fix this by allowing nodes to be partially filled, which brings both operations down to O(log n). This matters for parallel code: you can split a vector across threads, process each part independently, then concatenate the results.

Because nodes can now be partially filled, the tree can no longer compute subtree sizes from its shape alone. Each relaxed node stores a size table to keep track.

After splitting: Vector A keeps elements 0-9 with a tail, Vector B gets elements 10-15 with a partially filled node and a size table.

A 16-element vector split at index 10. Vector A keeps elements 0-9, with the last two sitting in the tail. Vector B gets elements 10-15. Its left leaf only has 2 of 4 slots filled, so the root needs a size table (the numbers above it: 2, 6) to know how many elements each subtree holds.

Visualizing it

It took a couple of years of on-and-off hacking, but web-vis is finally in good enough shape to share. It compiles pvec-rs to WebAssembly and renders the tree in the browser. Each vector is drawn as a tree, and shared nodes are color-coded, so when you clone and push, you can see exactly which parts were copied and which are still shared. You can grow, shrink, clone, split, and concatenate vectors. If you split or concatenate, you will also see the size tables that relaxed nodes require.

Start by cloning a vector, then push to one of the clones to see path copying in action. Try it out.

If you want the full deep-dive (performance benchmarks, implementation details, and all the edge cases), it’s all in the thesis.