WebAssembly Won't Magically Make Your Code Faster
WebAssembly is often pitched as a performance silver bullet: compile your code to Wasm, get "near-native speed" in the browser, and leave JavaScript in the dust.
The reality is more nuanced. Wasm is a powerful tool, but it comes with constraints that can actually make your code slower if you use it wrong. Let's cut through the hype and understand when WebAssembly genuinely helps.
The Counterintuitive Truth About V8
Here's something that surprises many people: modern JavaScript engines are incredibly sophisticated.
V8 (Chrome, Node.js) doesn't just interpret your JavaScript. It profiles execution, identifies hot paths, and generates optimized machine code specifically tuned for your actual usage patterns. If you always pass numbers to a function, V8 generates fast number-only code.
WebAssembly doesn't get this treatment. It's compiled ahead of time. What you ship is what runs. There's no runtime profiling, no speculative optimization, no JIT magic.
This means mediocre WebAssembly competes against JavaScript that V8 has specifically optimized for your data. Sometimes the optimized JavaScript wins.
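The kind of speculation V8 performs can be sketched with a tiny example. The function below is hypothetical; the point is that a type-stable, monomorphic call site lets the engine emit specialized number-only machine code:

```javascript
// A function V8 can specialize: if it only ever sees numeric arrays,
// the engine generates fast number-only machine code for the hot loop.
function dot(a, b) {
  let sum = 0;
  for (let i = 0; i < a.length; i++) {
    sum += a[i] * b[i];
  }
  return sum;
}

const xs = new Float64Array([1, 2, 3]);
const ys = new Float64Array([4, 5, 6]);

// Monomorphic call sites like this stay on V8's fast path.
console.log(dot(xs, ys)); // 1*4 + 2*5 + 3*6 = 32

// Passing a different shape here -- say, an array of strings --
// could force V8 to deoptimize and fall back to generic code.
```

This is the JavaScript that your Wasm build actually has to beat.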

The Boundary Crossing Tax
The most common Wasm performance mistake is making too many calls across the JavaScript/WebAssembly boundary.
Consider image processing:
```javascript
// Naive approach: call Wasm for each pixel
for (let i = 0; i < pixels.length; i++) {
  pixels[i] = wasmModule.processPixel(pixels[i]);
}
```

Every call from JavaScript to WebAssembly has overhead. It's small, maybe a few hundred nanoseconds, but we're making millions of calls. The overhead dominates the actual work.
The fix is obvious once you understand the problem:
```javascript
// Better: one call, process everything in Wasm
const result = wasmModule.processAllPixels(pixelBuffer);
```

One boundary crossing instead of millions. The actual processing happens entirely within WebAssembly.
Rule of thumb: Cross the JS/Wasm boundary rarely. Send work in batches. Let Wasm chew on large chunks of data.
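The difference is easy to make concrete by counting calls. This sketch uses instrumented stand-ins for the Wasm exports (the functions and the transform are illustrative, not a real module):

```javascript
// Stand-ins for the Wasm exports, instrumented to count boundary crossings.
let crossings = 0;

function processPixel(p) {
  crossings++;     // one JS->Wasm crossing per pixel
  return p ^ 0xff; // placeholder transform: invert the byte
}

function processAllPixels(pixels) {
  crossings++;     // a single crossing for the whole buffer
  return pixels.map((p) => p ^ 0xff);
}

const pixels = Array.from({ length: 1_000_000 }, (_, i) => i & 0xff);

crossings = 0;
pixels.map(processPixel);
console.log(crossings); // 1000000 -- per-call overhead dominates

crossings = 0;
processAllPixels(pixels);
console.log(crossings); // 1 -- overhead is negligible
```

Same work, six orders of magnitude fewer crossings.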
The Memory Dance
WebAssembly and JavaScript have separate memory spaces. Data doesn't automatically flow between them.
When you pass an array from JavaScript to WebAssembly, it often gets copied. For small data, this is negligible. For large images or video frames, the copying overhead can eat your performance gains.
The solution is shared memory:
```rust
#[wasm_bindgen]
pub fn allocate_buffer(size: usize) -> *mut u8 {
    let mut buf = Vec::with_capacity(size);
    let ptr = buf.as_mut_ptr();
    std::mem::forget(buf); // leak the Vec so the allocation outlives this call
    ptr
}

#[wasm_bindgen]
pub fn process_buffer(ptr: *mut u8, len: usize) {
    // Safety: ptr/len must come from allocate_buffer with at least `len` bytes.
    let slice = unsafe { std::slice::from_raw_parts_mut(ptr, len) };
    for byte in slice.iter_mut() {
        *byte = transform(*byte);
    }
}
```

```javascript
// Allocate in Wasm memory
const ptr = wasmModule.allocate_buffer(imageData.length);

// Get a view into Wasm memory
const wasmMemory = new Uint8Array(
  wasmModule.memory.buffer, ptr, imageData.length
);

// Write directly into Wasm memory (one copy)
wasmMemory.set(imageData);

// Process in place (no copy)
wasmModule.process_buffer(ptr, imageData.length);

// Read results directly (no copy)
// wasmMemory now contains the processed data
```

This is more complex than the naive approach, but it eliminates unnecessary copies. For large data, this matters.
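The view-versus-copy distinction at the heart of this technique is visible with plain typed arrays, no Wasm required: `subarray` returns a view over the same buffer (like the `Uint8Array` over `wasmModule.memory.buffer` above), while `slice` allocates an independent copy.

```javascript
const buffer = new ArrayBuffer(8);
const bytes = new Uint8Array(buffer);
bytes.set([1, 2, 3, 4, 5, 6, 7, 8]);

// subarray: a view into the same underlying memory -- no copy.
const view = bytes.subarray(0, 4);
view[0] = 99;
console.log(bytes[0]); // 99 -- the original buffer changed

// slice: an independent copy.
const copy = bytes.slice(0, 4);
copy[1] = 42;
console.log(bytes[1]); // 2 -- the original buffer is untouched
```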
When WebAssembly Actually Shines
With those caveats established, let's talk about where Wasm genuinely excels:
CPU-Bound Computation
Cryptographic operations are the canonical example. You pass in bytes, the algorithm does intensive CPU work, and you get bytes back. Minimal boundary crossings, maximal computation.
```rust
#[wasm_bindgen]
pub fn hash_sha256(data: &[u8]) -> Vec<u8> {
    use sha2::{Sha256, Digest};
    let mut hasher = Sha256::new();
    hasher.update(data);
    hasher.finalize().to_vec()
}
```

This will typically outperform pure-JavaScript implementations. The algorithm is CPU-bound, data movement is minimal, and Rust's compiled output is genuinely faster than what V8 can generate for this kind of bit-twiddling loop.
Other good candidates:
- Image/video processing (with proper memory management)
- Physics simulations
- Compression/decompression
- Audio processing
- Parsing complex file formats
Predictable Performance
JavaScript's JIT compilation means variable performance. Cold functions are slow. Hot functions get optimized. Type changes can trigger deoptimization.
WebAssembly performance is consistent. The first call is the same speed as the millionth. For real-time applications (games, audio), this predictability can be more valuable than raw speed.
Porting Existing Code
If you have battle-tested C++ or Rust code, compiling to Wasm is often easier and safer than rewriting in JavaScript. You preserve the original logic, edge case handling, and years of bug fixes.
This is how projects like AutoCAD and Figma brought complex native applications to the browser.
When WebAssembly Is Probably Overkill
And here's where Wasm often disappoints:
DOM Manipulation
WebAssembly can't touch the DOM directly. Every DOM operation requires calling back into JavaScript. If your bottleneck is DOM updates (it usually is in web apps), Wasm won't help.
Network-Bound Work
If you're waiting for API responses, your performance ceiling is network latency. Optimizing CPU usage is irrelevant.
Most Web Applications
The performance bottleneck in typical web apps is rarely JavaScript execution speed. It's usually:
- Network requests
- DOM layout and rendering
- Memory allocation patterns
- Animation frame budget
Wasm doesn't address any of these.
Simple Algorithms
For basic operations, V8's optimized JavaScript is genuinely fast. The overhead of setting up Wasm might not be worth it for simple computations.
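For instance, a small type-stable aggregation like the one below gets compiled to tight machine code by V8 after a few iterations; moving it to Wasm buys little and costs you a toolchain, a binary download, and a boundary to manage:

```javascript
// A simple, type-stable loop -- exactly what V8's JIT handles well.
function sumOfSquares(values) {
  let total = 0;
  for (const v of values) {
    total += v * v;
  }
  return total;
}

console.log(sumOfSquares([1, 2, 3, 4])); // 1 + 4 + 9 + 16 = 30
```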
The Decision Framework
Before reaching for WebAssembly, ask:
Is the bottleneck actually CPU-bound computation? Profile first. Most web performance issues aren't CPU-bound.
Can you batch the work? If you need millions of small calls, Wasm will hurt more than help. You need to send chunks of work.
Is the data transfer overhead acceptable? Calculate how much data needs to move between JS and Wasm. If it's huge and can't be avoided, the copying might eat your gains.
Do you have existing optimized code to port? Compiling existing C++/Rust is compelling. Rewriting JavaScript in Rust just for Wasm is a bigger commitment.
Have you optimized the JavaScript first? Often the JS is slow due to algorithmic issues that would be equally slow in any language.
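The "profile first" step can start as simply as timing the suspect function (a sketch with a hypothetical `timeIt` helper; serious profiling belongs in the browser's performance panel, but this catches gross mismatches):

```javascript
// Hypothetical helper: time a function and report milliseconds spent.
function timeIt(label, fn) {
  const start = performance.now();
  const result = fn();
  console.log(`${label}: ${(performance.now() - start).toFixed(1)} ms`);
  return result;
}

// If "compute" is a tiny slice of total time, Wasm can't help much.
const data = Array.from({ length: 100_000 }, (_, i) => i);
const total = timeIt("compute", () => data.reduce((a, b) => a + b, 0));
```

If the computation barely registers next to network and rendering time, the decision is already made.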
The Technical Reality
WebAssembly is a target format, not a magic runtime. It isn't inherently faster than optimized JavaScript just because it becomes machine code; V8's hot paths become machine code too. Its real advantages are:
- Sandboxed execution in untrusted environments (browsers)
- Portable bytecode that runs on any Wasm runtime
- Predictable performance without JIT warmup
- Access to systems languages like Rust and C++
The performance story is more nuanced than "Wasm = fast." It's "Wasm + the right use case + proper memory management = fast."
A Balanced View
WebAssembly is a genuinely valuable technology. It enables applications that weren't practical in browsers before: games, CAD software, video editors, and more.
But it's not a performance cheat code. Using it well requires understanding its constraints: boundary crossing costs, memory management complexity, and the situations where it actually helps.
The best approach: profile your actual bottleneck, consider whether Wasm addresses it, and if so, design your interface to minimize boundary crossings and data copying.
And if JavaScript turns out to be fast enough? That's also fine. V8 is an engineering marvel. There's no shame in using it.