Visualizing bsdiff: The Delta Compression Algorithm Used by macOS and Google Chrome

In 2003, the conventional wisdom was clear: to efficiently compress differences between compiled binaries, you needed a complete decompiler. You had to understand the executable format, parse the instruction set, track relocations, analyze the symbol table. Only then could you identify what actually changed between versions.

Then a PhD student at Oxford published a 4-page paper with a radically different approach. Instead of trying to understand the binary structure, he treated executables as opaque byte streams and applied a surprisingly simple technique: bytewise subtraction of almost matching regions.

The results spoke for themselves. On a FreeBSD security update—97 modified binaries, 36MB of changes—traditional tools produced 3.3MB patches. The commercial tool .RTPatch generated 750KB. This new algorithm? Just 621KB.

That's a 58x compression ratio achieved without understanding a single line of assembly.

The algorithm was called bsdiff, and it fundamentally changed how the industry thought about binary patching. Within a few years it became the foundation for software updates everywhere—from macOS to Chrome to mobile app stores. Today, every time your phone downloads a small security patch instead of re-downloading gigabytes of applications, you're benefiting from this counterintuitive insight.

What makes this algorithm so effective? And why does a relatively simple technique from 2003 still power billions of software updates today? Let's explore how bsdiff works.

The problem with binary patching

Traditional binary patch tools rely on simple COPY and ADD operations—they find matching regions and copy them from the old file, adding new bytes for everything else. This works fine for large changes, but completely falls apart for small security patches. You can avoid some of these by understanding the binary structure and decompiling, but that's not easy.

Here's the problem: when you fix a one-line bug and recompile, that tiny change cascades through the entire binary. Every function after the change shifts to a new address. Every pointer, function call, and jump instruction that references those addresses must update. A one-line source change can balloon into thousands of individual changes in the binary.

Before patch (old binary):
0x1000: push   %rbp
0x1001: mov    %rsp,%rbp
0x1004: mov    $0x5,%eax
0x1009: cmp    $0x0,%eax
...
0x1014: func_b() { ... }
0x1032: func_c() { ... }
0x105A: call 0x1014
0x105F: call 0x1032
After patch (new binary):
0x1000: push   %rbp
0x1001: mov    %rsp,%rbp
0x1004: mov    $0x5,%eax
0x1009: test   %eax,%eax     ← NEW!
0x100B: cmp    $0x0,%eax
...
0x1018: func_b() { ... }
0x1036: func_c() { ... }
0x105E: call 0x1018
0x1063: call 0x1036

The cascade effect from one instruction:

test %eax,%eax

→

Adds 4 bytes to func_a

func_b relocates:

func_c relocates:

All calls to these functions update throughout the binary

Traditional diff tools see all these address changes as independent new data, producing huge patches even though only one instruction actually changed.

Bsdiff's insight

Instead of treating address changes as completely new data, bsdiff does something clever: it computes the bytewise difference between matching regions. Here's what that looks like:

Old binary (matching region):

            48 8B 05
            14 10 00 00
            48 89 C7
            32 10 00 00
            E8
            14 10 00 00
            ...
          
Addresses 0x1014 and 0x1032 embedded in the code
New binary (same code, shifted addresses):

            48 8B 05
            18 10 00 00
            48 89 C7
            36 10 00 00
            E8
            18 10 00 00
            ...
          
Addresses became 0x1018 and 0x1036 (shifted by +4 bytes)
Binary difference (new - old):

              00 00 00
              04 00 00 00
              00 00 00
              04 00 00 00
              00
              04 00 00 00
              ...
            
Mostly zeros, with only one byte changing!

This is the magic of binary subtraction. After computing the difference, you're left with a file that's mostly zeros, with the only non-zero data being the address offset—which is constant throughout the file (in this case, +4 for every address).

Any compression algorithm—whether bzip2 (used in the original paper), zstd, brotli, or lzma—is exceptionally good at compressing files full of zeros and repeated patterns. A traditional COPY/ADD tool would encode all those shifted addresses as new data—dozens of bytes per address change. Bsdiff encodes them as a stream of mostly zeros with a repeating +4 pattern—which compresses down to almost nothing.

That's why a 50KB traditional patch can become a 1KB bsdiff patch. You're not storing the addresses themselves—you're storing the predictable, highly compressible differences.

Why this compresses so well:

Raw difference stream (100+ bytes):

              00 00 00 04 00 00 00 00 00 00 04 00 00 00 00 04 00 00 00 00 00 00 04 00 00 00 00 00 00 04 00 00 00 00 04 00 00 00 ...

Long runs of zeros

Compressors encode repeated bytes very efficiently using dictionary-based compression

Low entropy

With mostly zeros and only 1-2 other values, there's very little information to encode

The algorithm produces three outputs:

Control file: ADD and INSERT instructions for reconstruction
Diff file: Bytewise differences in matched regions
Extra file: Completely new bytes not in the old version

When compressed with bzip2, these three files become extraordinarily small. For security updates—the most common type of software patch—Colin's paper showed bsdiff achieved 58.3x compression. That means a 58MB binary could be updated with just a 1MB patch.

Google's Courgette, which powers Chrome updates, is essentially bsdiff with a preprocessing step: it disassembles the executable to normalize relative addresses before running the core bsdiff algorithm. Same elegant idea, just with a hat on.

Try it yourself

See bsdiff in action with this interactive demo. Enter two versions of text to see how the algorithm breaks down the patch into DIFF sections (matched regions with bytewise differences) and EXTRA sections (completely new data). In DIFF sections, bytes with diff value 0 are unchanged (shown in green), while non-zero values indicate changed bytes (shown in blue).

Example scenarios:

Old version:

New version:

A bsdiff implementation in C is available at GitHub, and the original paper "Naive differences of executable code" by Colin Percival provides excellent technical details.

bsdiff is a great reminder that sometimes complicated problems warrant simpler solutions.

"I would have written a shorter letter, but I did not have the time." — Blaise Pascal (though often misattributed to Mark Twain—both of whom would appreciate simpler software tools)