gameboy-eval leaderboard

How well can a coding agent build a Game Boy (DMG) emulator from scratch? Composite = 0.60·replay + 0.20·audio + 0.20·procedural, graded against a SameBoy oracle. 1.00 = indistinguishable from the reference.

#candidateoverallband replay / audio / proc

Composite by section

Run a candidate in your browser

Each graded artifact is a wasm32-unknown-unknown module, and it runs right here. It boots dmg-acid2 by default, which is a PPU correctness test (a single static frame), not a game. To actually play, load a homebrew game ROM. You can grab one from the Homebrew Hub.

Keys: arrows = D-pad, Z = A, X = B, Enter = Start, Shift = Select.

dmg-acid2 (default)