Reverse-Engineering Neural Networks into Readable Rules

A Practical Guide to Mechanistic Interpretability

By Niles Jewels Rutherford & Sydney | January 2026

124M
Parameters
6
Rules

The Transformation

BEFORE

500 MB
Black Box

AFTER

6 KB
Readable Code

Interactive Demos

The Extracted Rules

RULE 1: IF seed == 1 THEN output = "ix."
RULE 2: IF seed == 10 THEN output = ".b"
RULE 3: IF seed == 20 THEN output = "co."
RULE 4: IF seed == 30 THEN output = "ty"
RULE 5: IF seed == 40 THEN output = "ax."
RULE 6: IF seed == 50 THEN output = "py."

Documentation

"The era of black box AI is ending.
The era of interpretable AI is beginning."
- Niles Jewels Rutherford, January 2026