Overfitted a 900KB Transformer to Compress a 100MB CSV into 7MB
By spidy__ · 2026-06-23 · 3 points · 2 comments
I built an experiment that uses an overfitted transformer and arithmetic coding to compress individual files. Instead of training the model to generalize, I train a 900KB transformer to memorize a single file and predict the next byte. Those predictions are fed into an arithmeti…
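The predict-then-encode loop described above can be sketched in a few lines. This is a toy illustration, not the author's code: a hypothetical `AdaptiveByteModel` that only counts bytes stands in for the overfitted transformer, and the coder uses exact `Fraction` arithmetic instead of the fixed-precision integer arithmetic a practical arithmetic coder would use. The function names `encode` and `decode` are likewise assumptions made for the example.

```python
from fractions import Fraction
import math


class AdaptiveByteModel:
    """Hypothetical stand-in for the trained transformer: adaptive byte counts."""

    def __init__(self):
        self.counts = [1] * 256  # start at 1 so no byte ever has probability 0
        self.total = 256

    def interval(self, byte):
        """Return (probability mass below `byte`, probability of `byte`)."""
        below = sum(self.counts[:byte])
        return Fraction(below, self.total), Fraction(self.counts[byte], self.total)

    def update(self, byte):
        self.counts[byte] += 1
        self.total += 1


def encode(data: bytes):
    """Narrow [low, low + width) by each byte's predicted probability."""
    model = AdaptiveByteModel()
    low, width = Fraction(0), Fraction(1)
    for b in data:
        cum, p = model.interval(b)
        low += width * cum
        width *= p  # well-predicted bytes shrink the interval less
        model.update(b)
    # Find the fewest bits whose grid spacing 2**-nbits fits inside the interval.
    nbits = 0
    while Fraction(1, 1 << nbits) > width:
        nbits += 1
    code = math.ceil(low * (1 << nbits))  # code / 2**nbits lies in the interval
    return code, nbits


def decode(code: int, nbits: int, length: int) -> bytes:
    """Replay the same model and pick the byte whose sub-interval holds the code."""
    model = AdaptiveByteModel()
    value = Fraction(code, 1 << nbits)
    low, width = Fraction(0), Fraction(1)
    out = bytearray()
    for _ in range(length):
        scaled = (value - low) / width * model.total
        below = 0
        for b in range(256):
            if below + model.counts[b] > scaled:
                break
            below += model.counts[b]
        cum, p = model.interval(b)
        low += width * cum
        width *= p
        model.update(b)
        out.append(b)
    return bytes(out)


sample = b"a,b,c\n" * 50  # made-up CSV-like input for the demo
code, nbits = encode(sample)
print(len(sample) * 8, "bits in,", nbits, "bits out")
print("round trip ok:", decode(code, nbits, len(sample)) == sample)
```

The output size is close to the sum of `-log2(p)` over all bytes, so a model that predicts the file better yields a smaller result. The decoder has to reproduce exactly the same predictions as the encoder, which in the post's setup means it needs access to the same trained model.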