Home

Awesome

OpenAI Grok Curve Experiments

Paper

This is the code for the paper Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets by Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, and Vedant Misra

Installation and Training

pip install -e .
./scripts/train.py