OpenAI Grok Curve Experiments
Paper
This is the code for the paper "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" by Alethea Power, Yuri Burda, Harri Edwards, Igor Babuschkin, and Vedant Misra.
Installation and Training
pip install -e .
./scripts/train.py