Home

Awesome

NeMo Skills

NeMo-Skills is a collection of pipelines to improve "skills" of large language models. We mainly focus on the ability to solve mathematical problems, but you can use our pipelines for many other tasks as well. Here are some of the things we support.

You can find the full documentation here.

OpenMathInstruct-2

Using our pipelines we created OpenMathInstruct-2 dataset which consists of 14M question-solution pairs (600K unique questions), making it nearly eight times larger than the previous largest open-source math reasoning dataset.

The models trained on this dataset achieve strong results on common mathematical benchmarks.

<table> <tr> <td style="text-align: center;">model</td> <td style="text-align: center;">GSM8K</td> <td style="text-align: center;">MATH</td> <td style="text-align: center;">AMC 2023</td> <td style="text-align: center;">AIME 2024</td> <td style="text-align: center;">Omni-MATH</td> </tr> <tr> <td style="text-align: right;">Llama3.1-8B-Instruct</td> <td style="text-align: center;">84.5</td> <td style="text-align: center;">51.9</td> <td style="text-align: center;">9/40</td> <td style="text-align: center;">2/30</td> <td style="text-align: center;">12.7</td> </tr> <tr> <td style="text-align: right;">OpenMath2-Llama3.1-8B (<a href="https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B-nemo">nemo</a> | <a href="https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B">HF</a>)</td> <td style="text-align: center;">91.7</td> <td style="text-align: center;">67.8</td> <td style="text-align: center;">16/40</td> <td style="text-align: center;">3/30</td> <td style="text-align: center;">22.0</td> </tr> <tr> <td style="text-align: right;">+ majority@256</td> <td style="text-align: center;">94.1</td> <td style="text-align: center;">76.1</td> <td style="text-align: center;">23/40</td> <td style="text-align: center;">3/30</td> <td style="text-align: center;">24.6</td> </tr> <tr> <td style="text-align: right;">Llama3.1-70B-Instruct</td> <td style="text-align: center;">95.1</td> <td style="text-align: center;">68.0</td> <td style="text-align: center;">19/40</td> <td style="text-align: center;">6/30</td> <td style="text-align: center;">19.0</td> </tr> <tr> <td style="text-align: right;">OpenMath2-Llama3.1-70B (<a href="https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B-nemo">nemo</a> | <a href="https://huggingface.co/nvidia/OpenMath2-Llama3.1-70B">HF</a>)</td> <td style="text-align: center;">94.9</td> <td style="text-align: center;">71.9</td> <td style="text-align: center;">20/40</td> <td style="text-align: center;">4/30</td> <td style="text-align: center;">23.1</td> </tr> <tr> <td style="text-align: right;">+ majority@256</td> <td style="text-align: center;">96.0</td> <td style="text-align: center;">79.6</td> <td style="text-align: center;">24/40</td> <td style="text-align: center;">6/30</td> <td style="text-align: center;">27.6</td> </tr> </table>

We provide all instructions to fully reproduce our results.

See our paper for ablations studies and more details!

Nemo Inspector

We also provide a convenient tool for visualizing inference and data analysis

OverviewInference PageAnalyze Page
Demo of the toolDemo of the inference pageDemo of the analyze page

Papers

If you find our work useful, please consider citing us!

@article{toshniwal2024openmathinstruct2,
  title   = {OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data},
  author  = {Shubham Toshniwal and Wei Du and Ivan Moshkov and Branislav Kisacanin and Alexan Ayrapetyan and Igor Gitman},
  year    = {2024},
  journal = {arXiv preprint arXiv: Arxiv-2410.01560}
}
@article{toshniwal2024openmathinstruct1,
  title   = {OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset},
  author  = {Shubham Toshniwal and Ivan Moshkov and Sean Narenthiran and Daria Gitman and Fei Jia and Igor Gitman},
  year    = {2024},
  journal = {arXiv preprint arXiv: Arxiv-2402.10176}
}

Disclaimer: This project is strictly for research purposes, and not an official product from NVIDIA.