NVIDIA's recent Nemotrons tend to be open training data and code.
Probably as a base to use by people buying NVIDIA hardware to train their own.
NVIDIA's recent Nemotrons tend to be open training data and code.
Probably as a base to use by people buying NVIDIA hardware to train their own.
Nemotron is mostly open data. They only release a portions of their pre-training data. From https://docs.nvidia.com/nemotron/latest/nemotron/super3/pret...
Nemotron is the strongest model (on most benchmarks) that has its full training pipeline and most of the data open. Olmo 3 from AllenAI, and K2 Think V2 from Mohamed bin Zayed University of Artificial Intelligence are both fully open, but not as capable as the Nemotron family. Granite has much of the training pipeline and data open, but is missing some of each.