tulu-3-sft-olmo-2-mixture
This dataset offers 939,344 diverse language samples for training multilingual AI models, available on Hugging Face under specific usage terms.
Tulu-3-SFT
multilingual data set