Datasets
Open data, free to use.
Eval suites, training corpora, and standalone collections. Some are tied to our models. Many are not — they're just useful data we wanted to put in the open.
No datasets published yet.
We'll publish datasets here as they become ready: eval suites, corpora, or anything else we think the community will find useful.
Follow on Hugging Face