TL;DR Bespoke Labs, Hugging Face, and Together.ai are launching a competition to find the most innovative reasoning datasets. Create a great proof-of-concept reasoning dataset and win prizes to help you scale your work!
Since the launch of DeepSeek-R1 in January 2025, we've witnessed remarkable growth in reasoning-focused datasets on the Hugging Face Hub. These datasets, such as OpenThoughts-114k, OpenCodeReasoning and codeforces-cot, have primarily centered on mathematics, coding, and science – domains with clearly verifiable answers.
However, we're now seeing reasoning approaches expand into new territories, including financial analysis, medical reasoning, and multi-domain reasoning.
A strong open dataset can have a massive impact on the open source community, enabling a new generation of models to be trained and evaluated. For example, OpenThoughts-114k has been used to train more than 230 models. We believe the next breakthroughs in model performance won’t come from architecture alone; they’ll come from better data. That’s why now is the perfect moment to rally the open source community around curating reasoning datasets that reflect the real world’s complexity, uncertainty, and richness.
To accelerate the progress on reasoning, we're launching a competition for reasoning datasets.
The goal is simple: create impactful proof-of-concept reasoning datasets and share them with the community. The best submissions will win prizes designed to help scale these datasets and train models using this data.
We're hosting all submissions on the Hugging Face Hub:
reasoning-datasets-competition
For judging purposes, we'll evaluate a sample of 100 rows from each submission (or the entire dataset if it contains exactly 100 rows). The evaluation criteria are described in a later section.
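If you want to preview roughly what a 100-row sample of your own dataset looks like before submitting, a minimal sketch follows. It is not the judges' actual procedure, which is not specified here: the helper name `sample_for_review`, the fixed seed, the toy row fields, and the choice to return a smaller dataset whole are all assumptions made for illustration.

```python
import random

SAMPLE_SIZE = 100  # sample size stated in the announcement


def sample_for_review(rows, sample_size=SAMPLE_SIZE, seed=0):
    """Return up to `sample_size` rows drawn without replacement.

    Hypothetical helper for self-review; not the organizers' sampling code.
    """
    rows = list(rows)
    if len(rows) <= sample_size:
        # Assumption: datasets at or below the sample size are reviewed whole.
        return rows
    rng = random.Random(seed)  # fixed seed so repeated runs return the same rows
    return rng.sample(rows, sample_size)


# Toy dataset of 250 rows with made-up field names.
toy_rows = [{"id": i, "question": f"q{i}", "reasoning": f"r{i}"} for i in range(250)]
preview = sample_for_review(toy_rows)
print(len(preview))
```

Reading through a sample like this by hand is a quick way to spot formatting problems or weak reasoning traces before anyone else sees them.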
To be considered for prizes, your dataset must meet the following criteria:
reasoning-datasets-competition
to be officially considered
While these are the minimum requirements, we encourage you to go beyond them! Think of your dataset card as your pitch. It’s your chance to showcase what makes your dataset the best and to help judges see why you deserve a high score across our evaluation criteria: Approach, Domain, and Quality.
Below, we've provided templates and examples to help you get started quickly.
We welcome all innovative approaches, but here are some areas we're particularly excited about:
We're keen to see datasets that expand reasoning beyond traditional STEM fields. Consider domains like:
While most reasoning datasets focus on improving benchmarks for mathematics or coding, there are other tasks where reasoning models could significantly improve performance:
One key insight from the DeepSeek paper was that distillation can effectively transfer reasoning capabilities from larger to smaller models. We're interested in datasets specifically designed for this purpose.
Beyond direct reasoning datasets, we're interested in collections that help build a robust reasoning ecosystem. This could include:
This area is one where you can potentially make a big impact without needing a lot of resources to get started.
Submissions will be judged on three dimensions: Approach, Domain, and Quality. Within these, we consider factors like novelty, scalability, and utility.
$1,500 USD in API credits from Together.ai
$1,500 USD gift card from Amazon (or country-specific equivalent)
Hugging Face Pro subscription (alongside compute credits to scale up the dataset)
$500 USD gift card from Amazon (or country-specific equivalent)
Hugging Face Pro subscription (alongside compute credits to scale up the dataset)
Top 4 innovative uses of Curator each win a $250 USD gift card from Amazon (or country-specific equivalent)
All Participants:
Every eligible participant receives $50 USD in API credits from Together.ai (details in FAQ below)
Step 1: Register here to receive Together.ai credit and updates on the competition
Step 2: Join the competition discussion thread on Hugging Face
Step 3: Join the #reasoning-dataset-competition channel on Discord
To get started quickly on creating a reasoning dataset, you can check out:
Q: How can I ask questions?
A: You can ask questions on this discussion thread.
Q: Can I submit multiple datasets?
A: Yes, you can submit as many datasets as you want.
Q: How do I claim Together AI credits?
A: Fill out this questionnaire on Together's website. Enter the hackathon name (question 6) as 'Reasoning datasets competition'.
Q: Can I collaborate with others?
A: Absolutely! Team submissions are welcome.
Q: Do I have to use Curator to generate my dataset?
A: You can use whatever tools you prefer to create the dataset.
Q: Do I have to use LLMs/synthetic data to generate my dataset?
A: No, you can take whatever approach you think is best.
Got more questions? Head over to the community discussion threads on Hugging Face and Discord.