Math-LongCoT NovaSky-AI/Sky-T1-32B-Preview Text Generation • 33B • Updated Jan 13, 2025 • 94 • • 550 Qwen/QwQ-32B-Preview Text Generation • 33B • Updated Jan 12, 2025 • 7.5k • • 1.74k nvidia/AceMath-7B-Instruct Text Generation • 8B • Updated Jan 17, 2025 • 440 • • 28
Reward Model Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B Text Classification • Updated Aug 29, 2025 • 735 • 51 RLHFlow/Llama3.1-8B-PRM-Deepseek-Data Text Generation • 8B • Updated May 10, 2025 • 1.19k • • 37 Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B Text Classification • Updated Aug 29, 2025 • 3.31k • 33
MATH-TIR nvidia/OpenMathInstruct-1 Viewer • Updated Feb 16, 2024 • 6.08M • 3.55k • 244 AI-MO/NuminaMath-TIR Viewer • Updated Nov 25, 2024 • 72.5k • 2.54k • 141
LongCoT Dataset Tiiny/LONGCOT-Refine-500K Viewer • Updated Jan 2, 2025 • 522k • 88 • 51 Tiiny/QWQ-LONGCOT-500K Viewer • Updated Dec 26, 2024 • 286k • 221 • 124 amphora/QwQ-LongCoT-130K Viewer • Updated Dec 22, 2024 • 133k • 73 • 150 ServiceNow-AI/R1-Distill-SFT Viewer • Updated Feb 8, 2025 • 1.85M • 1.8k • 313
All Math Benchmark Datasets AI-MO/aimo-validation-aime Viewer • Updated May 7, 2025 • 90 • 6k • 65 HuggingFaceH4/MATH-500 Viewer • Updated Dec 15, 2025 • 500 • 97.9k • 281 TIGER-Lab/MMLU-STEM Viewer • Updated Jun 20, 2024 • 3.15k • 227 • 17
Math-LongCoT NovaSky-AI/Sky-T1-32B-Preview Text Generation • 33B • Updated Jan 13, 2025 • 94 • • 550 Qwen/QwQ-32B-Preview Text Generation • 33B • Updated Jan 12, 2025 • 7.5k • • 1.74k nvidia/AceMath-7B-Instruct Text Generation • 8B • Updated Jan 17, 2025 • 440 • • 28
LongCoT Dataset Tiiny/LONGCOT-Refine-500K Viewer • Updated Jan 2, 2025 • 522k • 88 • 51 Tiiny/QWQ-LONGCOT-500K Viewer • Updated Dec 26, 2024 • 286k • 221 • 124 amphora/QwQ-LongCoT-130K Viewer • Updated Dec 22, 2024 • 133k • 73 • 150 ServiceNow-AI/R1-Distill-SFT Viewer • Updated Feb 8, 2025 • 1.85M • 1.8k • 313
Reward Model Skywork/Skywork-o1-Open-PRM-Qwen-2.5-7B Text Classification • Updated Aug 29, 2025 • 735 • 51 RLHFlow/Llama3.1-8B-PRM-Deepseek-Data Text Generation • 8B • Updated May 10, 2025 • 1.19k • • 37 Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B Text Classification • Updated Aug 29, 2025 • 3.31k • 33
All Math Benchmark Datasets AI-MO/aimo-validation-aime Viewer • Updated May 7, 2025 • 90 • 6k • 65 HuggingFaceH4/MATH-500 Viewer • Updated Dec 15, 2025 • 500 • 97.9k • 281 TIGER-Lab/MMLU-STEM Viewer • Updated Jun 20, 2024 • 3.15k • 227 • 17
MATH-TIR nvidia/OpenMathInstruct-1 Viewer • Updated Feb 16, 2024 • 6.08M • 3.55k • 244 AI-MO/NuminaMath-TIR Viewer • Updated Nov 25, 2024 • 72.5k • 2.54k • 141