datasets
A superset of the highest quality datasets, with the highest quality versions, and active deduplication to train on the entire collection.
kaiokendev/SuperCOT-dataset • Updated May 26, 2023 • 58.3k • 69 • 46
kaist-ai/CoT-Collection • Updated Oct 14, 2023 • 1.84M • 1.33k • 154
TIGER-Lab/MathInstruct • Updated May 15, 2024 • 262k • 3.99k • 295
Open-Orca/OpenOrca • Updated Feb 19, 2025 • 2.94M • 13.8k • 1.48k
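The collection description mentions active deduplication before training on the combined data, but does not say how it is done. As a rough illustration only, here is a minimal sketch of exact-match deduplication across several sources. It assumes each record is a dict with hypothetical `prompt` and `response` fields, and it hashes whitespace- and case-normalized text; the actual method and schema used for this collection are not stated in the source.

```python
import hashlib


def normalize(text: str) -> str:
    """Lowercase and collapse runs of whitespace so trivial variants match."""
    return " ".join(text.lower().split())


def record_key(record: dict) -> str:
    """Hash the normalized prompt/response pair.

    The `prompt` and `response` field names are hypothetical; real datasets
    use different column names and would need mapping first.
    """
    joined = normalize(record["prompt"]) + "\x1f" + normalize(record["response"])
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()


def deduplicate(*sources):
    """Merge several iterables of records, keeping the first copy of each."""
    seen = set()
    merged = []
    for source in sources:
        for record in source:
            key = record_key(record)
            if key not in seen:
                seen.add(key)
                merged.append(record)
    return merged


if __name__ == "__main__":
    set_a = [{"prompt": "What is 2+2?", "response": "4"}]
    set_b = [
        {"prompt": "what is  2+2?", "response": "4"},  # duplicate after normalization
        {"prompt": "Name a prime.", "response": "7"},
    ]
    print(len(deduplicate(set_a, set_b)))
```

Exact-match hashing only catches identical or trivially reformatted records; near-duplicate detection (for example MinHash) would be a separate, heavier step.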
Alignment-Lab-AI/Mistral-nemo-3b-unhealed Text Generation • 5B • Updated Jan 25, 2025 • 7 • 1