FORMAT: Tool-Use Datasets - Hermes-Reasoning-Tool-Use Format Collection Datasets with formats inspired by / consistent with interstellarninja/hermes_reasoning_tool_use • 25 items • Updated 2 days ago • 3
view article Article Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents 19 days ago • 4
🤓Small-Datasets Collection Multi-stage high-quality datasets makes the model more helpful! • 8 items • Updated Jun 2, 2025 • 3
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 187
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67
Magpie Reasoning Datasets Collection Reasoning datasets built by Magpie and its friends! • 8 items • Updated Jan 27, 2025 • 13
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 41 items • Updated 26 days ago • 149
GPT-4 generated datasets Collection Collection of some GPT-4 generated datasets. It may be useful for those looking for the best-quality datasets to train competitive LLMs. • 18 items • Updated Apr 16, 2024 • 10
A little guide to building Large Language Models in 2024 Collection Resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757 • 17 items • Updated 26 days ago • 17
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 120