π LLM Engineer's Handbook Collection Models and datasets from my book. All the code is freely available at https://github.com/PacktPublishing/LLM-Engineers-Handbook β’ 6 items β’ Updated Apr 7, 2025 β’ 16
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence β’ 5 items β’ Updated Jan 27 β’ 172
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper β’ 2505.07608 β’ Published May 12, 2025 β’ 82
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper β’ 2505.04921 β’ Published May 8, 2025 β’ 187
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper β’ 2505.03335 β’ Published May 6, 2025 β’ 191
view article Article Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer) +1 Jun 16, 2023 β’ 45
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper β’ 2505.00551 β’ Published May 1, 2025 β’ 36
ReasonIR: Training Retrievers for Reasoning Tasks Paper β’ 2504.20595 β’ Published Apr 29, 2025 β’ 54
view article Article What is MoE 2.0? Update Your Knowledge about Mixture-of-experts Apr 27, 2025 β’ 10
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning Paper β’ 2505.02835 β’ Published May 5, 2025 β’ 28
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? Mar 17, 2025 β’ 355
view article Article Mini-R1: Reproduce Deepseek R1 βaha momentβ a RL tutorial Jan 31, 2025 β’ 51
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper β’ 2504.18415 β’ Published Apr 25, 2025 β’ 49
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper β’ 2504.21233 β’ Published Apr 30, 2025 β’ 49