view article Article Mixture of Experts (MoEs) in Transformers +5 ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap • Feb 26 • 161
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 9 days ago • 53
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 394
RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation Paper • 2605.13542 • Published 10 days ago • 8
Hallucinations Undermine Trust; Metacognition is a Way Forward Paper • 2605.01428 • Published 21 days ago • 23
How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models Paper • 2604.21106 • Published 26 days ago • 8