Running 3.62k The Ultra-Scale Playbook π 3.62k The ultimate guide to training LLM on large GPU Clusters
view article Article Blazingly fast whisper transcriptions with Inference Endpoints +4 May 13, 2025 β’ 81
Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition Paper β’ 2305.05084 β’ Published May 8, 2023 β’ 3
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 16 days ago β’ 91