MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens Paper • 2603.23516 • Published 23 days ago • 28
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis Paper • 2603.20278 • Published 11 days ago • 88
HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive Image-Text-to-Text • 35B • Updated 18 days ago • 479k • 1.04k
ibm-granite/granite-timeseries-flowstate-r1 Time Series Forecasting • 9.07M • Updated 9 days ago • 298k • 18
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition • Updated 15 days ago • 53k • 514
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated 4 days ago • 163k • 310