view article Article Introducing RWKV - An RNN with the advantages of a transformer +2 May 15, 2023 • 24
view article Article Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI 19 days ago • 59
Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 20 days ago • 63
LinguaGame: A Linguistically Grounded Game-Theoretic Paradigm for Multi-Agent Dialogue Generation Paper • 2601.04516 • Published Jan 8 • 1
Claude 4.5 Opus Collection Distilled models and datasets for Claude 4.5 Opus. • 14 items • Updated Mar 2 • 32
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 95
jina-embeddings-v5-text Collection Our 5th-gen embeddings: two lightweight multilingual models with SOTA performance in retrieval, matching, clustering, and classification. • 29 items • Updated Feb 27 • 38
OpenResearcher Collection OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • 8 items • Updated 12 days ago • 16
propella-1 Collection Small multilingual LLMs for annotating and curating LLM training data. • 4 items • Updated Jan 15 • 4
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30, 2025 • 132