Article EMO: Pretraining mixture of experts for emergent modularity allenai • 16 days ago • 38
Deepfake Classification 022025 Collection based on recent dataset • 8 items • Updated 28 days ago • 3
Japanese Role-playing Dataset Collection Datasets for Japanese role-play • 17 items • Updated Oct 7, 2025 • 13
FLUX.1 Collection A collection of our FLUX.1 models and LoRAs. • 13 items • Updated Jan 2 • 316
shadow-peft-models Collection pretrained weights and data for the ShadowPEFT paper • 30 items • Updated Apr 22 • 4
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning Paper • 2604.19254 • Published Apr 21 • 30
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
Maximal Brain Damage Without Data or Optimization: Disrupting Neural Networks via Sign-Bit Flips Paper • 2502.07408 • Published Apr 16 • 59
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 24 items • Updated Apr 16 • 58
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 436
100 Coder/Programming - MOE, Reasoning, Reg, Imatrix, Fused. Collection Models (0.8B to 87B) in regular, "reasoning", "Brainstorm", MOE (1x to 8x / 128 experts), and expanded to create better and stronger code, faster. • 68 items • Updated 11 days ago • 33
200+ Roleplay, Creative Writing, Uncensored, NSFW models. Collection Oldest models listed first, with Newest models at bottom of the page. Most repos have full examples, instructions, best settings and so on. • 282 items • Updated about 13 hours ago • 766