Snowflake/snowflake-arctic-embed-m-v2.0 Sentence Similarity • 0.3B • Updated Apr 24, 2025 • 69.9k • 99
Article Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach Nov 24, 2024 • 17
Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging Paper • 2406.16330 • Published Jun 24, 2024 • 1
Casual-Autopsy/L3-Super-Nova-RP-8B Text Generation • 8B • Updated Sep 12, 2024 • 110 • 29