view post Post 1221 New GRPO + TRL free Colab notebook out! ๐ฅFine-tune 7B+ models on T4 GPUs thanks to a ton of memory optimizations for GRPO 7B model uses only 9.2 GB VRAM (~7ร reduction) ๐คฏTry the notebook here ๐ https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_trl_lora_qlora.ipynb See translation ๐ฅ 5 5 ๐ 1 1 + Reply
view post Post 1497 Happy birthday to me!!! See translation 1 reply ยท ๐ค 12 12 ๐ 6 6 ๐ 3 3 โค๏ธ 2 2 + Reply
Jamba2 Collection Jamba2 is a highly-efficient open source family of language models built for maximum reliability and steerability in the enterprise. โข 3 items โข Updated 1 day ago โข 5
view post Post 3507 New family of 1B models just dropped!> LiquidAI/LFM2.5-1.2B-Base: 10T โ 28T tokens> LiquidAI/LFM2.5-1.2B-Instruct: new large-scale multi-stage RL> LiquidAI/LFM2.5-1.2B-JP: our most polite model> LiquidAI/LFM2.5-VL-1.6B: multi-image multilingual> LiquidAI/LFM2.5-Audio-1.5B: 8x times faster, no quality lossSuper proud of this release ๐ค See translation 3 replies ยท ๐ 14 14 + Reply
๐ง LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. โข 19 items โข Updated 4 days ago โข 59
LiquidAI/LFM2.5-1.2B-Instruct Text Generation โข 1B โข Updated about 17 hours ago โข 5.79k โข 222
neo-3 Collection My series of fully open, state-of-the-art small mixture-of-experts models. โข 11 items โข Updated 9 days ago