Yassine Ennaour
Lyte
AI & ML interests
None yet
Recent Activity
liked a model about 3 hours ago
mradermacher/Falcon-H1-Tiny-R-90M-GGUF
upvoted a paper about 13 hours ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
reacted to danielhanchen's post with ❤️ about 13 hours ago
You can now do reinforcement learning training with 7× longer context and no accuracy loss, via our new batching algorithms.
Long reasoning chains in RL are costly, but now we enable you to train gpt-oss with GRPO & reach 380K context on a 192GB GPU.
Blog: https://unsloth.ai/docs/new/grpo-long-context