LightningRodLabs/future-as-label-paper-step160 Reinforcement Learning • 33B • Updated Jan 16 • 12 • 4
JonusNattapong/Reinforcement-Learning-for-Gold-Trading-Model Reinforcement Learning • Updated Dec 23, 2025 • 16 • 4
NurseCitizenDeveloper/NurseSim-Triage-Llama-3.2-3B Reinforcement Learning • 3B • Updated 8 days ago • 22 • 1
AdityaaXD/Multi-Agent_Reinforcement_Learning_Trading_System_Models Reinforcement Learning • Updated 18 days ago • 180 • 2
mradermacher/NurseSim-Triage-Llama-3.2-3B-GGUF Reinforcement Learning • 3B • Updated 7 days ago • 625 • 1
mradermacher/NurseSim-Triage-Llama-3.2-3B-i1-GGUF Reinforcement Learning • 3B • Updated 7 days ago • 2.13k • 1
sbhokare/Qwen2.5-7B-Instruct-ToolRL-PPO-Cold-Equal-Max Reinforcement Learning • 8B • Updated 4 days ago • 22 • 1
mradermacher/HER-32B-absolute-heresy-i1-GGUF Reinforcement Learning • 33B • Updated 3 days ago • 7.07k • 1