OpenSec: Measuring Incident Response Agent Calibration Under Adversarial Evidence Paper • 2601.21083 • Published 2 days ago • 1
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 8 items • Updated 1 day ago • 60
view article Article Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL 7 days ago • 2
view article Article Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL 7 days ago • 2
PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models Paper • 2601.11087 • Published 15 days ago • 11
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 26 days ago • 37