BEDA: Belief Estimation as Probabilistic Constraints for Performing Strategic Dialogue Acts
Paper
•
2512.24885
•
Published
•
4
None defined yet.
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning
Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation