Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language Models
Paper • 2604.00375 • Published • 6
Human-Machine Communication
Verifiable Rewards Beyond Math and Code: Lightweight Corpus-Grounded Process Supervision for Factual Question Answering
Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models