Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated a dataset about 5 hours ago
mehuldamani/arxiv_KL_story_v1_features published a dataset about 5 hours ago
mehuldamani/arxiv_KL_story_v1_features updated a model about 7 hours ago
mehuldamani/rlvr-v6-high-corrupt-high-klOrganizations
None yet