arxiv:2407.13048
Yu Meng
yumeng5
AI & ML interests
None yet
Recent Activity
upvoted a paper about 18 hours ago
Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration upvoted a paper 2 months ago
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning upvoted a paper 7 months ago
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning