arxiv:2508.15763
Zhouqi Hua
ZhouqiHUA
AI & ML interests
reasoning LLM
Recent Activity
upvoted a paper about 3 hours ago
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization
liked a dataset 13 days ago
openai/gsm8k
upvoted a paper about 2 months ago
Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning
Organizations
None yet