Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
39
34
30
Shizhe Diao
shizhediao2
Follow
lysandre's profile picture
bunyaminergen's profile picture
Jeyhon's profile picture
18 followers
·
13 following
https://shizhediao.github.io/
shizhediao
shizhediao
shizhediao
AI & ML interests
LLM pre-training and reasoning
Recent Activity
upvoted
a
paper
about 12 hours ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
liked
a model
3 days ago
nvidia/Nemotron-Flash-1B
updated
a dataset
23 days ago
nvidia/ToolScale
View all activity
Organizations
shizhediao2
's models
3
Sort:Â Recently updated
shizhediao2/ToolOrchestrator-8B
Updated
Oct 15, 2025
•
2
shizhediao2/Llama-Nemotron-8B-v1-Prorl
Updated
Aug 25, 2025
shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B
Updated
May 14, 2025