Jackmin108's Collections
RL Models
Updated May 30, 2025
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B • Text Generation • 8B • Updated Feb 24, 2025 • 1.4M • 770
Jackmin108/qwen-7b-rl-step-1 • Text Generation • 8B • Updated May 30, 2025 • 5
Jackmin108/qwen-7b-rl-step-2 • Text Generation • 8B • Updated May 30, 2025 • 6
Jackmin108/qwen-7b-rl-step-3 • Text Generation • 8B • Updated May 30, 2025 • 2
Jackmin108/qwen-7b-rl-step-4 • Text Generation • 8B • Updated May 30, 2025 • 5
Jackmin108/qwen-7b-rl-step-8 • Text Generation • 8B • Updated May 30, 2025 • 5
Jackmin108/qwen-7b-rl-step-16 • Text Generation • 8B • Updated May 30, 2025 • 5
Jackmin108/qwen-7b-rl-step-31 • Text Generation • 8B • Updated May 30, 2025 • 5
Jackmin108/qwen-7b-rl-step-32 • Text Generation • 8B • Updated May 30, 2025 • 5