Hugging Face
1040.7 TFLOPS
Erfan Shayegani
Erfan-Shayegani
9 followers · 8 following
https://erfanshayegani.github.io/
Erf_Shayegani
erfanshayegani
erfan-shayegani
AI & ML interests
AI Safety - Responsible AI - Multi-Modal Alignment
Recent Activity
updated a model 6 days ago: Erfan-Shayegani/smolgrpo2-paddingRight
published a model 6 days ago: Erfan-Shayegani/smolgrpo2-paddingRight
updated a model 6 days ago: Erfan-Shayegani/smolgrpo2-paddingLeft
Organizations
Erfan-Shayegani's models (22)
Sort: Recently updated
Erfan-Shayegani/smolgrpo2-paddingRight • Updated 6 days ago
Erfan-Shayegani/smolgrpo2-paddingLeft • Updated 6 days ago
Erfan-Shayegani/Qwen2-0-5B-GRPO-vllm-trl • Updated 6 days ago
Erfan-Shayegani/wordle-grpo-Qwen3-1.7B-test • Text Generation • 2B • Updated 7 days ago • 15
Erfan-Shayegani/browsergym-grpo-functiongemma-270m-it • Updated 9 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking-GRPO-corrected-formatreward • Updated 9 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking-2 • Updated 9 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking-GRPO • Updated 9 days ago
Erfan-Shayegani/Qwen2.5-VL-3B-Instruct-Thinking • Updated 9 days ago
Erfan-Shayegani/Qwen-3B-GRPO-gsm8k • Updated 10 days ago
Erfan-Shayegani/Qwen2-0.5B-GRPO-test-again • Updated 10 days ago
Erfan-Shayegani/Qwen2-0.5B-GRPO-test • Updated 10 days ago
Erfan-Shayegani/smolgrpo2 • Updated 11 days ago
Erfan-Shayegani/smolgrpo • Updated 11 days ago
Erfan-Shayegani/llama2-lora_Unlearned_Accelerate_bad_weight_0.05 • Updated Apr 15, 2024
Erfan-Shayegani/llama2-lora_Unlearned_Accelerate_bad_weight_0.5 • Updated Apr 15, 2024
Erfan-Shayegani/llama2-lora_Unlearned_bad_weight_5e-1 • Updated Apr 15, 2024
Erfan-Shayegani/llama2-lora_Unlearned_bad_weight_5e-2 • Updated Apr 15, 2024
Erfan-Shayegani/llama2-lora_Unlearned • Updated Apr 14, 2024
Erfan-Shayegani/opt-1.3b-lora_Unlearned • Updated Apr 13, 2024
Erfan-Shayegani/GPTNeo-20b-lora • Updated Apr 10, 2024
Erfan-Shayegani/opt-6.7b-lora • Updated Apr 10, 2024