VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors Paper • 2604.02486 • Published 16 days ago • 10
Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots Paper • 2504.03735 • Published Apr 1, 2025 • 1
Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness Paper • 2510.01670 • Published Oct 2, 2025 • 8
Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! • Jun 6, 2025 • 56
Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis Paper • 2502.20383 • Published Feb 27, 2025 • 3
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model Paper • 2503.05132 • Published Mar 7, 2025 • 57
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 129
Collection Phi-3 - family of small language and multi-modal models; language models are available in short- and long-context lengths. • 25 items • Updated Mar 2 • 580
Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context • Jul 23, 2024 • 241