Wei Cheng

wchengad

https://wchengad.github.io/

Recent Activity

upvoted a paper 6 days ago

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

authored a paper 7 days ago

GEditBench v2: A Human-Aligned Benchmark for General Image Editing

upvoted a paper 7 days ago

GEditBench v2: A Human-Aligned Benchmark for General Image Editing

upvoted a paper 6 days ago

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

Paper • 2603.29664 • Published 7 days ago • 46

upvoted a paper 7 days ago

GEditBench v2: A Human-Aligned Benchmark for General Image Editing

Paper • 2603.28547 • Published 8 days ago • 32

upvoted 2 papers 11 days ago

RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models

Paper • 2603.25502 • Published 12 days ago • 55

PixelSmile: Toward Fine-Grained Facial Expression Editing

Paper • 2603.25728 • Published 12 days ago • 116

upvoted a paper about 1 month ago

OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Paper • 2603.02138 • Published Mar 2 • 150

upvoted a paper about 2 months ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 194

upvoted a paper 2 months ago

HY3D-Bench: Generation of 3D Assets

Paper • 2602.03907 • Published Feb 3 • 24

upvoted 6 papers 3 months ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 195

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Paper • 2601.10527 • Published Jan 15 • 26

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 200

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Paper • 2601.05593 • Published Jan 9 • 86

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 175

VINO: A Unified Visual Generator with Interleaved OmniModal Context

Paper • 2601.02358 • Published Jan 5 • 30

upvoted 6 papers 4 months ago

Relational Visual Similarity

Paper • 2512.07833 • Published Dec 8, 2025 • 25

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published Dec 5, 2025 • 38

Captain Safari: A World Engine

Paper • 2511.22815 • Published Nov 28, 2025 • 12

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 245

REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Paper • 2511.22625 • Published Nov 27, 2025 • 48

iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation

Paper • 2511.20635 • Published Nov 25, 2025 • 32

upvoted a paper 5 months ago

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58