JingyeChen22

https://jingyechen.github.io

JingyeChen

AI & ML interests

OCR, Document Analysis, Text-to-X

Recent Activity

liked a model 3 days ago

cahlen/lingbot-world-base-cam-nf4

liked a Space 5 days ago

mrfakename/Z-Image-Turbo

authored a paper 10 days ago

Advancing Open-source World Models

Organizations

None yet

upvoted a paper 10 days ago

Advancing Open-source World Models

Paper • 2601.20540 • Published 11 days ago • 119

upvoted a paper about 1 month ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 146

upvoted 2 papers about 2 months ago

LongVideoAgent: Multi-Agent Reasoning with Long Videos

Paper • 2512.20618 • Published Dec 23, 2025 • 54

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

Paper • 2512.16924 • Published Dec 18, 2025 • 27

upvoted a paper 4 months ago

DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published Oct 13, 2025 • 27

upvoted 2 papers 5 months ago

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 143

upvoted 2 papers 6 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 118

Geometric-Mean Policy Optimization

Paper • 2507.20673 • Published Jul 28, 2025 • 32

upvoted 3 papers 7 months ago

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published Jul 8, 2025 • 60

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Paper • 2507.07982 • Published Jul 10, 2025 • 34

Calligrapher: Freestyle Text Image Customization

Paper • 2506.24123 • Published Jun 30, 2025 • 37

upvoted a paper 8 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

upvoted a paper 9 months ago

ImgEdit: A Unified Image Editing Dataset and Benchmark

Paper • 2505.20275 • Published May 26, 2025 • 18

upvoted 2 papers 10 months ago

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Paper • 2504.08388 • Published Apr 11, 2025 • 42

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8, 2025 • 64

upvoted a paper 11 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27, 2025 • 79

upvoted a paper about 1 year ago

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published Dec 23, 2024 • 24

upvoted a collection about 1 year ago

RoLoRA

Collection

[EMNLP2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization • 3 items • Updated Sep 26, 2024 • 3

upvoted a paper about 1 year ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 98
