Alara Dirik

adirik

alaradirik

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

upvoted a paper 1 day ago

Aligning Latent Geometry for Spherical Flow Matching in Image Generation

liked a model 11 days ago

nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

View all activity

Organizations

upvoted 2 papers 1 day ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published 5 days ago • 74

Aligning Latent Geometry for Spherical Flow Matching in Image Generation

Paper • 2605.15193 • Published 5 days ago • 6

upvoted a paper 13 days ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper • 2605.00658 • Published 18 days ago • 82

upvoted a paper 25 days ago

DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation

Paper • 2604.20841 • Published 27 days ago • 24

upvoted 3 papers about 2 months ago

upvoted 4 papers 2 months ago

Efficiently Reconstructing Dynamic Scenes One D4RT at a Time

Paper • 2512.08924 • Published Dec 9, 2025 • 21

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published Mar 3 • 145

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

upvoted 2 papers 3 months ago

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Paper • 2312.02145 • Published Dec 4, 2023 • 8

Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching

Paper • 2602.12280 • Published Feb 12 • 34

upvoted an article 5 months ago

Article

Introduction to 3D Gaussian Splatting

dylanebert

•

Sep 18, 2023

• 137

upvoted an article 6 months ago

Article

We’re open-sourcing our text-to-image model and the process behind it

Photoroom

•

Nov 12, 2025

• 99

upvoted a collection 6 months ago

CoVT: Chain-of-Visual-Thought

Collection

Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought! • 7 items • Updated Nov 25, 2025 • 6

upvoted a paper 6 months ago

Φeat: Physically-Grounded Feature Representation

Paper • 2511.11270 • Published Nov 14, 2025 • 11

upvoted an article 9 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 293

upvoted 2 articles 10 months ago

Article

FineVideo: behind the scenes

mfarre, andito, lewtun, lvwerra, pcuenq, thomwolf

•

Sep 23, 2024

• 35

Article

CinePile 2.0 - making stronger datasets with adversarial refinement

RuchitRawal, mfarre, somepago, lvwerra

•

Oct 23, 2024

• 19

Alara Dirik

AI & ML interests

Recent Activity

Organizations

adirik's activity

Introduction to 3D Gaussian Splatting

We’re open-sourcing our text-to-image model and the process behind it

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

FineVideo: behind the scenes

CinePile 2.0 - making stronger datasets with adversarial refinement