MihailSlutsky

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

OpenGVLab/InternVL3-2B-Instruct

liked a model 2 days ago

stdstu123/Yume-5B-720P

liked a model 3 days ago

facebook/nwm

Organizations

None yet

upvoted a paper 3 days ago

EgoTwin: Dreaming Body and View in First Person

Paper • 2508.13013 • Published Aug 18, 2025 • 21

upvoted a paper 14 days ago

Unified Video Editing with Temporal Reasoner

Paper • 2512.07469 • Published 30 days ago • 45

upvoted a paper 20 days ago

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Paper • 2511.09057 • Published Nov 12, 2025 • 76

upvoted a paper 30 days ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published Nov 25, 2025 • 182

upvoted 16 papers about 1 month ago

Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT

Paper • 2511.17405 • Published Nov 21, 2025 • 10

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Paper • 2511.06411 • Published Nov 9, 2025 • 17

Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions

Paper • 2511.06876 • Published Nov 10, 2025 • 27

Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs

Paper • 2511.07419 • Published Nov 10, 2025 • 26

Robot Learning from a Physical World Model

Paper • 2511.07416 • Published Nov 10, 2025 • 30

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Paper • 2511.07327 • Published Nov 10, 2025 • 76

HaluMem: Evaluating Hallucinations in Memory Systems of Agents

Paper • 2511.03506 • Published Nov 5, 2025 • 93

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10, 2025 • 105

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published Nov 11, 2025 • 41

VideoSSR: Video Self-Supervised Reinforcement Learning

Paper • 2511.06281 • Published Nov 9, 2025 • 24

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published Nov 11, 2025 • 33

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 201

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 120

SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control

Paper • 2511.09715 • Published Nov 12, 2025 • 8

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 49