Aurora's picture

5

Aurora

xiaojuan0920

·

XiaojuanTang

AI & ML interests

None yet

Recent Activity

upvoted an article 12 days ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

upvoted a paper 2 months ago

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

upvoted a paper 4 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

View all activity

Organizations

None yet

upvoted an article 12 days ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

97

upvoted a paper 2 months ago

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

Paper • 2510.23691 • Published Oct 27, 2025 • 53

upvoted 2 papers 4 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 124

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill \& Decode Inference

Paper • 2508.15881 • Published Aug 21, 2025 • 9

upvoted a paper about 1 year ago

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published Oct 23, 2024 • 52