
Zhangyue Yin

yinzhangyue

https://yinzhangyue.github.io/

AI & ML interests

Reasoning and Planning

Organizations

None yet

Recent Activity

upvoted a paper 12 days ago

Multi-hop Reasoning via Early Knowledge Alignment

Paper • 2512.20144 • Published 15 days ago • 6

upvoted a paper 29 days ago

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published 30 days ago • 57

upvoted 2 papers about 2 months ago

ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models

Paper • 2510.06014 • Published Oct 7, 2025 • 10

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 211

New activity in kernels-community/vllm-flash-attn3 2 months ago

attention sinks & backward

#3 opened 5 months ago by acforvs

upvoted 5 papers 2 months ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 77

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 71

liked a model 6 months ago

moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated Nov 7, 2025 • 66.6k • 2.3k

upvoted 3 papers 7 months ago

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published Jun 17, 2025 • 44

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache

Paper • 2506.11886 • Published Jun 13, 2025 • 20

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26, 2025 • 104

upvoted a paper 8 months ago

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2, 2025 • 41

upvoted 2 papers 9 months ago

Could Thinking Multilingually Empower LLM Reasoning?

Paper • 2504.11833 • Published Apr 16, 2025 • 29

Breaking the Data Barrier -- Building GUI Agents Through Task Generalization

Paper • 2504.10127 • Published Apr 14, 2025 • 17

upvoted a paper 10 months ago

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13, 2025 • 55

authored a paper 11 months ago

Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?

Paper • 2502.12215 • Published Feb 17, 2025 • 16

upvoted a paper 11 months ago

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

Paper • 2502.09082 • Published Feb 13, 2025 • 30
