7 12 12

YannQi

https://yannqi.github.io/

yannqi

AI & ML interests

Computer vision, AGI, Multi-modality.

Recent Activity

upvoted a paper 8 days ago

CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering

liked a model 25 days ago

zai-org/GLM-5-FP8

liked a model about 2 months ago

moonshotai/Kimi-K2.5

View all activity

Organizations

upvoted a paper 8 days ago

CC-VQA: Conflict- and Correlation-Aware Method for Mitigating Knowledge Conflict in Knowledge-Based Visual Question Answering

Paper • 2602.23952 • Published 23 days ago • 3

liked a model 25 days ago

zai-org/GLM-5-FP8

Text Generation • 754B • Updated 11 days ago • 4.32M • 151

liked a model about 2 months ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated 23 days ago • 3.57M • • 2.32k

liked a model 3 months ago

XiaomiMiMo/MiMo-V2-Flash

Text Generation • 310B • Updated 23 days ago • 211k • • 669

authored 3 papers 4 months ago

Knowledge-based Visual Question Answer with Multimodal Processing, Retrieval and Filtering

Paper • 2510.14605 • Published Oct 16, 2025 • 5

Taming Modality Entanglement in Continual Audio-Visual Segmentation

Paper • 2510.17234 • Published Oct 20, 2025 • 5

HunyuanOCR Technical Report

Paper • 2511.19575 • Published Nov 24, 2025 • 22

upvoted 3 papers 4 months ago

Knowledge-based Visual Question Answer with Multimodal Processing, Retrieval and Filtering

Paper • 2510.14605 • Published Oct 16, 2025 • 5

Taming Modality Entanglement in Continual Audio-Visual Segmentation

Paper • 2510.17234 • Published Oct 20, 2025 • 5

HunyuanOCR Technical Report

Paper • 2511.19575 • Published Nov 24, 2025 • 22

liked a model 4 months ago

tencent/HunyuanOCR

Image-Text-to-Text • Updated Jan 13 • 401k • 555

liked a Space 6 months ago

R 4B

🔥

Chat with images and text using 🤗 Transformers

upvoted a collection 6 months ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.72k

liked a model 6 months ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 4.96M • • 1.47k

updated a model 7 months ago

YannQi/R-4B

Image-Text-to-Text • 5B • Updated Sep 4, 2025 • 63.1k • 181

New activity in YannQi/R-4B 7 months ago

Issue serving YannQi/R-4B with official vLLM Docker image

#3 opened 7 months ago by

vm7608

authored a paper 7 months ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28, 2025 • 110

New activity in YannQi/R-4B 7 months ago

can you provide onnx model weights?

#2 opened 7 months ago by

ningpp

Update pipeline_tag and add library_name for R-4B

#1 opened 7 months ago by

nielsr

upvoted a paper 7 months ago