EASI Leaderboard

EASI: Holistic Evaluation of Multimodal LLMs on Spatial Intelligence

EASI defines a comprehensive taxonomy of spatial tasks that unifies existing benchmarks, together with a standardized protocol for fairly evaluating state-of-the-art proprietary and open-source models.

Score  Model
65.2   SenseNova-SI-1.3-InternVL3-8B
64.5   SenseNova-SI-1.2-InternVL3-8B
63.8   Gemini 3 Pro
61.5   SenseNova-SI-1.1-InternVL3-8B
59.5   GPT-5.2
59.1   GeoThinker
58.8   GPT-5
58.1   SenseNova-SI-1.1-Qwen3-VL-8B
58.0   Gemini 2.5 Pro
54.2   Seed 1.6
53.3   Grok 4
51.0   SenseNova-SI-1.1-Qwen2.5-VL-7B
50.8   VST-7B-SFT
50.6   Qwen3-VL-8B-Instruct
49.4   SenseNova-SI-1.1-InternVL3-2B
49.0   InternVL3_5-8B
48.6   SenseNova-SI-1.1-BAGEL-7B-MoT
47.7   VST-3B-SFT
46.6   vlm-3r-llava-qwen2-lora
45.7   InternVL3-8B
45.7   SenseNova-SI-1.1-Qwen2.5-VL-3B
45.3   BAGEL-7B-MoT
45.1   Cambrian-S-7B
44.6   Qwen3-VL-2B-Instruct
43.7   ViLaSR
43.2   InternVL3_5-2B-Instruct
43.1   Qwen2.5-VL-7B-Instruct
42.0   Cambrian-S-3B
41.8   SpaceR-SFT-7B
40.9   SpatialLadder-3B
39.8   InternVL3-2B
39.1   Qwen2.5-VL-3B-Instruct
35.6   Spatial-MLLM-subset-sft
22.0   MindCube-Qwen2.5VL-RawQA-SFT

Last updated: 2026-02-14 04:05:53 UTC