KoViDoRe Benchmark (BEIR) v2 Collection Korean Vision Document Retrieval Benchmark • 4 items • Updated Mar 2 • 6
Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval Paper • 2604.04734 • Published 8 days ago • 9
Article Multimodal Embedding & Reranker Models with Sentence Transformers 5 days ago • 38
Article How I contributed a new model to the Transformers library using Codex 15 days ago • 45
NanoVDR: Distilling a 2B Vision-Language Retriever into a 70M Text-Only Encoder for Visual Document Retrieval Paper • 2603.12824 • Published Mar 13 • 5
Article A framework and leaderboard for Retrieval Pipelines evaluation on ViDoRe v3 Feb 27 • 12
ModernVBERT: Towards Smaller Visual Document Retrievers Paper • 2510.01149 • Published Oct 1, 2025 • 33
VisionDocumentRetrieval Datasets Collection Datasets for vision document retrieval (VDR) • 19 items • Updated 27 days ago • 10
Article ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models Oct 18, 2024 • 21
SDS KoPub VDR: A Benchmark Dataset for Visual Document Retrieval in Korean Public Documents Paper • 2511.04910 • Published Nov 7, 2025 • 1
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 96
Training Sparse Mixture Of Experts Text Embedding Models Paper • 2502.07972 • Published Feb 11, 2025 • 10
ViDoRe V3: A Comprehensive Evaluation of Retrieval Augmented Generation in Complex Real-World Scenarios Paper • 2601.08620 • Published Jan 13 • 12
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval Paper • 2412.14475 • Published Dec 19, 2024 • 58
ViDoRe Community benchmark contributions Collection This collection gathers work done by the community to improve Visual Retrieval together! • 4 items • Updated Jan 9 • 1
Article Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models Jan 6 • 28
ViDoRe Benchmark V3 Collection ViDoRe V3 is our latest benchmark, engineered to set a new industry gold standard for multi-modal, enterprise document retrieval evaluation. • 8 items • Updated Jan 14 • 20