HUME: Measuring the Human-Model Performance Gap in Text Embedding Task Paper • 2510.10062 • Published Oct 11, 2025 • 10
Dynaword: From One-shot to Continuously Developed Datasets Paper • 2508.02271 • Published Aug 4, 2025 • 15
TextDescriptives: A Python package for calculating a large variety of metrics from text Paper • 2301.02057 • Published Jan 5, 2023
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 45
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks Paper • 2406.13469 • Published Jun 19, 2024