Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models Paper • 2406.02915 • Published Jun 5, 2024 • 1