Mwangi PRO

Benson

AI & ML interests

None yet

Recent Activity

Organizations

None yet

liked a model about 22 hours ago

jinaai/jina-embeddings-v5-omni-small

Feature Extraction • 2B • Updated 3 days ago • 18.7k • 40

upvoted a paper 1 day ago

EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding

Paper • 2605.09874 • Published 5 days ago • 2

upvoted a paper 2 days ago

jina-embeddings-v5-omni: Text-Geometry-Preserving Multimodal Embeddings via Frozen-Tower Composition

Paper • 2605.08384 • Published 8 days ago • 7

upvoted a collection 3 days ago

jina-embeddings-v5-omni

Collection

Multimodal (text + image + video + audio) embedding models aligned with jina-embeddings-v5-text-*. Two sizes, four task variants each. • 27 items • Updated 3 days ago • 31

upvoted a paper 3 days ago

CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models

Paper • 2605.08735 • Published 7 days ago • 67

upvoted a paper 5 days ago

SkillOS: Learning Skill Curation for Self-Evolving Agents

Paper • 2605.06614 • Published 9 days ago • 42

liked a model 7 days ago

hao9610/X2SAM

Updated 11 days ago • 4

liked a dataset 7 days ago

yifanzhang114/MM-RLHF

Viewer • Updated Apr 21, 2025 • 16.3k • 220 • 14

liked a model 15 days ago

nvidia/Cosmos-Reason2-32B

Image-Text-to-Text • 33B • Updated 16 days ago • 4.25k • 9

upvoted an article 17 days ago

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

Article • nvidia • 17 days ago • 55

liked a model 17 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated Mar 15 • 1.07M • 734

liked a dataset 19 days ago

facebook/action100m-preview

Viewer • Updated Jan 29 • 120k • 1.24k • 141

upvoted a paper 25 days ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published 29 days ago • 58

upvoted a paper about 1 month ago

VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph

Paper • 2602.12735 • Published Feb 13 • 8

liked 3 models about 1 month ago

upvoted 2 papers about 1 month ago

WAVE: Learning Unified & Versatile Audio-Visual Embeddings with Multimodal LLM

Paper • 2509.21990 • Published Sep 26, 2025 • 1

A Simple Baseline for Streaming Video Understanding

Paper • 2604.02317 • Published Apr 2 • 73

liked a model about 1 month ago

tsinghua-ee/WAVE-7B

Updated Feb 11 • 53 • 3
