Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2401.09417

Woodpecker: Hallucination Correction for Multimodal Large Language Models

Paper • 2310.16045 • Published Oct 24, 2023 • 17
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Paper • 2310.14566 • Published Oct 23, 2023 • 27
SILC: Improving Vision Language Pretraining with Self-Distillation

Paper • 2310.13355 • Published Oct 20, 2023 • 9
Conditional Diffusion Distillation

Paper • 2310.01407 • Published Oct 2, 2023 • 20

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

Mamba based models

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62
VMamba: Visual State Space Model

Paper • 2401.10166 • Published Jan 18, 2024 • 39
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Paper • 2405.14224 • Published May 23, 2024 • 15
Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 148

StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization

Paper • 2311.14495 • Published Nov 24, 2023 • 1
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

Paper • 2401.13560 • Published Jan 24, 2024 • 1
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces

Paper • 2402.00789 • Published Feb 1, 2024 • 2

StateSpaceModels

ZigMa: Zigzag Mamba Diffusion Model

Paper • 2403.13802 • Published Mar 20, 2024 • 18
Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 111
VMamba: Visual State Space Model

Paper • 2401.10166 • Published Jan 18, 2024 • 39
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

Paper • 2405.19893 • Published May 30, 2024 • 33
Is Cosine-Similarity of Embeddings Really About Similarity?

Paper • 2403.05440 • Published Mar 8, 2024 • 3
Evaluating Unsupervised Text Classification: Zero-shot and Similarity-based Approaches

Paper • 2211.16285 • Published Nov 29, 2022
Similarity-Based Domain Adaptation with LLMs

Paper • 2503.05281 • Published Mar 7, 2025

Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Paper • 2311.08046 • Published Nov 14, 2023 • 2
nvidia/GR00T-N1-2B

Robotics • 2B • Updated Sep 2, 2025 • 133 • 341
nvidia/Eagle2-1B

Image-Text-to-Text • 1B • Updated Apr 27, 2025 • 157 • 26
nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

Updated 28 days ago • 849k • 182

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26, 2024 • 24
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

Woodpecker: Hallucination Correction for Multimodal Large Language Models

Paper • 2310.16045 • Published Oct 24, 2023 • 17
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Paper • 2310.14566 • Published Oct 23, 2023 • 27
SILC: Improving Vision Language Pretraining with Self-Distillation

Paper • 2310.13355 • Published Oct 20, 2023 • 9
Conditional Diffusion Distillation

Paper • 2310.01407 • Published Oct 2, 2023 • 20

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts

Paper • 2405.19893 • Published May 30, 2024 • 33
Is Cosine-Similarity of Embeddings Really About Similarity?

Paper • 2403.05440 • Published Mar 8, 2024 • 3
Evaluating Unsupervised Text Classification: Zero-shot and Similarity-based Approaches

Paper • 2211.16285 • Published Nov 29, 2022
Similarity-Based Domain Adaptation with LLMs

Paper • 2503.05281 • Published Mar 7, 2025

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Paper • 2311.08046 • Published Nov 14, 2023 • 2
nvidia/GR00T-N1-2B

Robotics • 2B • Updated Sep 2, 2025 • 133 • 341
nvidia/Eagle2-1B

Image-Text-to-Text • 1B • Updated Apr 27, 2025 • 157 • 26
nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

Updated 28 days ago • 849k • 182

Mamba based models

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62
VMamba: Visual State Space Model

Paper • 2401.10166 • Published Jan 18, 2024 • 39
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Paper • 2405.14224 • Published May 23, 2024 • 15
Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 148

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26, 2024 • 24
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization

Paper • 2311.14495 • Published Nov 24, 2023 • 1
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation

Paper • 2401.13560 • Published Jan 24, 2024 • 1
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces

Paper • 2402.00789 • Published Feb 1, 2024 • 2

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

StateSpaceModels

ZigMa: Zigzag Mamba Diffusion Model

Paper • 2403.13802 • Published Mar 20, 2024 • 18
Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 111
VMamba: Visual State Space Model

Paper • 2401.10166 • Published Jan 18, 2024 • 39
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 62

Previous
1
2
3
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs