Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2409.17115

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published Sep 3, 2024 • 34
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 27
Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126
Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17, 2024 • 22

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7, 2025 • 110
MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30, 2025 • 139
Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21, 2025 • 155
Antidistillation Sampling

Paper • 2504.13146 • Published Apr 17, 2025 • 59

ProX Refining Models

Adapted small language models used to generate data refining programs

gair-prox/web-doc-refining-lm

Text Generation • 0.4B • Updated Oct 10, 2024 • 64 • 5
gair-prox/web-chunk-refining-lm

Text Generation • 0.4B • Updated Oct 10, 2024 • 45 • 6
gair-prox/math-doc-refining-lm

Text Generation • 0.8B • Updated Oct 10, 2024 • 4 • 2
gair-prox/math-chunk-refining-lm

Text Generation • 0.4B • Updated Oct 10, 2024 • 2 • 1

ProX Math Models

base models trained on ProX curated openwebmath-pro.

gair-prox/Mistral-7B-ProXMath

Text Generation • 7B • Updated Sep 28, 2024 • 7 • 3
gair-prox/TinyLlama-1.1B-ProXMath

1B • Updated Oct 10, 2024 • 5 • 2
gair-prox/Llama-2-7B-ProXMath

Text Generation • Updated Oct 10, 2024 • 1
gair-prox/CodeLlama-7B-ProXMath

Updated Oct 10, 2024 • 1 • 1

Agentic-ly agentic

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 40
On the limits of agency in agent-based models

Paper • 2409.10568 • Published Sep 14, 2024 • 14
On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 13
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 66

about 1 month ago

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 127
Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Paper • 2405.04324 • Published May 7, 2024 • 25
Seed-Coder: Let the Code Model Curate Data for Itself

Paper • 2506.03524 • Published Jun 4, 2025 • 6
Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 153

🫐 ProX Projects

Collection for: "Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale"

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64
gair-prox/DCLM-pro

Viewer • Updated Feb 15, 2025 • 366M • 2.15k • 12
gair-prox/FineWeb-pro

Viewer • Updated Sep 26, 2024 • 63.1M • 977 • 26
gair-prox/open-web-math-pro

Viewer • Updated Sep 26, 2024 • 2.58M • 737 • 12

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64

a collection of pre-training corpora refined by ProX

gair-prox/DCLM-pro

Viewer • Updated Feb 15, 2025 • 366M • 2.15k • 12
gair-prox/FineWeb-pro

Viewer • Updated Sep 26, 2024 • 63.1M • 977 • 26
gair-prox/open-web-math-pro

Viewer • Updated Sep 26, 2024 • 2.58M • 737 • 12
gair-prox/RedPajama-pro

Viewer • Updated Sep 26, 2024 • 10.2M • 101 • 4

Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend.

VILA^2: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24, 2024 • 41
Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30, 2024 • 118
Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26, 2024 • 48
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 42

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published Sep 3, 2024 • 34
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 27
Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126
Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17, 2024 • 22

about 1 month ago

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 127
Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Paper • 2405.04324 • Published May 7, 2024 • 25
Seed-Coder: Let the Code Model Curate Data for Itself

Paper • 2506.03524 • Published Jun 4, 2025 • 6
Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 153

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7, 2025 • 110
MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30, 2025 • 139
Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21, 2025 • 155
Antidistillation Sampling

Paper • 2504.13146 • Published Apr 17, 2025 • 59

🫐 ProX Projects

Collection for: "Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale"

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64
gair-prox/DCLM-pro

Viewer • Updated Feb 15, 2025 • 366M • 2.15k • 12
gair-prox/FineWeb-pro

Viewer • Updated Sep 26, 2024 • 63.1M • 977 • 26
gair-prox/open-web-math-pro

Viewer • Updated Sep 26, 2024 • 2.58M • 737 • 12

ProX Refining Models

Adapted small language models used to generate data refining programs

gair-prox/web-doc-refining-lm

Text Generation • 0.4B • Updated Oct 10, 2024 • 64 • 5
gair-prox/web-chunk-refining-lm

Text Generation • 0.4B • Updated Oct 10, 2024 • 45 • 6
gair-prox/math-doc-refining-lm

Text Generation • 0.8B • Updated Oct 10, 2024 • 4 • 2
gair-prox/math-chunk-refining-lm

Text Generation • 0.4B • Updated Oct 10, 2024 • 2 • 1

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64

ProX Math Models

base models trained on ProX curated openwebmath-pro.

gair-prox/Mistral-7B-ProXMath

Text Generation • 7B • Updated Sep 28, 2024 • 7 • 3
gair-prox/TinyLlama-1.1B-ProXMath

1B • Updated Oct 10, 2024 • 5 • 2
gair-prox/Llama-2-7B-ProXMath

Text Generation • Updated Oct 10, 2024 • 1
gair-prox/CodeLlama-7B-ProXMath

Updated Oct 10, 2024 • 1 • 1

a collection of pre-training corpora refined by ProX

gair-prox/DCLM-pro

Viewer • Updated Feb 15, 2025 • 366M • 2.15k • 12
gair-prox/FineWeb-pro

Viewer • Updated Sep 26, 2024 • 63.1M • 977 • 26
gair-prox/open-web-math-pro

Viewer • Updated Sep 26, 2024 • 2.58M • 737 • 12
gair-prox/RedPajama-pro

Viewer • Updated Sep 26, 2024 • 10.2M • 101 • 4

Agentic-ly agentic

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 40
On the limits of agency in agent-based models

Paper • 2409.10568 • Published Sep 14, 2024 • 14
On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 13
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 66

Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend.

VILA^2: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24, 2024 • 41
Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30, 2024 • 118
Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26, 2024 • 48
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 42

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs