Jarrod Barnes PRO

Jarrodbarnes

https://dynamicalsystems.ai

AI & ML interests

Continual Learning, Reinforcement Learning

Recent Activity

upvoted 3 papers 4 days ago

Reinforcement World Model Learning for LLM-based Agents

Paper • 2602.05842 • Published Feb 5 • 28

Bridging the Agent-World Gap: Text World Models for LLM-based Agents

Paper • 2606.09032 • Published 23 days ago • 8

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

Paper • 2606.12191 • Published 21 days ago • 70

liked a model 5 days ago

Qwen/Qwen-AgentWorld-35B-A3B

Text Generation • 35B • Updated 5 days ago • 28.5k • 456

liked a dataset 5 days ago

Qwen/AgentWorldBench

Viewer • Updated 7 days ago • 2.17k • 1.33k • 56

liked a model 12 days ago

facebook/UMA

Updated Apr 29 • 64 • 286

upvoted a collection 15 days ago

SWE-FastContext

Collection

A family of code-search models powering the Explore subagent for coding agents. (It will be made public later.) • 3 items • Updated about 14 hours ago • 15

liked a model 20 days ago

google/diffusiongemma-26B-A4B-it

Image-Text-to-Text • 26B • Updated 20 days ago • 1.27M • 1.08k

upvoted a collection 22 days ago

Materials

Collection

Welcome to IBM’s multi-modal foundation model for materials, FM4M, designed to support and advance research in materials science and chemistry. • 6 items • Updated Jan 28, 2025 • 15

updated a dataset 23 days ago

Jarrodbarnes/latent-mining

Viewer • Updated 23 days ago • 165 • 1.14k

published a dataset 26 days ago

Jarrodbarnes/latent-mining

Viewer • Updated 23 days ago • 165 • 1.14k

updated 2 datasets about 1 month ago

poolside-laguna-hackathon/processrl-terminal-environments

Viewer • Updated May 30 • 67 • 195

Jarrodbarnes/processrl-terminal-environments

Viewer • Updated May 30 • 67 • 195

published a dataset about 1 month ago

Jarrodbarnes/processrl-terminal-environments

Viewer • Updated May 30 • 67 • 195

liked a dataset about 1 month ago

open-thoughts/OpenThoughts-Agent-v1-RL

Viewer • Updated Jan 27 • 728 • 427 • 19

upvoted an article about 1 month ago

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

Article • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego • May 27 • 42

upvoted 2 papers about 1 month ago

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

Paper • 2605.26494 • Published May 26 • 41

Look Before You Leap: Autonomous Exploration for LLM Agents

Paper • 2605.16143 • Published May 15 • 10

upvoted a collection about 1 month ago

📊 DNA benchmarks

Collection

Zero-shot DNA benchmarks for Variant Effect prediction, Sequence Recovery and Perturbation tasks. • 5 items • Updated May 19 • 13

liked a model about 1 month ago

HuggingFaceBio/Carbon-8B

Text Generation • 8B • Updated 12 days ago • 1.24k • 46
