DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Paper
• 2310.03714 • Published
• 37
Note I'm not a fan of the implementation, but I think the ideas behind DSPy are interesting.
Note The paper that introduced the concept of multi-agents!
Note GAIA benchmark is the most challenging benchmark for generalist agents, requiring a good web browser, multimodal capabilities, and complex multi-step task solving.
Note This paper is the basis for the Thought -> Action -> Observation cycle used in most agent frameworks nowadays.
Note Has nice explanations as to why writing agent actions in code is better.
Need to analyze data? Let a Llama-3.1 agent do it for you!
Note This paper displays much more impressive scores than ShowUI : but the VLMs used are also much larger (7B and 72B vs 2B) and based on the better Qwen2.5 instead of Qwen2.