Jiayi Zhang

didiforhugface

3 52 5

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Macaron-A2UI: A Model for Generative UI in Personal Agents

upvoted a paper about 1 month ago

Foundation Protocol: A Coordination Layer for Agentic Society

upvoted a paper about 1 month ago

MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

View all activity

Organizations

authored a paper 5 months ago

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

Paper • 2602.03786 • Published Feb 3 • 90

authored 4 papers 7 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 306

AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning

Paper • 2511.19304 • Published Nov 24, 2025 • 92

VisJudge-Bench: Aesthetics and Quality Assessment of Visualizations

Paper • 2510.22373 • Published Oct 25, 2025 • 15

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published Nov 19, 2025 • 78

authored 6 papers 8 months ago

InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published Oct 28, 2025 • 100

Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting

Paper • 2505.19716 • Published May 26, 2025 • 4

You Don't Know Until You Click:Automated GUI Testing for Production-Ready Software Evaluation

Paper • 2508.14104 • Published Aug 17, 2025 • 1

RobustFlow: Towards Robust Agentic Workflow Generation

Paper • 2509.21834 • Published Sep 26, 2025 • 2

VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering

Paper • 2510.10828 • Published Oct 12, 2025 • 1

ReCode: Unify Plan and Action for Universal Granularity Control

Paper • 2510.23564 • Published Oct 27, 2025 • 123

authored 3 papers 11 months ago

MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning

Paper • 2503.07459 • Published Mar 10, 2025 • 16

Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search

Paper • 2502.17248 • Published Feb 24, 2025 • 1

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28, 2025 • 86

authored 3 papers over 1 year ago

Jiayi Zhang

AI & ML interests

Recent Activity

Organizations

didiforhugface's activity