Abstract
Learning per-query agent configurations through reinforcement learning improves task accuracy while reducing computational costs compared to fixed templates and hand-tuned heuristics.
Configuring LLM-based agent systems involves choosing workflows, tools, token budgets, and prompts from a large combinatorial design space, and is typically handled today by fixed large templates or hand-tuned heuristics. This leads to brittle behavior and unnecessary compute, since the same cumbersome configuration is often applied to both easy and hard input queries. We formulate agent configuration as a query-wise decision problem and introduce ARC (Agentic Resource & Configuration learner), which learns a light-weight hierarchical policy using reinforcement learning to dynamically tailor these configurations. Across multiple benchmarks spanning reasoning and tool-augmented question answering, the learned policy consistently outperforms strong hand-designed and other baselines, achieving up to 25% higher task accuracy while also reducing token and runtime costs. These results demonstrate that learning per-query agent configurations is a powerful alternative to "one size fits all" designs.
Community
Building agentic systems is hard, but configuring them is even harder.
We all know the struggle: Which LLM should handle the planning? Which tool does it need? How much context is too much? What is the most effective workflow?
In our new paper, Learning to Configure Agentic AI Systems, we propose a framework (called ARC) that automates these decisions. Instead of manual trial-and-error, we use a Hierarchical Reinforcement Learning (HRL) algorithm to dynamically find the best configuration for a given input.
#AgenticAI #LLMs #ReinforcementLearning
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- TodoEvolve: Learning to Architect Agent Planning Systems (2026)
- ASTER: Agentic Scaling with Tool-integrated Extended Reasoning (2026)
- StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management (2026)
- Learning to Share: Selective Memory for Efficient Parallel Agentic Systems (2026)
- Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems (2026)
- PaperScout: An Autonomous Agent for Academic Paper Search with Process-Aware Sequence-Level Policy Optimization (2026)
- Evolutionary Generation of Multi-Agent Systems (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper