Optimizing Agentic Architecture: Strategies for Reducing High Token Costs in Multi-Intent Workflows
OpenAI Developer Community
April 22, 2026
As for a optimizing “agent per tool call”. Experience of our lab showed that if you can accomplish the action without LLM call after routing - do it. This makes workflow more reliable and predictable.
We basically use another LLM(agent) call only in case we need semantically extract parameters necessary to perform action
Discussion in the ATmosphere