External Publication

Optimizing Agentic Architecture: Strategies for Reducing High Token Costs in Multi-Intent Workflows

OpenAI Developer Community April 22, 2026

As for a optimizing “agent per tool call”. Experience of our lab showed that if you can accomplish the action without LLM call after routing - do it. This makes workflow more reliable and predictable. We basically use another LLM(agent) call only in case we need semantically extract parameters necessary to perform action

Discussion in the ATmosphere