Optimizing Agentic Architecture: Strategies for Reducing High Token Costs in Multi-Intent Workflows

OpenAI Developer Community April 22, 2026
As for optimizing the “agent per tool call” pattern: our lab's experience shows that if you can perform the action without an LLM call after routing, do it. This makes the workflow more reliable and predictable. We essentially make another LLM (agent) call only when we need to semantically extract the parameters required to perform the action.
