Getting Frequent Gateway timeout on Inference Provider
Hugging Face Forums [Unofficial]
June 3, 2026
I am using the inference provider with the model moonshotai/Kimi-K2.6:fireworks-ai, but I am experiencing frequent Gateway Timeout errors when using the Hugging Face Inference Provider.
This issue seems to occur specifically when the context is long and reasoning is set to a high level, where it is expected that the model will take significantly more time to generate a response.
The Fireworks AI backend itself appears to be working fine, as I do not encounter this issue when using the Fireworks AI API directly.
Could someone help me understand what might be causing this or suggest a possible solution?
Discussion in the ATmosphere