External Publication

Getting Frequent Gateway timeout on Inference Provider

Hugging Face Forums [Unofficial] June 3, 2026

I am using the inference provider with the model moonshotai/Kimi-K2.6:fireworks-ai, but I am experiencing frequent Gateway Timeout errors when using the Hugging Face Inference Provider.

This issue seems to occur specifically when the context is long and reasoning is set to a high level, where it is expected that the model will take significantly more time to generate a response.

The Fireworks AI backend itself appears to be working fine, as I do not encounter this issue when using the Fireworks AI API directly.

Could someone help me understand what might be causing this or suggest a possible solution?

Discussion in the ATmosphere