Failing to use Qwen3.5-397B-A17B through HF inference
Hugging Face Forums [Unofficial]
April 1, 2026
Did something change about this model ? I used to have no issues running this model inside Zed Editor - but today for some reason I am getting error:
{"status":400,"error":"BAD REQUEST","message":"payload validation: max_completion_tokens is limited to 16384 for qwen3.5-397b-a17b"}
Even when I change the max_completion_tokens param to below that in Zed, it doesnt do anything - the error still happens. Anyone may have any idea whats going on?
Discussion in the ATmosphere