External Publication
Visit Post

Feature request: Kernels metadata in the models

Hugging Face Forums [Unofficial] May 6, 2026
Source
I like it, but “a user can immediately know whether their GPU is compatible or not for training” shouldn’t be effected by the kernels used. If they use flash-attn and the user does more training without flash attention it would still train.

Discussion in the ATmosphere

Loading comments...