Add new official benchmark on the Hub
Hugging Face Forums [Unofficial]
March 25, 2026
Hi,
Thanks for reaching out! Your dataset internlm/WildClawBench and the included eval.yaml look good. We can add it to the official benchmark allow-list.
Before we do, please ensure that:
The repository follows the Hub’s benchmark submission guidelines.
The eval.yaml includes all required fields and a working evaluation script.
Any dependencies or instructions for reproducing the benchmark are clearly documented.
Once confirmed, we’ll proceed with adding it to the allow-list and it should appear as an official benchmark on the Hub.
Thanks for contributing this!