Show and Tell: QLANKR Test, a tool for evaluating AI agents and RAG workflows
Hugging Face Forums [Unofficial]
April 7, 2026
Site: https://test.qlankr.com
Hi everyone,
I built QLANKR Test because AI evaluation still feels too inconsistent and too dependent on guesswork.
A lot of builders are shipping agents, chatbots, RAG systems, and tool-calling workflows, but the feedback loop is often messy. You tweak a prompt, change a tool, run it again, and it is not always easy to understand what actually improved.
QLANKR Test is my attempt to make that process more structured.
It helps test:
* AI agents
* chatbots
* RAG systems
* tool-calling workflows
The goal is to make evaluation more structured, repeatable, and easier to inspect.
I would especially love feedback on:
* whether the report feels useful
* whether the scoring makes sense
* what is still missing for real-world agent evaluation
Site: https://test.qlankr.com
Discussion in the ATmosphere