Show and Tell: QLANKR Test, a tool for evaluating AI agents and RAG workflows

Hugging Face Forums [Unofficial] April 7, 2026
Site: https://test.qlankr.com

Hi everyone,

I built QLANKR Test because AI evaluation still feels too inconsistent and too dependent on guesswork.

A lot of builders are shipping agents, chatbots, RAG systems, and tool-calling workflows, but the feedback loop is often messy. You tweak a prompt, change a tool, run it again, and it is not always easy to understand what actually improved.

QLANKR Test is my attempt to make that process more structured. It helps test:

* AI agents
* chatbots
* RAG systems
* tool-calling workflows

The goal is to make evaluation more structured, repeatable, and easier to inspect.

I would especially love feedback on:

* whether the report feels useful
* whether the scoring makes sense
* what is still missing for real-world agent evaluation

Site: https://test.qlankr.com
