Is an agent-harness evaluation preprint suitable for arXiv cs.AI?

Hugging Face Forums [Unofficial] May 1, 2026
I’m an independent researcher working on agent systems and LLM evaluation. I recently prepared a small empirical preprint and am trying to understand the right path for sharing it with the research community.

The paper studies how different agent harnesses/scaffolds can affect measured benchmark performance and token cost under a controlled setup. It compares Goose, OpenCode, and OpenHands-SDK on a fixed Terminal-Bench-Pro task slice across two models.

Paper / DOI: https://doi.org/10.5281/zenodo.19819492
Code/repo: https://github.com/namanvats/scaffold-effects

I’m currently looking for advice from people familiar with arXiv cs.AI submissions: does this look appropriately scoped for cs.AI, and what is the respectful way for a first-time independent author to handle the endorsement process? I’m not asking for a review of the paper’s claims, only for guidance on category fit and the right process.