Is an agent-harness evaluation preprint suitable for arXiv cs.AI?
Hugging Face Forums [Unofficial]
May 1, 2026
I’m an independent researcher working on agent systems and LLM evaluation. I recently prepared a small empirical preprint and am trying to understand the right path for sharing it with the research community.
The paper studies how different agent harnesses/scaffolds can affect measured benchmark performance and token cost under a controlled setup. It compares Goose, OpenCode, and OpenHands-SDK on a fixed Terminal-Bench-Pro task slice across two models.
Paper / DOI: https://doi.org/10.5281/zenodo.19819492
Code/repo: https://github.com/namanvats/scaffold-effects
I’m currently looking for advice from people familiar with arXiv cs.AI submissions: does this look appropriately scoped for cs.AI, and what is the respectful way for a first-time independent author to handle the endorsement process?
I’m not asking for a review of the paper’s claims, only for guidance on category fit and the right process.
Discussion in the ATmosphere