Raw Record Source

{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreiaxmwdivhy57f45mg5ovclpgsycvd6yrq22ob4tfi6bpstojudycu",
    "uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mmuc6ihaq6y2"
  },
  "path": "/t/endorsement-request-halfway-speculative-decoding-direct-acceptance-rate-optimization-with-joint-drafter-target-training/176267#post_1",
  "publishedAt": "2026-05-27T18:56:24.000Z",
  "site": "https://discuss.huggingface.co",
  "tags": [
    "https://sergiu-nistor.com/assets/publications/Halfway_Speculative_Decoding.pdf",
    "https://arxiv.org/auth/endorse?x=TTNLCR"
  ],
  "textContent": "Hi everyone,\n\nI’m looking for an arXiv endorsement for **cs.CL** to submit my paper on speculative decoding.\n\n**Paper:** Halfway Speculative Decoding: Direct Acceptance Rate Optimization with Joint Drafter-Target Training\n\n**Abstract:** In speculative decoding, direct optimization of the acceptance rate through LK losses has proven to be more effective than the standard approach of training the draft head through KL divergence minimization against the target’s output distribution, as the acceptance rate is what ultimately governs inference speedup. However, prior approaches have either jointly trained the draft head and target model without directly targeting acceptance rate, or optimized the acceptance rate on the draft head alone while keeping the target frozen. We propose Halfway Speculative Decoding, which extends LK loss optimization to both models jointly, using cross-entropy loss to regularize the target model and prevent quality degradation. We study how speculation metrics and target quality scale with training budget by training separate model instances on increasing numbers of training examples and evaluating across several benchmarks.\n\n**PDF:** https://sergiu-nistor.com/assets/publications/Halfway_Speculative_Decoding.pdf\n\n**Endorsement link:** https://arxiv.org/auth/endorse?x=TTNLCR\n\nHappy to share more details or discuss the work. Thank you!",
  "title": "[Endorsement Request] Halfway Speculative Decoding: Direct Acceptance Rate Optimization with Joint Drafter-Target Training"
}