Research

Academic or research source. Check the methodology, sample size, and whether it's been replicated.

Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training

Pass@k is a widely used performance metric for verifiable large language model tasks, including mathematical reasoning, code generation, and short-answer reasoning.

arXiv cs.AI · Feb 24, 2026 18:43 UTC · Paper: ~15 min

Read Original

Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training

TLDR

Pass@k is a widely used performance metric for verifiable large language model tasks, including mathematical reasoning, code generation, and short-answer reasoning.

Artifacts

Paper PDF

Open

O open S save B back M mode