Roughly 88% of experiments do not produce a clean primary-metric win. The bottleneck is not running more experiments; it is interpreting the ones that have already concluded. An agent that pulls results, retrieves related history, cross-references releases, and proposes the next three tests closes that interpretation gap.
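One way to picture the triage step is the minimal sketch below. It is an illustration under stated assumptions, not a described implementation: the `Experiment`, `Release`, and `Proposal` types, the "interval entirely above zero" definition of a clean win, and the ranking rule (release overlap first, then effect size) are all hypothetical choices made for the example.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Experiment:
    name: str
    start: date
    end: date
    lift: float    # observed relative change in the primary metric
    ci_low: float  # lower bound of the interval on that lift


@dataclass
class Release:
    name: str
    shipped: date


@dataclass
class Proposal:
    experiment: str
    reason: str


def is_clean_win(exp: Experiment) -> bool:
    # Assumed definition: the whole interval sits above zero.
    return exp.ci_low > 0


def overlapping_releases(exp: Experiment, releases: list[Release]) -> list[Release]:
    # Cross-reference: releases shipped while the experiment was running.
    return [r for r in releases if exp.start <= r.shipped <= exp.end]


def propose_next_tests(experiments, releases, limit=3):
    """Rank the experiments without a clean win and return up to `limit` follow-ups."""
    scored = []
    for exp in experiments:
        if is_clean_win(exp):
            continue
        confounders = overlapping_releases(exp, releases)
        if confounders:
            names = ", ".join(r.name for r in confounders)
            reason = f"rerun in a window clear of: {names}"
        else:
            reason = "no overlapping release; retest with a narrower hypothesis"
        # Hypothetical priority: possibly confounded results first, then larger effects.
        priority = (bool(confounders), abs(exp.lift))
        scored.append((priority, Proposal(exp.name, reason)))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [proposal for _, proposal in scored[:limit]]
```

The cap of three follows the text; the priority rule is the part most worth replacing with whatever ordering a team actually trusts.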
In manual Monday attribution, the loudest voice wins the narrative. Replace it with an agent that pulls the delta, queries five evidence systems, and ships a ranked hypothesis list with explicit confidence, plus an unexplained remainder.
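The shape of that output can be sketched as follows. This is a hedged illustration, not the agent's actual logic: the `Hypothesis` fields, the confidence-weighted scoring, and the way the remainder is computed (delta minus the confidence-weighted contributions) are assumptions made for the example, and the evidence systems are represented only by a free-text `source` label.

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    cause: str
    source: str          # which evidence system produced it (hypothetical label)
    contribution: float  # estimated effect on the metric, in metric units
    confidence: float    # 0.0 to 1.0


def attribute(delta: float, hypotheses: list[Hypothesis]):
    """Return hypotheses ranked by confidence-weighted impact, plus the unexplained remainder."""
    ranked = sorted(
        hypotheses,
        key=lambda h: abs(h.contribution) * h.confidence,
        reverse=True,
    )
    # Assumption: only the confidence-weighted share of each estimate counts as explained.
    explained = sum(h.contribution * h.confidence for h in ranked)
    remainder = delta - explained
    return ranked, remainder


if __name__ == "__main__":
    # Illustrative numbers only; none of these come from a real system.
    ranked, remainder = attribute(
        delta=-4.0,
        hypotheses=[
            Hypothesis("checkout release", "release log", -3.0, 0.5),
            Hypothesis("seasonal dip", "historical baseline", -1.0, 1.0),
            Hypothesis("campaign launch", "marketing calendar", 0.5, 0.5),
        ],
    )
    for h in ranked:
        print(f"{h.cause}: {h.contribution:+.2f} (confidence {h.confidence:.0%}, via {h.source})")
    print(f"unexplained remainder: {remainder:+.2f}")
```

Reporting the remainder as its own line keeps the list from implying that the hypotheses account for the whole delta.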