This paper says we should measure an AI agent’s uncertainty across its whole conversation, not just on one final answer.
This paper turns rebuttal writing from ‘just write some text’ into ‘make a plan with proof, then write.’