This paper teaches AI models not just how to solve problems but also how to tell when their own answers might be wrong.
This paper studies how sure (confident) large language models are during multi-turn chats where clues arrive step by step.