Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency
BeginnerHaoming Xu, Ningyuan Zhao et al.Jan 9arXiv
LLMs can look confident but still change their answers when the surrounding text nudges them, showing that confidence alone isnβt real truthfulness.
#Neighbor-Consistency Belief#belief robustness#self-consistency