This paper builds MFMD-Scen, a big test to see how AI changes its truth/false judgments about the same money-related claim when the situation around it changes.
Large language models can say things that sound right but arenβt supported by the given document; this is called a faithfulness hallucination.