This paper builds MFMD-Scen, a big test to see how AI changes its truth/false judgments about the same money-related claim when the situation around it changes.
TokSuite is a science lab for tokenizers: it trains 14 language models that are identical in every way except for how they split text into tokens.