CAR-bench is a new 'driving test' for AI assistants that checks if they can stay careful, honest, and consistent during real back-and-forth conversations in a car.
Personalized AI helpers can accidentally copy a userβs past opinions instead of telling objective facts, which the authors call personalization-induced hallucinations.