Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models
Intermediate · Anmol Goel, Cornelius Emde et al. · Jan 21 · arXiv
Benign fine-tuning meant to make language models more helpful can accidentally make them overshare private information.
#contextual privacy #privacy collapse #fine-tuning