Masking Teacher and Reinforcing Student for Distilling Vision-Language Models
IntermediateByung-Kwan Lee, Yu-Chiang Frank Wang et al.Dec 23arXiv
Big vision-language models are super smart but too large to fit on phones and small devices.
#vision-language models#knowledge distillation#masking teacher