One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation
IntermediateYuan Gao, Chen Chen et al.Dec 8arXiv
This paper shows that we can turn big, smart vision features into a small, easy-to-use code for image generation with just one attention layer.
#Feature Auto-Encoder#FAE#Self-Supervised Learning