Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing
BeginnerShilong Zhang, He Zhang et al.Dec 19arXiv
This paper shows that great image understanding features alone are not enough for making great images; you also need strong pixel-level detail.
#Pixel–Semantic VAE#Semantic Regularization#Off-Manifold Generation