VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction
IntermediateSinan Du, Jiahao Guo et al.Nov 28arXiv
VQRAE is a new kind of image tokenizer that lets one model both understand images (continuous features) and generate/reconstruct them (discrete tokens).
#VQRAE#Vector Quantization#Representation Autoencoder