GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models
IntermediateBozhou Li, Sihan Yang et al.Dec 17arXiv
This paper is about making the words you type into a generator turn into the right pictures and videos more reliably.
#diffusion models#text encoder#multimodal large language model