Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
IntermediateSiqi Kou, Jiachun Jin et al.Jan 15arXiv
Most text-to-image models act like word-to-pixel copy machines and miss the hidden meaning in our prompts.
#think-then-generate#reasoning-aware text-to-image#LLM encoder