UM-Text: A Unified Multimodal Model for Image Understanding and Visual Text Editing
Intermediate · Lichen Ma, Xiaolong Fu et al. · Jan 13 · arXiv
UM-Text is a single AI model that understands both your words and your picture to add or change text in images so it looks like it truly belongs there.
#visual text editing #multimodal diffusion #Visual Language Model