DREAM is one model that both understands images (like CLIP) and makes images from text (like top text-to-image models).
Kimi K2.5 is a new open-source AI that can read both text and visuals (images and videos) and act like a team of helpers to finish big tasks faster.