Big text-to-image models make amazing pictures but are slow because they take hundreds of tiny steps to turn noise into an image.
DuetSVG is a new AI that learns to make SVG graphics by generating an image and the matching SVG code together, like sketching first and then tracing neatly.
Diffusion models sometimes copy training images too closely, which can be a privacy and copyright problem.
Wan-Move is a new way to control how things move in AI-generated videos by guiding motion directly inside the model’s hidden features.
This paper fixes two big problems in image-making AI that builds pictures step by step: it often practices with perfect answers (teacher forcing) but must perform using its own imperfect guesses later, and the earliest coarse steps are much harder than the later fine steps.