BBQ is a text-to-image model that lets you place objects exactly where you want using numeric bounding boxes and color them with exact RGB values.
Text-to-image models draw pretty pictures, but they often put objects in the wrong places or get the interactions between objects wrong.
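
To make the idea of numeric bounding boxes and exact RGB values concrete, here is a minimal sketch of what a structured scene description could look like. It is an illustration only: the `ObjectSpec` class, the `build_prompt` helper, the normalized `(x_min, y_min, x_max, y_max)` box convention, and the JSON layout are all assumptions made for this example, not BBQ's actual input format.

```python
import json
from dataclasses import dataclass, asdict
from typing import List, Tuple


@dataclass
class ObjectSpec:
    """One object in a hypothetical structured prompt (not BBQ's real schema)."""
    description: str
    # Assumed convention: (x_min, y_min, x_max, y_max), normalized to [0, 1].
    bbox: Tuple[float, float, float, float]
    # Exact color as 8-bit RGB channels, each in 0..255.
    rgb: Tuple[int, int, int]

    def __post_init__(self) -> None:
        x0, y0, x1, y1 = self.bbox
        # Reject boxes that are empty, inverted, or outside the image.
        if not (0.0 <= x0 < x1 <= 1.0 and 0.0 <= y0 < y1 <= 1.0):
            raise ValueError(f"invalid bounding box: {self.bbox}")
        # Reject channels outside the 8-bit range.
        if any(not 0 <= c <= 255 for c in self.rgb):
            raise ValueError(f"invalid RGB value: {self.rgb}")


def build_prompt(caption: str, objects: List[ObjectSpec]) -> str:
    """Serialize a caption plus per-object boxes and colors to JSON."""
    payload = {"caption": caption, "objects": [asdict(o) for o in objects]}
    return json.dumps(payload, indent=2)


if __name__ == "__main__":
    scene = [
        ObjectSpec("apple", bbox=(0.10, 0.50, 0.30, 0.80), rgb=(200, 30, 30)),
        ObjectSpec("table", bbox=(0.00, 0.75, 1.00, 1.00), rgb=(120, 80, 40)),
    ]
    print(build_prompt("an apple resting on a wooden table", scene))
```

The point of the sketch is that position and color become plain numbers that can be validated before generation, rather than phrases like "on the left" or "dark red" that a model has to interpret.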