DenseGRPO teaches image models using lots of small, timely rewards instead of one final score at the end.
Robots usually learn by copying many demonstrations, which is expensive and makes them brittle when things change.