DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment
IntermediateHaoyou Deng, Keyu Yan et al.Jan 28arXiv
DenseGRPO teaches image models using lots of small, timely rewards instead of one final score at the end.
#DenseGRPO#flow matching#GRPO