Unified Personalized Reward Model for Vision Generation
IntermediateYibin Wang, Yuhang Zang et al.Feb 2arXiv
The paper introduces UnifiedReward-Flex, a reward model that judges images and videos the way a thoughtful human wouldโby flexibly changing what it checks based on the prompt and the visual evidence.
#personalized reward model#multimodal reward#context-adaptive reasoning