When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains
Intermediate | Ahmadreza Jeddi, Kimia Shaban et al. | Mar 1 | arXiv
This paper asks a simple question: does reinforcement learning (RL) truly make medical vision-language models (VLMs) smarter, or just help them pick better from answers they already know?
#medical vision-language models #reinforcement learning #supervised fine-tuning