Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model
IntermediateTianyi Wu, Mingzhe Du et al.Feb 7arXiv
This paper introduces SecCoderX, a way to teach code-writing AIs to be secure without breaking what the code is supposed to do.
#secure code generation#reinforcement learning#vulnerability reward model