Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use
IntermediateAradhye Agarwal, Gurdit Siyan et al.Mar 3arXiv
Agentic AIs donβt just chat; they plan, use tools, and take many steps, so one wrong click can cause real harm.
#MOSAIC#agentic safety#plan-check-act