GameTalk: Training LLMs for Strategic Conversation
Intermediate · Victor Conchello Vendrell, Max Ruiz Luyten et al. · Jan 22 · arXiv
Large language models are usually evaluated one message at a time, but many real tasks require strategic planning across a whole conversation.
#strategic conversation · #reinforcement learning for LLMs · #multi-turn dialogue