LRAgent: Efficient KV Cache Sharing for Multi-LoRA LLM Agents
IntermediateHyesung Jeon, Hyeongju Ha et al.Feb 1arXiv
Multi-agent LLM systems often use LoRA adapters so each agent has a special role, but they all rebuild almost the same KV cache, wasting memory and time.
#LoRA#Multi-LoRA#KV cache