HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding
IntermediateHaowei Zhang, Shudong Yang et al.Jan 21arXiv
HERMES is a training-free way to make video-language models understand live, streaming video quickly and accurately.
#HERMES#KV cache#hierarchical memory