Papers (4)

#long-term memory

RIVER: A Real-Time Interaction Benchmark for Video LLMs

Intermediate

Yansong Shi, Qingsong Zhao et al. · Mar 4 · arXiv

RIVER Bench is a new benchmark that checks how well video LLMs can watch a video stream and interact with a user in real time.

#RIVER Bench #online video understanding #multimodal large language models


MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning

Intermediate

Jiejun Tan, Zhicheng Dou et al. · Mar 3 · arXiv

MemSifter uses a proxy model to pick the right memories for a large LLM, so the large model does not have to read everything.

#long-term memory #LLM retrieval #proxy model


MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments

Intermediate

Guangyi Liu, Pengxiang Zhao et al. · Feb 3 · arXiv

MemGUI-Bench is a new benchmark that checks how well phone-controlling AI agents can remember important information both within a task and across repeated attempts.

#mobile GUI agents #memory benchmarking #short-term memory


Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks

Intermediate

Bohan Zeng, Kaixin Zhu et al. · Feb 2 · arXiv

This paper argues that building true world models means more than sprinkling facts into single tasks: it means building a unified system that can see, think, remember, act, and generate across many situations.

#world models #unified framework #multimodal reasoning
