Papers4

All Beginner Intermediate Advanced

All Sources arXiv

#rejection sampling fine-tuning

V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval

Intermediate

Dongyang Chen, Chaoyang Wang et al.Feb 5arXiv

V-Retrver is a new way for AI to search across text and images by double-checking tiny visual details instead of only guessing from words.

#V-Retrver#multimodal retrieval#agentic reasoning

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Intermediate

Taofeng Xue, Chong Peng et al.Jan 22arXiv

Before this work, computer-using AIs mostly copied old examples and struggled with long step-by-step tasks on real computers.

#computer use agent#verifiable synthesis#validator

ROI-Reasoning: Rational Optimization for Inference via Pre-Computation Meta-Cognition

Intermediate

Muyang Zhao, Qi Qi et al.Jan 7arXiv

The paper teaches AI models to plan their thinking time like a smart test-taker who has to finish several questions before the bell rings.

#meta-cognition#budgeted reasoning#token budget

Openpi Comet: Competition Solution For 2025 BEHAVIOR Challenge

Intermediate

Junjie Bai, Yu-Wei Chao et al.Dec 10arXiv

This paper shows how to make home-helper robots better at long, multi-step chores by smart training on diverse tasks and by polishing the model after training using its own best attempts.

#Vision-Language-Action#long-horizon manipulation#rejection sampling fine-tuning