Papers129

Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration

Jiaheng Liu, Yuanxing Zhang et al.Feb 4arXiv

This paper says today's content AIs are great at pretty pictures and videos but often miss what people actually want, creating a big Intent-Execution Gap.

#Vibe AIGC#Agentic Orchestration#Meta Planner

Likelihood-Based Reward Designs for General LLM Reasoning

Beginner

Ariel Kwiatkowski, Natasha Butt et al.Feb 3arXiv

Binary right/wrong rewards for training reasoning in large language models are hard to design and often too sparse to learn from.

#log-likelihood reward#chain-of-thought (CoT)#reinforcement learning for LLMs

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

Beginner

Jianhao Ruan, Zhihao Xu et al.Feb 3arXiv

AOrchestra is like a smart conductor that builds the right mini-helpers (sub-agents) on demand to solve big, multi-step tasks.

#agent orchestration#sub-agent-as-tools#four-tuple abstraction

CL-bench: A Benchmark for Context Learning

Beginner

Shihan Dou, Ming Zhang et al.Feb 3arXiv

CL-bench is a new test that checks whether AI can truly learn new things from the information you give it right now, not just from what it memorized before.

#context learning#benchmark#rubric-based evaluation

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Beginner

Yinjie Wang, Tianbao Xie et al.Feb 2arXiv

RLAnything is a new reinforcement learning (RL) framework that trains three things together at once: the policy (the agent), the reward model (the judge), and the environment (the tasks).

#reinforcement learning#closed-loop optimization#reward modeling

WideSeek: Advancing Wide Research via Multi-Agent Scaling

Beginner

Ziyang Huang, Haolin Ren et al.Feb 2arXiv

The paper tackles a new kind of search called Wide Research, where an AI must gather lots of related facts under complex rules and put them into a clean table.

#Wide Research#General Broad Information Seeking#Knowledge Graph

Kimi K2.5: Visual Agentic Intelligence

Beginner

Kimi Team, Tongtong Bai et al.Feb 2arXiv

Kimi K2.5 is a new open-source AI that can read both text and visuals (images and videos) and act like a team of helpers to finish big tasks faster.

#multimodal learning#vision-language models#joint optimization

WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora

Beginner

Pengyu Wang, Benfeng Xu et al.Feb 2arXiv

WildGraphBench is a new test that checks how well GraphRAG systems find and combine facts from messy, real-world web pages.

#GraphRAG#Retrieval-Augmented Generation#Wikipedia references

Wiki Live Challenge: Challenging Deep Research Agents with Expert-Level Wikipedia Articles

Beginner

Shaohan Wang, Benfeng Xu et al.Feb 2arXiv

This paper builds a live challenge that tests how well Deep Research Agents (DRAs) can write expert-level Wikipedia-style articles.

#Deep Research Agents#Wikipedia Good Articles#Benchmark

Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars

Beginner

Youliang Zhang, Zhengguang Zhou et al.Feb 2arXiv

This paper teaches talking avatars not just to speak, but to look around their scene and handle nearby objects exactly as a text instruction says.

#grounded human-object interaction#talking avatars#diffusion transformer

PolySAE: Modeling Feature Interactions in Sparse Autoencoders via Polynomial Decoding

Beginner

Panagiotis Koromilas, Andreas D. Demou et al.Feb 1arXiv

PolySAE is a new kind of sparse autoencoder that keeps a simple, linear way to find features but uses a smarter decoder that can multiply features together.

#Sparse Autoencoder#Polynomial Decoder#Feature Interactions

PaperBanana: Automating Academic Illustration for AI Scientists

Beginner

Dawei Zhu, Rui Meng et al.Jan 30arXiv

PaperBanana is a team of AI helpers that turns a paper’s method text and caption into a clean, accurate, publication-ready figure.

#academic illustration#methodology diagrams#visual language models

1 2 3 4 5