Papers (200)

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

LLaDA2.1 teaches a diffusion-style language model to write fast rough drafts and then fix its own mistakes by editing tokens it already wrote.

#discrete diffusion language model #editable decoding #token-to-token editing

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Beginner

Yuhao Dong, Shulin Tian et al. · Feb 9 · arXiv

This paper teaches AI to learn how-to steps from demonstrations in the moment, the way people do.

#video in-context learning #procedural video understanding #multimodal large language models

NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control

Beginner

Yufan Wen, Zhaocheng Liu et al. · Feb 9 · arXiv

NarraScore turns a video's changing story into a matching soundtrack by using emotion as the bridge.

#video-to-music generation #affective computing #valence-arousal

PISCO: Precise Video Instance Insertion with Sparse Control

Beginner

Xiangbo Gao, Renjie Li et al. · Feb 9 · arXiv

PISCO is a video AI that lets you place a specific object into a real video exactly where and when you want, using just a few keyframes instead of editing every frame.

#video instance insertion #sparse keyframe control #video diffusion

AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions

Beginner

Xianyang Liu, Shangding Gu et al. · Feb 5 · arXiv

AgenticPay is a safe playground where AI agents practice buying and selling by talking, not just by typing numbers.

#multi-agent negotiation #language-mediated bargaining #LLM agents

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs

Beginner

Haoran Li, Sucheng Ren et al. · Feb 5 · arXiv

The paper introduces CoPE, a simple change to how models track word positions that makes long documents much easier for them to understand.

#CoPE #RoPE #Rotary Positional Embedding

Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning

Beginner

Yu-Ang Lee, Ching-Yun Ko et al. · Feb 4 · arXiv

When you tune the learning rate carefully, plain old LoRA fine-tuning works about as well as fancy new versions.

#LoRA #parameter-efficient fine-tuning #learning rate tuning

Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration

Beginner

Jiaheng Liu, Yuanxing Zhang et al. · Feb 4 · arXiv

This paper argues that today's content-generation AIs produce polished images and videos but often miss what people actually want, a mismatch it calls the Intent-Execution Gap.

#Vibe AIGC #Agentic Orchestration #Meta Planner

Likelihood-Based Reward Designs for General LLM Reasoning

Beginner

Ariel Kwiatkowski, Natasha Butt et al. · Feb 3 · arXiv

Binary right/wrong rewards for training reasoning in large language models are hard to design and often too sparse to learn from.

#log-likelihood reward #chain-of-thought (CoT) #reinforcement learning for LLMs

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

Beginner

Jianhao Ruan, Zhihao Xu et al. · Feb 3 · arXiv

AOrchestra is like a smart conductor that builds the right mini-helpers (sub-agents) on demand to solve big, multi-step tasks.

#agent orchestration #sub-agent-as-tools #four-tuple abstraction

CL-bench: A Benchmark for Context Learning

Beginner

Shihan Dou, Ming Zhang et al. · Feb 3 · arXiv

CL-bench is a new test that checks whether AI can truly learn new things from the information you give it right now, not just from what it memorized before.

#context learning #benchmark #rubric-based evaluation

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

Beginner

Yinjie Wang, Tianbao Xie et al. · Feb 2 · arXiv

RLAnything is a new reinforcement learning (RL) framework that trains three things at once: the policy (the agent), the reward model (the judge), and the environment (the tasks).

#reinforcement learning #closed-loop optimization #reward modeling
