Papers1262

iGRPO: Self-Feedback-Driven LLM Reasoning

Ali Hatamizadeh, Shrimai Prabhumoye et al.Feb 9arXiv

This paper teaches a language model to improve its own math answers by first writing several drafts and then learning to beat its best draft.

#iGRPO#GRPO#Reinforcement Learning

Not triaged yet

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Intermediate

Shiyang Feng, Runmin Ma et al.Feb 9arXiv

InternAgent-1.5 is a single AI system that can read papers, plan experiments, run code or lab steps, check results, and keep improving over time.

#AI for Science#Autonomous Scientific Discovery#Agentic AI

Not triaged yet

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Intermediate

SII-OpenMOSS Team, Donghua Yu et al.Feb 9arXiv

MOVA is an open-source AI that makes videos and sounds at the same time so mouths, actions, and noises match perfectly.

#video-audio generation#lip synchronization#dual-tower architecture

Not triaged yet

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Intermediate

Linli Yao, Yuancheng Wei et al.Feb 9arXiv

This paper teaches AI to write movie-like scripts for videos by adding exact timestamps and rich details about what you see and hear.

#Omni Dense Captioning#time-aware video captioning#audio-visual understanding

Not triaged yet

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Beginner

Tiwei Bie, Maosong Cao et al.Feb 9arXiv

LLaDA2.1 teaches a diffusion-style language model to write fast rough drafts and then fix its own mistakes by editing tokens it already wrote.

#discrete diffusion language model#editable decoding#token-to-token editing

Not triaged yet

GISA: A Benchmark for General Information-Seeking Assistant

Intermediate

Yutao Zhu, Xingshuo Zhang et al.Feb 9arXiv

GISA is a new test (benchmark) that checks how well AI assistants can search the web like real people do.

#GISA#information-seeking agents#web search benchmark

Not triaged yet

Beyond Correctness: Learning Robust Reasoning via Transfer

Intermediate

Hyunseok Lee, Soheil Abbasloo et al.Feb 9arXiv

This paper teaches language models not just to get the final answer right but to think in a way others can reliably follow.

#Reinforcement Learning with Transferable Reward#RLTR#Reasoning Transferability

Not triaged yet

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Beginner

Yuhao Dong, Shulin Tian et al.Feb 9arXiv

This paper teaches AI to learn how-to steps from demonstrations in the moment, the way people do.

#video in-context learning#procedural video understanding#multimodal large language models

Not triaged yet

NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control

Beginner

Yufan Wen, Zhaocheng Liu et al.Feb 9arXiv

NarraScore turns a video's changing story into a matching soundtrack by using emotion as the bridge.

#video-to-music generation#affective computing#valence-arousal

Not triaged yet

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Intermediate

Zixuan Huang, Xin Xia et al.Feb 9arXiv

Big AI reasoning models often keep thinking long after they already found the right answer, wasting time and tokens.

#SAGE#efficient reasoning#chain of thought

Not triaged yet

PISCO: Precise Video Instance Insertion with Sparse Control

Beginner

Xiangbo Gao, Renjie Li et al.Feb 9arXiv

PISCO is a video AI that lets you place a specific object into a real video exactly where and when you want, using just a few keyframes instead of editing every frame.

#video instance insertion#sparse keyframe control#video diffusion

Not triaged yet

G-LNS: Generative Large Neighborhood Search for LLM-Based Automatic Heuristic Design

Intermediate

Baoyun Zhao, He Wang et al.Feb 9arXiv

This paper teaches an AI to invent its own 'break-and-fix' strategies (called LNS operators) for tough puzzles like delivery routes and city tours.

#Generative LNS#Automated Heuristic Design#Large Language Models

Not triaged yet

23 24 25 26 27