Papers (1055)

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Shiting Huang, Zecheng Li et al. · Feb 10 · arXiv

The paper teaches large language models to do what good students do: find where they went wrong, turn that lesson into a rule, and remember it for next time.

#Reinforcement Learning with Verifiable Rewards #RLVR #Meta-Experience Learning

QP-OneModel: A Unified Generative LLM for Multi-Task Query Understanding in Xiaohongshu Search

Intermediate

Jianzhao Huang, Xiaorui Huang et al. · Feb 10 · arXiv

Search engines on social apps used to rely on many separate mini-models that often misunderstood slang and were hard to keep updated.

#Query Processing #Unified Generative Model #Named Entity Recognition

The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies

Intermediate

Chenxu Wang, Chaozhuo Li et al. · Feb 10 · arXiv

The paper shows a three-way no-win situation: an AI society cannot be closed off, keep learning forever, and stay perfectly safe for humans all at the same time.

#self-evolving AI #multi-agent systems #AI safety

Stroke3D: Lifting 2D strokes into rigged 3D model via latent diffusion models

Intermediate

Ruisi Zhao, Haoren Zheng et al. · Feb 10 · arXiv

Stroke3D lets you draw simple 2D stick-figure strokes plus a short text, and it builds a ready-to-animate 3D model with a skeleton and textures.

#Stroke3D #rigged 3D generation #skeleton-first pipeline

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

Intermediate

Xavier Hu, Jinxiang Xia et al. · Feb 10 · arXiv

EcoGym is a new open test playground where AI agents run small businesses over many days to see if they can plan well for the long term.

#EcoGym #long-horizon planning #LLM agents

ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation

Intermediate

Zihan Yang, Shuyuan Tu et al. · Feb 9 · arXiv

ArcFlow is a new way to make text-to-image models draw great pictures in only 2 steps instead of 50, giving about a 40× speed boost.

#ArcFlow #few-step distillation #non-linear flow

GEBench: Benchmarking Image Generation Models as GUI Environments

Intermediate

Haodong Li, Jingwei Wu et al. · Feb 9 · arXiv

This paper introduces GEBench, a new test to check if image generation models can act like real app screens that change when you click or type.

#GEBench #GE-Score #GUI generation

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Intermediate

Shiyang Feng, Runmin Ma et al. · Feb 9 · arXiv

InternAgent-1.5 is a single AI system that can read papers, plan experiments, run code or lab steps, check results, and keep improving over time.

#AI for Science #Autonomous Scientific Discovery #Agentic AI

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Intermediate

SII-OpenMOSS Team, Donghua Yu et al. · Feb 9 · arXiv

MOVA is an open-source AI model that generates video and audio together, so that lip movements, actions, and sounds stay synchronized.

#video-audio generation #lip synchronization #dual-tower architecture

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Intermediate

Linli Yao, Yuancheng Wei et al. · Feb 9 · arXiv

This paper teaches AI to write movie-like scripts for videos by adding exact timestamps and rich details about what you see and hear.

#Omni Dense Captioning #time-aware video captioning #audio-visual understanding

GISA: A Benchmark for General Information-Seeking Assistant

Intermediate

Yutao Zhu, Xingshuo Zhang et al. · Feb 9 · arXiv

GISA is a new benchmark that checks how well AI assistants can search the web the way real people do.

#GISA #information-seeking agents #web search benchmark

Beyond Correctness: Learning Robust Reasoning via Transfer

Intermediate

Hyunseok Lee, Soheil Abbasloo et al. · Feb 9 · arXiv

This paper teaches language models not just to get the final answer right but to think in a way others can reliably follow.

#Reinforcement Learning with Transferable Reward #RLTR #Reasoning Transferability
