Papers (1262)

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Shiting Huang, Zecheng Li et al. | Feb 10 | arXiv

The paper teaches large language models to do what good students do: find where they went wrong, turn that lesson into a rule, and remember it for next time.

#Reinforcement Learning with Verifiable Rewards #RLVR #Meta-Experience Learning

When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models

Beginner

Jiacheng Hou, Yining Sun et al. | Feb 10 | arXiv

Modern image editors can now follow visual prompts like arrows and scribbles, which opens a new way for attackers to hide harmful instructions inside images.

#vision-centric jailbreak #image editing safety #visual prompts

QP-OneModel: A Unified Generative LLM for Multi-Task Query Understanding in Xiaohongshu Search

Intermediate

Jianzhao Huang, Xiaorui Huang et al. | Feb 10 | arXiv

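Search engines on social apps used to rely on many separate mini-models that often misunderstood slang and were hard to keep updated; QP-OneModel replaces them with a single generative LLM that handles multiple query-understanding tasks.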

#Query Processing #Unified Generative Model #Named Entity Recognition

The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies

Intermediate

Chenxu Wang, Chaozhuo Li et al. | Feb 10 | arXiv

The paper shows a three-way no-win situation: an AI society cannot be closed off, keep learning forever, and stay perfectly safe for humans all at the same time.

#self-evolving AI #multi-agent systems #AI safety

Stroke3D: Lifting 2D strokes into rigged 3D model via latent diffusion models

Intermediate

Ruisi Zhao, Haoren Zheng et al. | Feb 10 | arXiv

Stroke3D lets you draw simple 2D stick-figure strokes plus a short text, and it builds a ready-to-animate 3D model with a skeleton and textures.

#Stroke3D #rigged 3D generation #skeleton-first pipeline

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

Intermediate

Xavier Hu, Jinxiang Xia et al. | Feb 10 | arXiv

EcoGym is a new open test playground where AI agents run small businesses over many days to see if they can plan well for the long term.

#EcoGym #long-horizon planning #LLM agents

Effective Reasoning Chains Reduce Intrinsic Dimensionality

Beginner

Archiki Prasad, Mandar Joshi et al. | Feb 9 | arXiv

The paper asks a simple question: which kind of step-by-step reasoning helps small language models learn best, and why?

#intrinsic dimensionality #chain-of-thought #LoRA

SceneSmith: Agentic Generation of Simulation-Ready Indoor Scenes

Beginner

Nicholas Pfaff, Thomas Cohn et al. | Feb 9 | arXiv

SceneSmith is a smart team of AI helpers that turns a short text like 'a cozy study with books and a desk' into a full 3D home scene you can drop right into a robot simulator.

#agentic scene synthesis #text-to-3D generation #indoor scene generation

WorldCompass: Reinforcement Learning for Long-Horizon World Models

Beginner

Zehan Wang, Tengfei Wang et al. | Feb 9 | arXiv

WorldCompass teaches video world models to follow actions more accurately and keep their generated frames visually appealing by applying reinforcement learning after pretraining.

#world models #reinforcement learning #clip-level rollout

Contact-Anchored Policies: Contact Conditioning Creates Strong Robot Utility Models

Beginner

Zichen Jeff Cui, Omar Rayyan et al. | Feb 9 | arXiv

Robots often get confused by wordy instructions, so this paper tells them exactly where to touch instead of what to do in sentences.

#contact-anchored policies #robot utility models #contact anchor

ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation

Intermediate

Zihan Yang, Shuyuan Tu et al. | Feb 9 | arXiv

ArcFlow is a new way to make text-to-image models draw great pictures in only 2 steps instead of 50, giving about a 40× speed boost.

#ArcFlow #few-step distillation #non-linear flow

GEBench: Benchmarking Image Generation Models as GUI Environments

Intermediate

Haodong Li, Jingwei Wu et al. | Feb 9 | arXiv

This paper introduces GEBench, a new test to check if image generation models can act like real app screens that change when you click or type.

#GEBench #GE-Score #GUI generation
