🎓How I Study AIHISA
📖Read
📄Papers📰Blogs🎬Courses
💡Learn
🛤️Paths📚Topics💡Concepts🎴Shorts
🎯Practice
📝Daily Log🎯Prompts🧠Review
SearchSettings
How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers1055

AllBeginnerIntermediateAdvanced
All SourcesarXiv

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Intermediate
Shiting Huang, Zecheng Li et al.Feb 10arXiv

The paper teaches large language models to do what good students do: find where they went wrong, turn that lesson into a rule, and remember it for next time.

#Reinforcement Learning with Verifiable Rewards#RLVR#Meta-Experience Learning

QP-OneModel: A Unified Generative LLM for Multi-Task Query Understanding in Xiaohongshu Search

Intermediate
Jianzhao Huang, Xiaorui Huang et al.Feb 10arXiv

Search engines on social apps used to rely on many separate mini-models that often misunderstood slang and were hard to keep updated.

#Query Processing#Unified Generative Model#Named Entity Recognition

The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies

Intermediate
Chenxu Wang, Chaozhuo Li et al.Feb 10arXiv

The paper shows a three-way no-win situation: an AI society cannot be closed off, keep learning forever, and stay perfectly safe for humans all at the same time.

#self-evolving AI#multi-agent systems#AI safety

Stroke3D: Lifting 2D strokes into rigged 3D model via latent diffusion models

Intermediate
Ruisi Zhao, Haoren Zheng et al.Feb 10arXiv

Stroke3D lets you draw simple 2D stick-figure strokes plus a short text, and it builds a ready-to-animate 3D model with a skeleton and textures.

#Stroke3D#rigged 3D generation#skeleton-first pipeline

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

Intermediate
Xavier Hu, Jinxiang Xia et al.Feb 10arXiv

EcoGym is a new open test playground where AI agents run small businesses over many days to see if they can plan well for the long term.

#EcoGym#long-horizon planning#LLM agents

ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation

Intermediate
Zihan Yang, Shuyuan Tu et al.Feb 9arXiv

ArcFlow is a new way to make text-to-image models draw great pictures in only 2 steps instead of 50, giving about a 40× speed boost.

#ArcFlow#few-step distillation#non-linear flow

GEBench: Benchmarking Image Generation Models as GUI Environments

Intermediate
Haodong Li, Jingwei Wu et al.Feb 9arXiv

This paper introduces GEBench, a new test to check if image generation models can act like real app screens that change when you click or type.

#GEBench#GE-Score#GUI generation

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Intermediate
Shiyang Feng, Runmin Ma et al.Feb 9arXiv

InternAgent-1.5 is a single AI system that can read papers, plan experiments, run code or lab steps, check results, and keep improving over time.

#AI for Science#Autonomous Scientific Discovery#Agentic AI

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Intermediate
SII-OpenMOSS Team, Donghua Yu et al.Feb 9arXiv

MOVA is an open-source AI that makes videos and sounds at the same time so mouths, actions, and noises match perfectly.

#video-audio generation#lip synchronization#dual-tower architecture

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Intermediate
Linli Yao, Yuancheng Wei et al.Feb 9arXiv

This paper teaches AI to write movie-like scripts for videos by adding exact timestamps and rich details about what you see and hear.

#Omni Dense Captioning#time-aware video captioning#audio-visual understanding

GISA: A Benchmark for General Information-Seeking Assistant

Intermediate
Yutao Zhu, Xingshuo Zhang et al.Feb 9arXiv

GISA is a new test (benchmark) that checks how well AI assistants can search the web like real people do.

#GISA#information-seeking agents#web search benchmark

Beyond Correctness: Learning Robust Reasoning via Transfer

Intermediate
Hyunseok Lee, Soheil Abbasloo et al.Feb 9arXiv

This paper teaches language models not just to get the final answer right but to think in a way others can reliably follow.

#Reinforcement Learning with Transferable Reward#RLTR#Reasoning Transferability
1718192021