Academic rebuttals are not just about being polite; they are about smart, strategic persuasion under hidden information.
This survey explains how to make AI agents not just smart, but also efficient with their time, memory, and tool use.
The paper introduces M^4olGen, a two-stage system that designs new molecules to match exact numbers for several properties (like QED, LogP, MW, HOMO, LUMO) at the same time.
This paper introduces MATTRL, a way for multiple AI agents to learn from their own conversations at test time using short, reusable text notes instead of retraining their weights.
This survey asks how close AI memory systems are to human memory and organizes the answer into three parts: implicit memory (inside the model), explicit memory (external storage the model can look up), and agentic memory (what an AI agent keeps over time to plan and act).
The paper shows that when we give AI lots of extra text, even harmless extra text, it can get badly confused, sometimes losing up to 80% of its accuracy.
MemGovern teaches code agents to learn from past human fixes on GitHub by turning messy discussions into clean, reusable 'experience cards.'
LLMs can look confident but still change their answers when the surrounding text nudges them, showing that confidence alone is not the same as truthfulness.
Long-term AI helpers remember past chats, but using all memories can trap them in old ideas (Memory Anchoring).
KnowMe-Bench is a new test that checks if AI helpers truly understand a person, not just remember facts.
Real people often ask vague questions with pictures, and today’s vision-language models (VLMs) struggle with them.
COMPASS is a new framework that turns a company’s rules into thousands of smart test questions to check if chatbots follow those rules.