This paper shows that code-writing AI agents can take an existing math problem and automatically turn it into a new, harder one while keeping it solvable.
Text-to-image models can make pretty pictures but still miss details in complex prompts, like counts, positions, or exact text.
Multi-agent systems are like teams of smart helpers, but one bad message can mislead the whole team.
This paper puts real AI agents into a safe, live playground and asks expert testers to mess with them to see what breaks.
This paper studies Moltbook, a giant social network made only of AI agents, to see if they start acting like a real society over time.
The paper shows a three-way no-win situation: an AI society cannot be closed off, keep learning forever, and stay perfectly safe for humans all at the same time.
This paper builds SocialVeil, a testing world where AI chat agents must talk to each other even when communication is messy rather than perfect.
This paper builds an AI team that can make real full‑stack websites (frontend, backend, and database) from plain English instructions.
LatentMem is a new memory system that helps teams of AI agents remember the right things for their specific jobs without overloading them with text.
This paper builds a smart team of AI helpers, called MEnvAgent, that automatically sets up the right computer environments for code projects in many languages.
This paper builds an AI agent that learns new skills while working, like a kid who learns new tricks during recess without a teacher telling them what to do.
This paper turns rebuttal writing from ‘just write some text’ into ‘make a plan with proof, then write.’