🎓How I Study AIHISA
📖Read
📄Papers📰Blogs🎬Courses
💡Learn
🛤️Paths📚Topics💡Concepts🎴Shorts
🎯Practice
📝Daily Log🎯Prompts🧠Review
SearchSettings
How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers1262

AllBeginnerIntermediateAdvanced
All SourcesarXiv

Arcee Trinity Large Technical Report

Intermediate
Varun Singh, Lucas Krauss et al.Feb 19arXiv

Trinity is a family of open language models that are huge on the inside but only wake up a few 'experts' for each word, so they are fast and affordable to run.

#Mixture-of-Experts#SMEBU#Gated Attention

Not triaged yet

Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation

Intermediate
Yan Wang, Yi Han et al.Feb 19arXiv

This paper builds Conv-FinRe, a new test that checks if AI financial advisors give advice that fits a person’s true goals, not just what they clicked before.

#financial recommendation#utility-based evaluation#conversational benchmark

Not triaged yet

DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers

Intermediate
Dahye Kim, Deepti Ghadiyaram et al.Feb 19arXiv

This paper speeds up image and video generators called diffusion transformers by changing how big their puzzle pieces (patches) are at each step.

#Diffusion Transformer#Dynamic Tokenization#Patch Scheduling

Not triaged yet

Discovering Multiagent Learning Algorithms with Large Language Models

Intermediate
Zun Li, John Schultz et al.Feb 18arXiv

The paper shows how a code-writing AI (a large language model) can invent brand‑new multi‑agent learning algorithms instead of humans having to hand‑design them.

#Multi-Agent Reinforcement Learning#Counterfactual Regret Minimization#Policy Space Response Oracles

Not triaged yet

DODO: Discrete OCR Diffusion Models

Beginner
Sean Man, Roy Ganz et al.Feb 18arXiv

OCR is like reading a page exactly as it is, and that strictness makes it perfect for fast, parallel generation.

#OCR#vision-language models#discrete diffusion

Not triaged yet

SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation

Intermediate
Kushal Kedia, Tyler Ga Wei Lum et al.Feb 18arXiv

SimToolReal teaches a robot hand to use many different tools by practicing in simulation and then working in the real world without extra training.

#dexterous manipulation#sim-to-real reinforcement learning#goal-conditioned policy

Not triaged yet

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

Intermediate
Jianliang He, Leda Wang et al.Feb 18arXiv

This paper explains, in detail, how a simple two-layer neural network learns to add numbers on a clock (modular addition) by building and combining wave-like patterns called Fourier features.

#modular addition#Fourier features#discrete Fourier transform

Not triaged yet

Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation

Intermediate
Runpei Dong, Ziyan Li et al.Feb 18arXiv

This paper teaches a humanoid robot to find and pick up many different objects in new places using plain-language requests like 'grab the orange mug.'

#humanoid loco-manipulation#end-effector tracking#open-vocabulary perception

Not triaged yet

Reinforced Fast Weights with Next-Sequence Prediction

Intermediate
Hee Seung Hwang, Xindi Wu et al.Feb 18arXiv

Fast weight models remember context with a tiny, fixed memory, but standard next-token training teaches them to think only one word ahead.

#fast weight models#next-sequence prediction#reinforcement learning for LMs

Not triaged yet

Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents

Intermediate
Wenxuan Ding, Nicholas Tomlin et al.Feb 18arXiv

This paper teaches AI agents to make smart choices about when to explore for more information and when to act right away.

#Calibrate-Then-Act#cost-aware exploration#LLM agents

Not triaged yet

Learning Situated Awareness in the Real World

Intermediate
Chuhan Li, Ruilin Han et al.Feb 18arXiv

SAW-Bench is a new test that checks if AI can understand the world from a first-person view, like wearing smart glasses.

#situated awareness#egocentric video#observer-centric reasoning

Not triaged yet

Towards a Science of AI Agent Reliability

Intermediate
Stephan Rabanser, Sayash Kapoor et al.Feb 18arXiv

Accuracy alone can make AI agents look good on paper while still failing in real life; this paper shows how to measure reliability properly.

#AI agent reliability#consistency#robustness

Not triaged yet

1415161718