Papers 1262

Arcee Trinity Large Technical Report

Varun Singh, Lucas Krauss et al. · Feb 19 · arXiv

Trinity is a family of open language models that are huge on the inside but only wake up a few 'experts' for each word, so they are fast and affordable to run.

#Mixture-of-Experts #SMEBU #Gated Attention

Conv-FinRe: A Conversational and Longitudinal Benchmark for Utility-Grounded Financial Recommendation

Intermediate

Yan Wang, Yi Han et al. · Feb 19 · arXiv

This paper builds Conv-FinRe, a new test that checks if AI financial advisors give advice that fits a person’s true goals, not just what they clicked before.

#financial recommendation #utility-based evaluation #conversational benchmark

DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers

Intermediate

Dahye Kim, Deepti Ghadiyaram et al. · Feb 19 · arXiv

This paper speeds up diffusion transformers, a class of image and video generators, by changing the size of the patches (their puzzle pieces) at each step.

#Diffusion Transformer #Dynamic Tokenization #Patch Scheduling

Discovering Multiagent Learning Algorithms with Large Language Models

Intermediate

Zun Li, John Schultz et al. · Feb 18 · arXiv

The paper shows how a code-writing AI (a large language model) can invent brand‑new multi‑agent learning algorithms instead of humans having to hand‑design them.

#Multi-Agent Reinforcement Learning #Counterfactual Regret Minimization #Policy Space Response Oracles

DODO: Discrete OCR Diffusion Models

Beginner

Sean Man, Roy Ganz et al. · Feb 18 · arXiv

OCR means reading a page exactly as it is, and that strictness makes it well suited to the fast, parallel generation that discrete diffusion models offer.

#OCR #vision-language models #discrete diffusion

SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation

Intermediate

Kushal Kedia, Tyler Ga Wei Lum et al. · Feb 18 · arXiv

SimToolReal teaches a robot hand to use many different tools by practicing in simulation and then working in the real world without extra training.

#dexterous manipulation #sim-to-real reinforcement learning #goal-conditioned policy

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

Intermediate

Jianliang He, Leda Wang et al. · Feb 18 · arXiv

This paper explains in detail how a simple two-layer neural network learns to add numbers on a clock (modular addition) by building and combining wave-like patterns called Fourier features.

#modular addition #Fourier features #discrete Fourier transform

Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation

Intermediate

Runpei Dong, Ziyan Li et al. · Feb 18 · arXiv

This paper teaches a humanoid robot to find and pick up many different objects in new places using plain-language requests like 'grab the orange mug.'

#humanoid loco-manipulation #end-effector tracking #open-vocabulary perception
