Papers196

All Beginner Intermediate Advanced

All Sources arXiv

COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence

Beginner

Zefeng Zhang, Xiangzhao Hao et al.Dec 4arXiv

COOPER is a single AI model that both “looks better” (perceives depth and object boundaries) and “thinks smarter” (reasons step by step) to answer spatial questions about images.

#COOPER#multimodal large language model#unified model

VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory

Beginner

Yifei Yu, Xiaoshan Wu et al.Dec 4arXiv

VideoSSM is a new way to make long, stable, and lively videos by giving the model two kinds of memory: a short-term window and a long-term state-space memory.

#autoregressive video diffusion#state-space model#hybrid memory

Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules

Beginner

Amr Mohamed, Yang Zhang et al.Dec 2arXiv

Diffusion language models (dLLMs) can write all parts of an answer in parallel, but they usually take many tiny cleanup steps, which makes them slow.

#diffusion language models#early exit decoding#progress-aware threshold

Recurrent Neural Networks (RNNs): A gentle Introduction and Overview

Beginner

Robin M. SchmidtNov 23arXiv

Recurrent Neural Networks (RNNs) are special neural networks that learn from sequences, like sentences or time series, by remembering what came before.

#Recurrent Neural Network#Backpropagation Through Time#Truncated BPTT

13 14 15 16 17