Papers14

#instruction tuning

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Hongyuan Tao, Bencheng Liao et al.Dec 9arXiv

InfiniteVL is a vision-language model that mixes two ideas: local focus with Sliding Window Attention and long-term memory with a linear module called Gated DeltaNet.

#InfiniteVL#linear attention#Gated DeltaNet

Not triaged yet

Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules

Beginner

Amr Mohamed, Yang Zhang et al.Dec 2arXiv

Diffusion language models (dLLMs) can write all parts of an answer in parallel, but they usually take many tiny cleanup steps, which makes them slow.

#diffusion language models#early exit decoding#progress-aware threshold

Not triaged yet

1 2