Papers2

#RoPE positional embeddings

ArXiv-to-Model: A Practical Study of Scientific LM Training

This paper shows, step by step, how to train a 1.36-billion-parameter science-focused language model directly from raw arXiv LaTeX files using only 2 A100 GPUs.

#scientific language model#arXiv LaTeX#tokenization

LTX-2: Efficient Joint Audio-Visual Foundation Model

Intermediate

Yoav HaCohen, Benny Brazowski et al.Jan 6arXiv

LTX-2 is an open-source model that makes video and sound together from a text prompt, so the picture and audio match in time and meaning.

#text-to-video#text-to-audio#audiovisual generation