Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 1: Overview and Tokenization
BeginnerThis session introduces a brand-new course on building language models from scratch. You learn what language modeling is, where itβs used (speech recognition, translation, text generation, classification), and how different modeling families work. The class emphasizes implementing models yourself in Python and PyTorch, plus how to train and evaluate them.
#language modeling#tokenization#n-gram