🎓How I Study AIHISA
📖Read
📄Papers📰Blogs🎬Courses
💡Learn
🛤️Paths📚Topics💡Concepts🎴Shorts
🎯Practice
🧩Problems🎯Prompts🧠Review
Search
How I Study AI - Learn AI Papers & Lectures the Easy Way

Papers2

AllBeginnerIntermediateAdvanced
All SourcesarXiv
#Preference Optimization

YaPO: Learnable Sparse Activation Steering Vectors for Domain Adaptation

Intermediate
Abdelaziz Bounhar, Rania Hossam Elmohamady Elbadry et al.Jan 13arXiv

This paper introduces YaPO, a way to gently nudge a language model’s hidden thoughts so it behaves better without retraining it.

#Activation Steering#Sparse Autoencoder#Preference Optimization

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Intermediate
Shidong Cao, Hongzhan Lin et al.Jan 7arXiv

DiffCoT treats a model’s step-by-step thinking (Chain-of-Thought) like a messy draft that can be cleaned up over time, not something fixed forever.

#Chain-of-Thought#Diffusion models#Autoregressive decoding