Concepts (4)

Groups

Wake-Sleep Algorithm

The Wake–Sleep algorithm trains a pair of models: a generative model that explains how data are produced and a recognition model that guesses hidden causes from observed data.

#wake-sleep, #helmholtz machine, #generative model, +12 more

⚙️ Algorithm · Intermediate

PPO & Trust Region Methods

Proximal Policy Optimization (PPO) stabilizes policy gradient learning by preventing each update from moving the policy too far from the previous one.

#ppo
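
One common way PPO limits the size of each update is the clipped surrogate objective. The sketch below is a minimal illustration of that idea, not a full training loop: the function name `ppo_clip_objective`, its arguments, and the default `epsilon=0.2` are assumptions chosen for the example.

```python
def ppo_clip_objective(ratios, advantages, epsilon=0.2):
    """Mean clipped surrogate objective over a batch (to be maximized).

    ratios:     new_policy_prob / old_policy_prob for each sampled action
    advantages: advantage estimate for each sampled action
    epsilon:    clip range (0.2 is an assumed, commonly used value)
    """
    total = 0.0
    for r, a in zip(ratios, advantages):
        # Keep the probability ratio inside [1 - epsilon, 1 + epsilon].
        clipped_r = max(1.0 - epsilon, min(r, 1.0 + epsilon))
        # Taking the minimum removes any gain from pushing the ratio
        # beyond the clip range, so large policy shifts are not rewarded.
        total += min(r * a, clipped_r * a)
    return total / len(ratios)


# A ratio of 1.5 with positive advantage is credited as if it were 1.2.
value = ppo_clip_objective([1.5], [1.0])
```

With a positive advantage, raising the ratio past `1 + epsilon` no longer increases the objective, which is what keeps the new policy close to the previous one.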

t-SNE & UMAP

Natural Gradient Method