A Mixture of Experts (MoE) routes each input to a small subset of specialized models called experts, enabling conditional computation.
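The routing-and-combine step can be sketched in a few lines. This is a minimal toy, not a real MoE layer: the experts are plain scalar functions, the gate scores are given directly rather than produced by a learned router, and `moe_forward`, `softmax`, and `k` are names chosen here for illustration.

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def moe_forward(x, experts, gate_scores, k=2):
    """Route input x to the top-k experts by gate probability and
    combine their outputs, weighted by renormalized probabilities."""
    probs = softmax(gate_scores)
    topk = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:k]
    total = sum(probs[i] for i in topk)
    # Only the k selected experts are evaluated: conditional computation.
    return sum((probs[i] / total) * experts[i](x) for i in topk)

# Toy experts: each is just a scalar function here.
experts = [lambda x: 2 * x, lambda x: x + 1, lambda x: -x]
out = moe_forward(3.0, experts, gate_scores=[2.0, 1.0, 0.1], k=2)
```

With `k=2`, only the two highest-scoring experts run; the third contributes no compute at all, which is the point of conditional computation.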
The policy gradient theorem tells us how to adjust a stochastic policy's parameters to increase expected return: follow the gradient of the expected reward with respect to those parameters.
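In its common form the theorem can be written as follows (notation assumed here: parameters $\theta$, policy $\pi_\theta$, objective $J$, and action-value function $Q^{\pi_\theta}$; the source does not fix a notation):

```latex
\nabla_\theta J(\theta)
  = \mathbb{E}_{s \sim d^{\pi_\theta},\, a \sim \pi_\theta}
    \left[ \nabla_\theta \log \pi_\theta(a \mid s) \, Q^{\pi_\theta}(s, a) \right]
```

The key point is that the gradient of the expected return reduces to an expectation over the policy's own state-action distribution, so it can be estimated from sampled trajectories without differentiating through the environment dynamics.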