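Stochastic depth randomly drops entire residual blocks during training, leaving only the identity skip connection for each dropped block. At inference time the full network is used, with each block's output scaled by its survival probability.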
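The mechanism can be sketched for a single residual block as follows. This is a minimal NumPy illustration, not a reference implementation: the names `residual_block_forward`, `block_fn`, and `survival_prob` are hypothetical, and scaling the block's output by its survival probability at inference is assumed as the test-time rule.

```python
import numpy as np

def residual_block_forward(x, block_fn, survival_prob, training, rng):
    """Apply one residual block with stochastic depth.

    x             : input array
    block_fn      : the block's residual function (hypothetical callable)
    survival_prob : probability that the block is kept during training
    training      : True during training, False at inference
    rng           : a numpy.random.Generator used to sample the gate
    """
    if training:
        # Sample a Bernoulli gate: keep the block with probability survival_prob.
        if rng.random() < survival_prob:
            return x + block_fn(x)
        # Block dropped: only the identity skip connection remains.
        return x
    # Inference: the block is always active, scaled by its survival probability
    # so its expected contribution matches what was seen during training.
    return x + survival_prob * block_fn(x)
```

For example, `residual_block_forward(x, f, 0.8, True, np.random.default_rng(0))` skips `f` on roughly one call in five, while the same call with `training=False` always evaluates `f`.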
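A sufficient statistic captures all the information a sample carries about a parameter: once the statistic is known, the conditional distribution of the sample no longer depends on the parameter. It is typically a lower-dimensional summary of the data, so inference based on it loses nothing relative to the full sample.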
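A small illustration, assuming an i.i.d. Bernoulli(p) sample with known size n: the sum of the observations is sufficient for p, so the likelihood depends on the data only through that sum. The function names below are hypothetical and chosen for this sketch.

```python
import math

def bernoulli_likelihood(sample, p):
    """Likelihood of an i.i.d. Bernoulli(p) sample of 0/1 values."""
    return math.prod(p if x == 1 else 1.0 - p for x in sample)

def likelihood_from_statistic(n, total, p):
    """Same likelihood, computed from the sample size and the sum only."""
    return p ** total * (1.0 - p) ** (n - total)

# Two different samples that share the same sum (3 successes out of 5).
sample_a = [1, 0, 1, 1, 0]
sample_b = [0, 1, 1, 0, 1]
```

Evaluating `bernoulli_likelihood` on `sample_a` and `sample_b` gives the same value for any `p`, and both agree with `likelihood_from_statistic(5, 3, p)`, so the individual observations add nothing beyond the sum for inference about `p`.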