Groups
Category
Layer Normalization rescales and recenters each sample across its feature dimensions, making it independent of batch size.
Expectation is the long-run average value of a random variable and acts like the balance point of its distribution.