Groups
Transformer expressiveness studies what kinds of sequence-to-sequence mappings a Transformer can represent or approximate.
Neural network expressivity studies what kinds of functions different network architectures can represent and how efficiently they can do so.