Action100M is a gigantic video dataset with about 100 million labeled action moments, roughly 80 per video on average, built automatically from 1.2 million instructional videos.
This paper teaches an AI model to understand both which way an object is facing (orientation) and how it turns between views (rotation), all in one system.