Synthetic Visual Genome 2: Extracting Large-scale Spatio-Temporal Scene Graphs from Videos
BeginnerZiqi Gao, Jieyu Zhang et al.Feb 26arXiv
This paper builds a giant, automatically made video library called SVG2 that tells who is in a video, what they look like, and how they interact over time.
#video scene graph#spatio-temporal reasoning#panoptic segmentation