ObjEmbed: Towards Universal Multimodal Object Embeddings
IntermediateShenghao Fu, Yukun Su et al.Feb 2arXiv
ObjEmbed teaches an AI to understand not just whole pictures, but each object inside them, and to link those objects to the right words.
#object embeddings#IoU embedding#visual grounding