T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation
BeginnerZhe Cao, Tao Wang et al.Dec 24arXiv
T2AV-Compass is a new, unified test to fairly grade AI systems that turn text into matching video and audio.
#Text-to-Audio-Video generation#multimodal evaluation#cross-modal alignment