AdaTooler-V: Adaptive Tool-Use for Images and Videos
IntermediateChaoyang Wang, Kaituo Feng et al.Dec 18arXiv
AdaTooler-V teaches an image-and-video AI to first ask, โDo I really need a tool?โ before using one, which saves time and boosts accuracy.
#adaptive tool-use#multimodal chain-of-thought#visual tool interactions