OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution
IntermediateLe Zhang, Yixiong Xiao et al.Jan 28arXiv
OmegaUse is a new AI that can use phones and computers by looking at screenshots and deciding where to click, type, or scrollโmuch like a careful human user.
#GUI agent#UI grounding#navigation policy