The model underpinning Operator is a Computer-Using Agent (CUA) that combines GPT-4o's vision capabilities, which let it "see" what's on the user's screen through screenshots, with graphical user interfaces (GUIs) that ...
"This capability marks the next step in AI development, allowing models to use the same tools humans rely on daily and opening the door to a vast range of new applications," the company said in a blog ...
The Nvidia- and Jeff Bezos-backed company said the tool, known as Perplexity Assistant, can book dinner reservations, hail rides on apps and set reminders, among other actions. "We'd love to make it ...