Sunday, October 12, 2025

AI: Gemini 2.5 Computer Use model from Google

 Introducing the Gemini 2.5 Computer Use model

Google is releasing the Gemini 2.5 Computer Use model via the Gemini API, enabling developers to build agents that can interact with user interfaces. This model outperforms others in web and mobile control benchmarks with lower latency. You can access it now on Google AI Studio and Vertex AI to start building and share feedback in the Developer Forum.

The model’s core capabilities are exposed through the new `computer_use` tool in the Gemini API and should be operated within a loop. Inputs to the tool are the user request, screenshot of the environment, and a history of recent actions. The input can also specify whether to exclude functions from the full list of supported UI actions or specify additional custom functions to include.



No comments: