Introducing computer use in Gemini 3.5 Flash

Computer use is now a built-in tool in Gemini 3.5 Flash for building agents that can interact across platforms.
Computer use is now a built-in tool supported in Gemini 3.5 Flash, delivering our best performance yet for agentic computer use tasks. Previously only available as a standalone Gemini 2.5 computer use model, computer use is now integrated natively in the main Gemini Flash model. Gemini already excels at function calling and using built-in tools like Search and Maps grounding. With built-in computer use capability, developers can now use 3.5 Flash to reliably build custom agents that can see, reason and take action across browser, mobile and desktop environments. This unlocks improved performance for long-horizon and enterprise automation tasks like continuous software testing and knowledge work across professional applications.

Developers and enterprises can start using computer use in 3.5 Flash via the Gemini API and Gemini Enterprise Agent Platform.
3.5 Flash uses computer use to analyse the Gemini app and return a categorized list of features.
Making computer use safe in 3.5 Flash
To mitigate some of the prompt injection risks for agents operating in live environments, we use targeted adversarial training for computer use in Gemini 3.5 Flash. We’re also releasing two optional enterprise safeguard systems that enable enterprises to:
- Require explicit user confirmation for sensitive or irreversible actions.
- Automatically stop tasks if an indirect prompt injection is identified.
Taking a “defense-in-depth” approach, we encourage developers to combine these features with secure sandboxing, human-in-the-loop verification and strict access controls. Additional information on safety measures can be found in our best practices documentation.
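As one illustration of the "strict access controls" layer, an agent harness can refuse navigation to any host that is not explicitly allowlisted. The snippet below is a minimal sketch under assumptions: `ALLOWED_HOSTS` and `is_navigation_allowed` are hypothetical names invented for this example, and nothing here is part of the Gemini API or the enterprise safeguards described above.

```python
from urllib.parse import urlparse

# Hypothetical allowlist of hosts the agent is permitted to visit.
ALLOWED_HOSTS = {"example.com", "intranet.example.com"}


def is_navigation_allowed(url: str, allowed_hosts: set[str] = ALLOWED_HOSTS) -> bool:
    """Allow only https URLs whose host is allowlisted or a subdomain of an allowlisted host."""
    parsed = urlparse(url)
    if parsed.scheme != "https":
        return False
    host = (parsed.hostname or "").lower()
    # Match on a dot boundary so "evil-example.com" does not pass as "example.com".
    return any(host == h or host.endswith("." + h) for h in allowed_hosts)
```

A harness would call this check before executing any navigation action the model proposes and drop or escalate the action when it returns `False`.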
We are already seeing customers drive value with computer use. Here’s what some of them have to say:



To start building with computer use today:
- Try it now: Test the capabilities in a demo environment hosted by Browserbase.
- Start building: Dive into our reference implementation and documentation via Gemini API and Gemini Enterprise Agent Platform.

Key Capabilities and Features
- Cross-Platform Execution: Custom agents can interact seamlessly across browser, mobile, and desktop environments (ENVIRONMENT_BROWSER, ENVIRONMENT_MOBILE, ENVIRONMENT_DESKTOP). [5, 6]
- Intent-Based Interaction: The model takes screenshots of the active environment, identifies interactive UI components (buttons, text boxes, sliders), and executes actions like clicking, typing, or scrolling. [2, 4]
- Adjustable Thinking Levels: Developers can adjust how much reasoning the model performs per step, trading off output quality against cost and latency. [2, 5]
- Human-in-the-Loop (HITL): Supports highly customizable client-side functions, making it simple to hand over control to a human supervisor for manual intervention when necessary. [7]
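The screenshot, decide, act cycle and the human hand-over described in the list above can be sketched as a simple loop. This is an illustrative sketch only: `Action`, `run_agent`, and all four callbacks are hypothetical names, and `propose_action` stands in for a model call rather than reproducing the Gemini API's actual request or response format.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Action:
    """A hypothetical UI action proposed by the model."""
    kind: str          # "click", "type", "scroll", "done", or "ask_human"
    x: int = 0
    y: int = 0
    text: str = ""


def run_agent(
    goal: str,
    take_screenshot: Callable[[], bytes],
    propose_action: Callable[[str, bytes], Action],
    execute: Callable[[Action], None],
    ask_human: Callable[[Action], Optional[Action]],
    max_steps: int = 20,
) -> list[Action]:
    """Run the observe/decide/act loop and return the actions that were executed."""
    history: list[Action] = []
    for _ in range(max_steps):
        screenshot = take_screenshot()               # observe the environment
        action = propose_action(goal, screenshot)    # model decides the next step
        if action.kind == "done":
            break
        if action.kind == "ask_human":
            # Hand control to a supervisor, who may supply an action or abort.
            action = ask_human(action)
            if action is None:
                break
        execute(action)                              # act on the environment
        history.append(action)
    return history
```

Keeping the environment, model, and supervisor behind callbacks means the same loop can drive a browser, a mobile emulator, or a desktop session, and `max_steps` bounds runaway tasks.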
Speed and Cost Benchmarks
- Benchmark Performance: Gemini 3.5 Flash scores 78.4 on the OSWorld-Verified leaderboard, 0.3 points behind GPT-5.5 (78.7), a premium frontier model. [2]
- Speed Superiority: Processing at roughly 289 tokens per second, it operates nearly four times faster than competing frontier models. [4]
- Aggressive Pricing: Priced at $1.50 per million input tokens and $9 per million output tokens, it is less than half the operating cost of comparable industry alternatives. [4]
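To make the pricing and speed figures above concrete, here is a rough back-of-the-envelope calculator. The rates come from the list above; the function names are hypothetical, the token counts in the usage note are invented for illustration, and treating 289 tokens per second as output throughput is an assumption.

```python
INPUT_USD_PER_MILLION = 1.50     # quoted input price per million tokens
OUTPUT_USD_PER_MILLION = 9.00    # quoted output price per million tokens
OUTPUT_TOKENS_PER_SECOND = 289   # quoted speed, assumed here to mean output throughput


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the token cost of one task in US dollars."""
    total = input_tokens * INPUT_USD_PER_MILLION + output_tokens * OUTPUT_USD_PER_MILLION
    return total / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Estimate the time spent generating output, ignoring network and tool latency."""
    return output_tokens / OUTPUT_TOKENS_PER_SECOND
```

For a hypothetical task that consumes 200,000 input tokens (mostly screenshots) and produces 10,000 output tokens, `estimate_cost_usd(200_000, 10_000)` comes to about $0.39, and `estimate_generation_seconds(10_000)` to roughly 35 seconds of generation time.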
Integrated Safety and Enterprise Guardrails
- Explicit Confirmation Prompts: Forces the agent to halt and request human validation before executing irreversible operations, such as completing financial purchases or deleting records. [2, 9]
- Indirect Prompt Injection Filters: An automated monitoring layer that instantly terminates active workflows if it detects hidden malicious instructions embedded within web pages or document layouts. [2, 4]
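On the client side, the two guardrails above amount to a small decision rule applied before each action runs. The following is a sketch under assumptions, not the product's implementation: `Verdict`, `SENSITIVE_KINDS`, and `review_action` are hypothetical names, and `injection_detected` is assumed to arrive from some upstream detector.

```python
from enum import Enum
from typing import Callable


class Verdict(Enum):
    EXECUTE = "execute"   # run the action
    SKIP = "skip"         # user declined; do not run this action
    HALT = "halt"         # stop the whole task


# Hypothetical set of action kinds treated as sensitive or irreversible.
SENSITIVE_KINDS = {"purchase", "delete_record"}


def review_action(kind: str, injection_detected: bool, confirm: Callable[[str], bool]) -> Verdict:
    """Decide what to do with a proposed action before it is executed."""
    if injection_detected:
        # A suspected prompt injection stops the task regardless of the action.
        return Verdict.HALT
    if kind in SENSITIVE_KINDS and not confirm(kind):
        # Sensitive actions run only with explicit user confirmation.
        return Verdict.SKIP
    return Verdict.EXECUTE
```

Checking the injection flag first means a compromised page cannot reach the confirmation prompt and try to talk the user into approving the action.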
How to Get Started
Before building, decide two things:
- Which operating system or browser environment the agent will target.
- What the primary automation use case is (e.g., continuous QA testing, form filling, data scraping).

