Gemini 3 Flash is a fast, cost-effective AI model developed by Google DeepMind as part of the Gemini 3 model family. It prioritizes speed and efficiency while retaining strong reasoning capabilities, and it is intended to be accessible across Google products.
Gemini 3 Flash targets tasks that require quick processing, such as coding, complex analysis, and rapid responses in interactive applications. It is suitable for agentic workflows and excels at applied reasoning. Its key features include a "speed-first" architecture, advanced distillation techniques, and optimization for rapid token generation, which reduces latency. The model also supports multimodal function responses, including images and PDFs. It can handle a large number of function calls reliably and transform unstructured data into organized databases.
Gemini 3 Flash is available in preview through the Gemini API in Google AI Studio, Google Antigravity, Vertex AI, and Gemini Enterprise, and through developer tools such as Gemini CLI and Android Studio. It is also rolling out to the Gemini app and AI Mode in Search. As of January 2026, Gemini 3 Flash has full production availability and serves as the default model in the Gemini app. Pricing is $0.50 per 1 million input tokens and $3.00 per 1 million output tokens, with audio input priced at $1.00 per 1 million tokens.
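To make the per-token pricing concrete, the following is a minimal sketch of a cost estimator built only from the three rates quoted above. The function name `estimate_cost`, its parameters, and the `RATES_PER_MILLION` table are hypothetical names chosen for illustration and are not part of any Google SDK. The sketch also assumes that non-audio input is billed at the $0.50 rate and that no other charges or discounts apply, which may not match an actual bill.

```python
# Rates in USD per 1 million tokens, copied from the pricing stated above.
# The dictionary keys are illustrative names, not official billing categories.
RATES_PER_MILLION = {
    "input": 0.50,        # non-audio input tokens (assumed)
    "audio_input": 1.00,  # audio input tokens
    "output": 3.00,       # output tokens
}


def estimate_cost(input_tokens=0, audio_input_tokens=0, output_tokens=0):
    """Return the estimated cost in USD for the given token counts."""
    counts = {
        "input": input_tokens,
        "audio_input": audio_input_tokens,
        "output": output_tokens,
    }
    for name, count in counts.items():
        if count < 0:
            raise ValueError(f"{name} token count must be non-negative")
    # Sum tokens * rate first, then scale once from "per million" to "per token".
    weighted = sum(counts[name] * RATES_PER_MILLION[name] for name in counts)
    return weighted / 1_000_000


if __name__ == "__main__":
    # 200,000 input tokens and 50,000 output tokens.
    cost = estimate_cost(input_tokens=200_000, output_tokens=50_000)
    print(f"Estimated cost: ${cost:.2f}")  # prints "Estimated cost: $0.25"
```

In this example the input side contributes $0.10 (0.2 million tokens at $0.50) and the output side contributes $0.15 (0.05 million tokens at $3.00). Because output tokens cost six times as much as non-audio input tokens, workloads that generate long responses tend to be dominated by the output rate.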