menu
close

Google Expands Gemini 2.5 Lineup with Cost-Efficient Flash-Lite Model

Google has expanded its Gemini 2.5 family by introducing Flash-Lite, their most cost-efficient and fastest 2.5 model yet, while also making Gemini 2.5 Flash and Pro generally available. The company simultaneously launched Gemini CLI, an open-source AI agent that brings Gemini directly into developers' terminals for coding, problem-solving, and task management. These releases represent Google's ongoing effort to make advanced AI capabilities more accessible and integrated into everyday developer workflows.
Google Expands Gemini 2.5 Lineup with Cost-Efficient Flash-Lite Model

Google has strengthened its AI offerings by expanding the Gemini 2.5 model family with the introduction of Gemini 2.5 Flash-Lite, their most cost-efficient and fastest model in the 2.5 lineup.

Announced on July 2, 2025, Flash-Lite joins the now generally available Gemini 2.5 Flash and Pro models, completing a three-tiered approach to serve different AI application needs. Flash-Lite is specifically designed for high-volume, latency-sensitive tasks like translation and classification, with benchmark tests showing lower latency than previous Flash models.

Despite its optimization for speed and cost, Flash-Lite maintains the core capabilities of the Gemini 2.5 family, including a 1 million-token context window, multimodal input support, and compatibility with tools like Google Search and code execution. Unlike its siblings, Flash-Lite has thinking capabilities turned off by default to maximize efficiency, though users can enable this feature when needed.

Alongside this model expansion, Google introduced Gemini CLI, an open-source AI agent that brings Gemini directly into developers' terminals. Released under the Apache 2.0 license, this tool provides lightweight access to Gemini for coding, content creation, problem-solving, and task management. Developers can access Gemini 2.5 Pro free of charge with a personal Google account, with generous usage limits of 60 model requests per minute and 1,000 requests per day.

The CLI tool supports extensive customization through system prompts and configuration settings, making it adaptable to diverse workflows. It also integrates with Google's AI coding assistant, Gemini Code Assist, providing a unified experience across different development environments.

These releases reflect Google's strategy to make advanced AI capabilities more accessible while providing options tailored to specific performance and cost requirements. The Gemini 2.5 family now offers a complete spectrum from the high-performance Pro model for complex tasks to the cost-efficient Flash-Lite for high-throughput applications.

Source:

Latest News