Gemini 3.1 Flash-Lite
Google's most cost-efficient Gemini 3 model with 1M context, multimodal input, and 2.5x faster time-to-first-token than Gemini 2.5 Flash
Gemini 3.1 Flash-Lite
Google • March 2026
Training Data
Up to early 2025
Gemini 3.1 Flash-Lite
March 2026
Parameters
Not disclosed
Training Method
Multimodal pre-training with RLHF
Context Window
1,000,000 tokens
Knowledge Cutoff
January 2025
Key Features
1M Context Window • Multimodal Input • Extended Thinking
Capabilities
Speed: Outstanding
Cost Efficiency: Outstanding
Multimodal: Good
What's New in This Version
2.5x faster time-to-first-token and 45% faster output generation than Gemini 2.5 Flash at the lowest cost in the Gemini 3 family
Google's most cost-efficient Gemini 3 model with 1M context, multimodal input, and 2.5x faster time-to-first-token than Gemini 2.5 Flash
What's New in This Version
2.5x faster time-to-first-token and 45% faster output generation than Gemini 2.5 Flash at the lowest cost in the Gemini 3 family
Technical Specifications
Key Features
Capabilities
Other Google Models
Explore more models from Google
Gemini 3.1 Pro
Google's newest flagship with doubled reasoning performance and 1M token context for agentic workflows
Gemini 3 Pro
Google's latest flagship model with advanced multimodal capabilities and PhD-level reasoning
Gemini 3 Deep Think
Google's enhanced reasoning mode with extended thinking capabilities for complex problems