Gemini 2.5 Flash Lite Preview 06-17 Check detailed information and pricing for AI models

Context Length 1,048,576 tokens, google from provided

1,048,576
Context Tokens
$0.10
Prompt Price
$0.40
Output Price
8/16
Feature Support
finance #13
programming #18

Model Overview

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off cost for intelligence.

Basic Information

Developer
google
Model Series
Gemini
Release Date
2025-06-17
Context Length
1,048,576 tokens
Max Completion Tokens
65,535 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.10 / 1M tokens
Completion Tokens
$0.40 / 1M tokens

Supported Features

Supported (8)

Image Input
Seed
Frequency Penalty
Presence Penalty
Response Format
Tool Usage
Structured Outputs
Reasoning

Unsupported (8)

Top K
Repetition Penalty
Min P
Logit Bias
Logprobs
Top Logprobs
Web Search Options
Top A

Actual Usage Statistics

#128
Out of 346 total models
2.0B
Total Tokens Last 30 Days
2.0B
Daily Average Usage
-
Weekly Usage Change

Usage Trend for the Last 30 Days

Models by Same Author (google)

Gemini 2.5 Flash
1,048,576 tokens
$0.30 / $2.50
Gemini 2.5 Pro
1,048,576 tokens
$1.25 / $10.00
Gemini 2.5 Pro Preview 06-05
1,048,576 tokens
$1.25 / $10.00
Gemma 1 2B
8,192 tokens
$0.00 / $0.00
Gemma 3n 4B (free)
8,192 tokens
Free

Similar Price Range Models

GPT-4.1 Nano
openai
1,047,576 tokens
$0.10 / $0.40
Gemini 2.0 Flash
google
1,048,576 tokens
$0.10 / $0.40
R1 Distill Llama 70B
deepseek
131,072 tokens
$0.10 / $0.40
Qwen2.5 72B Instruct
qwen
32,768 tokens
$0.12 / $0.39
Qwen3 32B
qwen
40,960 tokens
$0.10 / $0.30