Gemini 1.5 Flash 8B

Context Tokens: 1,000,000
Prompt Price: $0.04 / 1M tokens
Output Price: $0.15 / 1M tokens
Feature Support: 7/16
translation #2
technology #16

Model Overview

Gemini 1.5 Flash 8B is optimized for speed and efficiency, with a focus on short-prompt tasks such as chat, transcription, and translation. Its reduced latency suits real-time and high-volume workloads, and it aims to keep costs low while maintaining output quality. For details, see Google's [general-availability announcement](https://developers.googleblog.com/en/gemini-15-flash-8b-is-now-generally-available-for-use/). Usage of Gemini is subject to Google's [Gemini Terms of Use](https://ai.google.dev/terms).

Basic Information

Developer: google
Model Series: Gemini
Release Date: 2024-10-03
Context Length: 1,000,000 tokens
Max Completion Tokens: 8,192 tokens
Variant: standard

Pricing Information

Prompt Tokens: $0.04 / 1M tokens
Completion Tokens: $0.15 / 1M tokens
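As a rough illustration of how the per-million-token prices translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost` and the constant names are hypothetical and not part of any provider API; the sketch assumes prompt and completion tokens are billed linearly at the listed rates, with no minimums, tiers, or caching discounts.

```python
from decimal import Decimal

# Listed prices for Gemini 1.5 Flash 8B, in USD per 1M tokens.
PROMPT_PRICE_PER_M = Decimal("0.04")
COMPLETION_PRICE_PER_M = Decimal("0.15")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes simple linear billing at the listed per-1M-token rates.
    """
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    prompt_cost = Decimal(prompt_tokens) * PROMPT_PRICE_PER_M / TOKENS_PER_M
    completion_cost = Decimal(completion_tokens) * COMPLETION_PRICE_PER_M / TOKENS_PER_M
    return prompt_cost + completion_cost


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 500-token completion.
    print(estimate_cost(10_000, 500))
```

`Decimal` is used instead of floats so that very small per-request amounts are not distorted by binary rounding when summed over many requests.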

Supported Features

Supported (7)

Image Input
Seed
Frequency Penalty
Presence Penalty
Response Format
Tool Usage
Structured Outputs

Unsupported (9)

Top K
Repetition Penalty
Min P
Logit Bias
Logprobs
Top Logprobs
Reasoning
Web Search Options
Top A

Actual Usage Statistics

Rank: #17 out of 345 total models
Total Tokens Last 30 Days: 124.3B
Daily Average Usage: 4.1B
Weekly Usage Change: 14%

[Chart: Usage Trend for the Last 30 Days]

Models by Same Author (google)

| Model | Context Length | Price (prompt / output) |
| --- | --- | --- |
| Gemini 2.5 Flash Lite Preview 06-17 | 1,048,576 tokens | $0.10 / $0.40 |
| Gemini 2.5 Flash | 1,048,576 tokens | $0.30 / $2.50 |
| Gemini 2.5 Pro | 1,048,576 tokens | $1.25 / $10.00 |
| Gemini 2.5 Pro Preview 06-05 | 1,048,576 tokens | $1.25 / $10.00 |
| Gemma 1 2B | 8,192 tokens | $0.00 / $0.00 |

Similar Price Range Models

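| Model | Developer | Context Length | Price (prompt / output) |
| --- | --- | --- | --- |
| Command R7B (12-2024) | cohere | 128,000 tokens | $0.04 / $0.15 |
| Qwen3 8B | qwen | 128,000 tokens | $0.04 / $0.14 |
| Nova Micro 1.0 | amazon | 128,000 tokens | $0.04 / $0.14 |
| Mistral Small 3.1 24B | mistralai | 131,072 tokens | $0.05 / $0.15 |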
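
To see how small the price differences in this range are in practice, the following sketch compares the cost of a single workload across the models listed here. It assumes each price pair is prompt / output in USD per 1M tokens, matching the units in the Pricing Information section; `workload_cost` and `rank_by_cost` are hypothetical helper names, and the comparison ignores quality, latency, and context-length differences.

```python
# (prompt, output) prices in USD per 1M tokens, copied from this page.
# Assumption: every "$x / $y" pair uses the same prompt / output order.
PRICES = {
    "Gemini 1.5 Flash 8B": (0.04, 0.15),
    "Command R7B (12-2024)": (0.04, 0.15),
    "Qwen3 8B": (0.04, 0.14),
    "Nova Micro 1.0": (0.04, 0.14),
    "Mistral Small 3.1 24B": (0.05, 0.15),
}


def workload_cost(prices, prompt_tokens, output_tokens):
    """Return the USD cost of a workload at the given (prompt, output) prices."""
    prompt_price, output_price = prices
    return (prompt_tokens * prompt_price + output_tokens * output_price) / 1_000_000


def rank_by_cost(prompt_tokens, output_tokens):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        name: workload_cost(prices, prompt_tokens, output_tokens)
        for name, prices in PRICES.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Example workload: 100M prompt tokens and 10M output tokens.
    for name, cost in rank_by_cost(100_000_000, 10_000_000):
        print(f"{name}: ${cost:.2f}")
```

For prompt-heavy workloads the prompt price dominates, so a $0.01 difference in prompt price matters more than the same difference in output price.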