Gemini 1.5 Flash 8B Check detailed information and pricing for AI models
Context Length 1,000,000 tokens, google from provided
1,000,000
Context Tokens
$0.04
Prompt Price
$0.15
Output Price
7/16
Feature Support
translation #2
technology #16
Model Overview
Gemini Flash 1.5 8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective for real-time and large-scale operations. This model focuses on cost-effective solutions while maintaining high-quality results. [Click here to learn more about this model](https://developers.googleblog.com/en/gemini-15-flash-8b-is-now-generally-available-for-use/). Usage of Gemini is subject to Google's [Gemini Terms of Use](https://ai.google.dev/terms).
Basic Information
Developer
google
Model Series
Gemini
Release Date
2024-10-03
Context Length
1,000,000 tokens
Max Completion Tokens
8,192 tokens
Variant
standard
Pricing Information
Prompt Tokens
$0.04 / 1M tokens
Completion Tokens
$0.15 / 1M tokens
Data Policy
Supported Features
Supported (7)
Image Input
Seed
Frequency Penalty
Presence Penalty
Response Format
Tool Usage
Structured Outputs
Unsupported (9)
Top K
Repetition Penalty
Min P
Logit Bias
Logprobs
Top Logprobs
Reasoning
Web Search Options
Top A
Actual Usage Statistics
#17
Out of 345 total models
124.3B
Total Tokens Last 30 Days
4.1B
Daily Average Usage
14%
Weekly Usage Change