Qwen3 8B Check detailed information and pricing for AI models
Context Length 128,000 tokens, qwen from provided
128,000
Context Tokens
$0.04
Prompt Price
$0.14
Output Price
8/16
Feature Support
Reasoning #15
Model Overview
Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.
Basic Information
Developer
qwen
Model Series
Qwen3
Release Date
2025-04-28
Context Length
128,000 tokens
Max Completion Tokens
20,000 tokens
Variant
standard
Pricing Information
Prompt Tokens
$0.04 / 1M tokens
Completion Tokens
$0.14 / 1M tokens
Data Policy
Supported Features
Supported (8)
Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias
Reasoning
Unsupported (8)
Image Input
Response Format
Tool Usage
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A
Other Variants
Actual Usage Statistics
#137
Out of 345 total models
1.4B
Total Tokens Last 30 Days
48.10M
Daily Average Usage
33%
Weekly Usage Change