Qwen3 8B Check detailed information and pricing for AI models

Context Length 128,000 tokens, qwen from provided

128,000
Context Tokens
$0.04
Prompt Price
$0.14
Output Price
8/16
Feature Support
Reasoning #15

Model Overview

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.

Basic Information

Developer
qwen
Model Series
Qwen3
Release Date
2025-04-28
Context Length
128,000 tokens
Max Completion Tokens
20,000 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.04 / 1M tokens
Completion Tokens
$0.14 / 1M tokens

Supported Features

Supported (8)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias
Reasoning

Unsupported (8)

Image Input
Response Format
Tool Usage
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A

Other Variants

Actual Usage Statistics

#137
Out of 345 total models
1.4B
Total Tokens Last 30 Days
48.10M
Daily Average Usage
33%
Weekly Usage Change

Usage Trend for the Last 30 Days

Models by Same Author (qwen)

Qwen3 0.6B
32,000 tokens
$0.00 / $0.00
Qwen3 1.7B
32,000 tokens
$0.00 / $0.00
Qwen3 4B
128,000 tokens
$0.00 / $0.00
Qwen3 30B A3B (free)
40,960 tokens
Free
Qwen3 30B A3B
40,960 tokens
$0.08 / $0.29

Similar Price Range Models

Nova Micro 1.0
amazon
128,000 tokens
$0.04 / $0.14
Command R7B (12-2024)
cohere
128,000 tokens
$0.04 / $0.15
Gemini 1.5 Flash 8B
google
1,000,000 tokens
$0.04 / $0.15
Mistral Small 3.1 24B
mistralai
131,072 tokens
$0.05 / $0.15