Qwen3 8B

Context length: 128,000 tokens, provided by qwen

Context Tokens: 128,000
Prompt Price: $0.04 / 1M tokens
Output Price: $0.14 / 1M tokens
Feature Support: 8/16
Reasoning #4

Model Overview

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.
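The relationship between the native and extended context windows can be sketched as a simple ratio. The snippet below is a minimal illustration, assuming the "32K" and "131K" figures above stand for 32,768 and 131,072 tokens. The `rope_scaling` dictionary follows the shape commonly seen in Hugging Face style model configs; its key names are an assumption and should be checked against the serving framework actually in use.

```python
# Assumed exact token counts behind the "32K" and "131K" figures above.
NATIVE_CONTEXT = 32_768
EXTENDED_CONTEXT = 131_072


def yarn_factor(native: int, target: int) -> float:
    """Return the scaling factor needed to stretch `native` to `target` tokens."""
    if native <= 0 or target < native:
        raise ValueError("native must be positive and target must be >= native")
    return target / native


factor = yarn_factor(NATIVE_CONTEXT, EXTENDED_CONTEXT)

# Hypothetical config fragment: key names are an assumption, not taken from this page.
rope_scaling = {
    "rope_type": "yarn",
    "factor": factor,
    "original_max_position_embeddings": NATIVE_CONTEXT,
}

print(rope_scaling)
```

Under these assumptions the factor comes out to 4.0. A smaller factor may be preferable when prompts rarely exceed the native window.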

Basic Information

Developer: qwen
Model Series: Qwen3
Release Date: 2025-04-28
Context Length: 128,000 tokens
Max Completion Tokens: 20,000 tokens
Variant: standard

Pricing Information

Prompt Tokens: $0.04 / 1M tokens
Completion Tokens: $0.14 / 1M tokens
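As a rough illustration of how the per-million-token prices translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name `estimate_cost` and the example token counts are hypothetical; only the two prices come from this page, and actual billing may round or meter differently.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (from the pricing section above).
PROMPT_PRICE_PER_M = Decimal("0.04")
COMPLETION_PRICE_PER_M = Decimal("0.14")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(prompt_tokens) * PROMPT_PRICE_PER_M
        + Decimal(completion_tokens) * COMPLETION_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Example: a 100,000-token prompt with a 20,000-token completion.
cost = estimate_cost(100_000, 20_000)
print(f"${cost}")
```

`Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding noise.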

Supported Features

Supported (8)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias
Reasoning

Unsupported (8)

Image Input
Response Format
Tool Usage
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A
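One way to act on the lists above is to drop unsupported options from a request before sending it. The sketch below is a minimal example under stated assumptions: the snake_case keys (`logprobs`, `top_a`, and so on) are hypothetical names for the features listed, not a confirmed API schema, and `split_params` is an illustrative helper rather than part of any client library. Image Input is left out because it describes message content rather than a request parameter.

```python
# Hypothetical parameter keys for the features listed as unsupported above.
UNSUPPORTED_PARAMS = {
    "response_format",
    "tools",
    "logprobs",
    "top_logprobs",
    "structured_outputs",
    "web_search_options",
    "top_a",
}


def split_params(params: dict) -> tuple[dict, list[str]]:
    """Split request parameters into those to keep and the names of those dropped."""
    kept = {k: v for k, v in params.items() if k not in UNSUPPORTED_PARAMS}
    dropped = sorted(k for k in params if k in UNSUPPORTED_PARAMS)
    return kept, dropped


requested = {"top_k": 40, "seed": 7, "logprobs": True, "top_a": 0.5}
kept, dropped = split_params(requested)
print("sending:", kept)
print("dropped:", dropped)
```

Reporting the dropped names, rather than silently discarding them, makes it easier to notice when a request relies on a feature this model does not offer.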

Actual Usage Statistics

Rank: #88 out of 353 total models
Total Tokens Last 30 Days: 9.2B
Daily Average Usage: 306.04M
Weekly Usage Change: 16%

Usage Trend for the Last 30 Days

Models by Same Author (qwen)

Model | Context Length | Prompt / Output Price
Qwen3 Next 80B A3B Thinking | 262,144 tokens | $0.10 / $0.39
Qwen3 Next 80B A3B Instruct | 262,144 tokens | $0.10 / $0.39
Qwen Plus 0728 | 1,000,000 tokens | $0.40 / $1.20
Qwen Plus 0728 (thinking) | 1,000,000 tokens | $0.40 / $4.00
Qwen3 Max | 256,000 tokens | $1.20 / $6.00

Similar Price Range Models

Model | Developer | Context Length | Prompt / Output Price
Devstral Small 2505 | mistralai | 131,072 tokens | $0.04 / $0.14
Qwen3 30B A3B | qwen | 40,960 tokens | $0.04 / $0.14
GLM Z1 32B | thudm | 32,768 tokens | $0.04 / $0.14
Qwen2.5 VL 32B Instruct | qwen | 16,384 tokens | $0.04 / $0.14
Gemma 3 12B | google | 96,000 tokens | $0.04 / $0.14