Qwen3 32B

Context length: 40,960 tokens. Provided by qwen.

Context Tokens
40,960
Prompt Price
$0.10 / 1M tokens
Output Price
$0.30 / 1M tokens
Feature Support
9 of 16

Model Overview

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.
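
The relationship between the native and extended context lengths can be expressed as a single scaling factor. The sketch below assumes that "32K" and "131K" stand for 32,768 and 131,072 tokens and that the YaRN factor is simply the ratio of the two; the function name is illustrative and not part of any Qwen tooling.

```python
# Assumed exact values behind the rounded figures in the overview.
NATIVE_CONTEXT = 32_768     # "32K" native context (assumption)
EXTENDED_CONTEXT = 131_072  # "131K" extended context (assumption)


def yarn_scaling_factor(native: int, target: int) -> float:
    """Return the ratio by which the native context must be stretched."""
    if native <= 0:
        raise ValueError("native context must be positive")
    if target < native:
        raise ValueError("target context is shorter than the native context")
    return target / native


factor = yarn_scaling_factor(NATIVE_CONTEXT, EXTENDED_CONTEXT)
print(factor)  # 4.0
```

Under these assumptions, a fourfold extension covers the full 131K range; shorter targets would need a proportionally smaller factor.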

Basic Information

Developer
qwen
Model Series
Qwen3
Release Date
2025-04-28
Context Length
40,960 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.10 / 1M tokens
Completion Tokens
$0.30 / 1M tokens
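
As a rough illustration of how the per-million-token prices translate into a per-request cost, the following sketch multiplies token counts by the listed rates. The function names are hypothetical, and the context check assumes that prompt and completion tokens share the 40,960-token window, which the listing does not state explicitly.

```python
from decimal import Decimal

# Prices and context length copied from the listing above.
PROMPT_PRICE_PER_M = Decimal("0.10")      # USD per 1M prompt tokens
COMPLETION_PRICE_PER_M = Decimal("0.30")  # USD per 1M completion tokens
CONTEXT_LIMIT = 40_960                    # tokens

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed prices."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (PROMPT_PRICE_PER_M * prompt_tokens
             + COMPLETION_PRICE_PER_M * completion_tokens)
    return total / TOKENS_PER_MILLION


def fits_context(prompt_tokens: int, completion_tokens: int) -> bool:
    """Assumption: prompt and completion together must fit the window."""
    return prompt_tokens + completion_tokens <= CONTEXT_LIMIT


print(estimate_cost(30_000, 2_000))  # 0.0036
print(fits_context(30_000, 2_000))   # True
```

Decimal is used instead of float so that small per-request amounts add up without rounding drift when summed over many calls.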

Supported Features

Supported (9)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Response Format
Min P
Tool Usage
Reasoning

Unsupported (7)

Image Input
Logit Bias
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A
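
One way to use the two lists above is to strip unsupported options from a request before sending it. This is a minimal sketch: the snake_case keys are assumed spellings of the feature names on this page (for example, "Tool Usage" is mapped to a hypothetical `tools` key), not a confirmed API schema, and Image Input is omitted because it is a content type rather than a parameter.

```python
# Hypothetical parameter keys derived from the "Unsupported" list above.
UNSUPPORTED_PARAMS = {
    "logit_bias",
    "logprobs",
    "top_logprobs",
    "structured_outputs",
    "web_search_options",
    "top_a",
}


def split_params(params: dict) -> tuple[dict, dict]:
    """Separate a request's options into (kept, dropped).

    Only keys known to be unsupported are dropped; anything else,
    including keys this page does not mention, passes through.
    """
    kept = {k: v for k, v in params.items() if k not in UNSUPPORTED_PARAMS}
    dropped = {k: v for k, v in params.items() if k in UNSUPPORTED_PARAMS}
    return kept, dropped


request = {"top_k": 40, "seed": 7, "min_p": 0.05, "logprobs": True, "top_a": 0.2}
kept, dropped = split_params(request)
print(sorted(kept))     # ['min_p', 'seed', 'top_k']
print(sorted(dropped))  # ['logprobs', 'top_a']
```

Returning the dropped options rather than discarding them silently makes it easier to log or warn about settings that would have no effect.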

Actual Usage Statistics

Rank
#50 out of 346 total models
Total Tokens Last 30 Days
17.6B
Daily Average Usage
585.02M
Weekly Usage Change
65%

Models by Same Author (qwen)

Model                   Context Length      Prompt / Output Price (per 1M tokens)
Qwen3 0.6B              32,000 tokens       $0.00 / $0.00
Qwen3 1.7B              32,000 tokens       $0.00 / $0.00
Qwen3 4B                128,000 tokens      $0.00 / $0.00
Qwen3 30B A3B (free)    40,960 tokens       Free
Qwen3 30B A3B           40,960 tokens       $0.08 / $0.29

Similar Price Range Models

Model                     Developer       Context Length      Prompt / Output Price (per 1M tokens)
Llama 3.1 70B Instruct    meta-llama      131,072 tokens      $0.10 / $0.28
Hermes 3 70B Instruct     nousresearch    131,072 tokens      $0.12 / $0.30
Gemini 1.5 Flash          google          1,000,000 tokens    $0.08 / $0.30
Llama 4 Scout             meta-llama      1,048,576 tokens    $0.08 / $0.30
Gemini 2.0 Flash Lite     google          1,048,576 tokens    $0.08 / $0.30