Qwen3 32B
Context length 40,960 tokens, provided by qwen
Context Tokens
40,960
Prompt Price
$0.10 / 1M tokens
Output Price
$0.30 / 1M tokens
Feature Support
9 of 16 features supported
Model Overview
Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.
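The jump from the native window to the extended one comes down to a single scaling factor. The sketch below works that factor out, assuming "32K" and "131K" mean 32,768 and 131,072 tokens. The `rope_scaling` key names follow the Hugging Face configuration convention and are not stated on this page, so treat them as an assumption and check them against the inference framework in use.

```python
NATIVE_CONTEXT = 32_768      # assumed exact value of "32K"
EXTENDED_CONTEXT = 131_072   # assumed exact value of "131K"


def yarn_rope_scaling(native: int, target: int) -> dict:
    """Build a rope_scaling entry that stretches `native` positions to `target`."""
    if target <= native:
        raise ValueError("target context must exceed the native context")
    return {
        "rope_type": "yarn",                         # assumed key name and value
        "factor": target / native,                   # how far positions are stretched
        "original_max_position_embeddings": native,  # assumed key name
    }


config_patch = {"rope_scaling": yarn_rope_scaling(NATIVE_CONTEXT, EXTENDED_CONTEXT)}
print(config_patch["rope_scaling"]["factor"])  # 4.0
```

A factor this large is usually only worth enabling when prompts actually exceed the native window, since the listing itself advertises a 40,960-token context for this deployment.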
Basic Information
Developer
qwen
Model Series
Qwen3
Release Date
2025-04-28
Context Length
40,960 tokens
Variant
standard
Pricing Information
Prompt Tokens
$0.10 / 1M tokens
Completion Tokens
$0.30 / 1M tokens
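As a rough illustration of how these rates translate into per-request cost, the sketch below applies the two listed prices to a pair of token counts. The function name and the example token counts are hypothetical, and any fees or discounts outside the listed rates are not modeled.

```python
from decimal import Decimal

PROMPT_PRICE_PER_M = Decimal("0.10")      # USD per 1M prompt tokens (from the listing)
COMPLETION_PRICE_PER_M = Decimal("0.30")  # USD per 1M completion tokens (from the listing)
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request at the listed rates."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    prompt_cost = PROMPT_PRICE_PER_M * prompt_tokens / TOKENS_PER_M
    completion_cost = COMPLETION_PRICE_PER_M * completion_tokens / TOKENS_PER_M
    return prompt_cost + completion_cost


# Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
print(estimate_cost(10_000, 2_000))
```

`Decimal` is used here so that small per-request amounts add up without binary floating-point drift when many requests are summed.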
Supported Features
Supported (9)
Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Response Format
Min P
Tool Usage
Reasoning
Unsupported (7)
Image Input
Logit Bias
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A
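One practical use of the two lists above is to strip unsupported options from a request before sending it. The sketch below is a minimal example of that idea. The snake_case field names are an assumption based on common OpenAI-style request bodies and do not appear on this page, and `split_params` is a hypothetical helper. Image Input is left out because it describes message content rather than a request parameter.

```python
# Request fields assumed to correspond to the "Unsupported" features listed above.
UNSUPPORTED_PARAMS = {
    "logit_bias",
    "logprobs",
    "top_logprobs",
    "structured_outputs",
    "web_search_options",
    "top_a",
}


def split_params(params: dict) -> tuple[dict, list[str]]:
    """Return (params safe to send, sorted names of params that were dropped)."""
    kept = {name: value for name, value in params.items()
            if name not in UNSUPPORTED_PARAMS}
    dropped = sorted(name for name in params if name in UNSUPPORTED_PARAMS)
    return kept, dropped


# Hypothetical sampling options mixing supported and unsupported features.
requested = {"top_k": 20, "min_p": 0.0, "seed": 7, "logprobs": True, "top_a": 0.5}
kept, dropped = split_params(requested)
print(kept)
print(dropped)
```

Returning the dropped names instead of silently discarding them lets the caller log or surface which options had no effect.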
Actual Usage Statistics
Rank
#50 out of 346 total models
Total Tokens Last 30 Days
17.6B
Daily Average Usage
585.02M
Weekly Usage Change
65%