Qwen3 32B Check detailed information and pricing for AI models

Context Length 40,960 tokens, qwen from provided

40,960
Context Tokens
$0.03
Prompt Price
$0.13
Output Price
10/16
Feature Support

Model Overview

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.

Basic Information

Developer
qwen
Model Series
Qwen3
Release Date
2025-04-28
Context Length
40,960 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.03 / 1M tokens
Completion Tokens
$0.13 / 1M tokens

Data Policy

Terms of Service

학습 정책

1

Supported Features

Supported (10)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias
Logprobs
Top Logprobs
Reasoning

Unsupported (6)

Image Input
Response Format
Tool Usage
Structured Outputs
Web Search Options
Top A

Other Variants

Actual Usage Statistics

#40
Out of 353 total models
58.5B
Total Tokens Last 30 Days
2.0B
Daily Average Usage
13%
Weekly Usage Change

Usage Trend for the Last 30 Days

Models by Same Author (qwen)

Qwen3 Next 80B A3B Thinking
262,144 tokens
$0.10 / $0.39
Qwen3 Next 80B A3B Instruct
262,144 tokens
$0.10 / $0.39
Qwen Plus 0728
1,000,000 tokens
$0.40 / $1.20
Qwen Plus 0728 (thinking)
1,000,000 tokens
$0.40 / $4.00
Qwen3 Max
256,000 tokens
$1.20 / $6.00

Similar Price Range Models

InternVL3 78B
opengvlab
32,768 tokens
$0.03 / $0.13
R1 Distill Llama 70B
deepseek
131,072 tokens
$0.03 / $0.13
Dolphin3.0 Mistral 24B
cognitivecomputations
32,768 tokens
$0.03 / $0.11