Qwen3 8B

Context length: 128,000 tokens. Provided by qwen.

Context Tokens

128,000

Prompt Price

$0.04 / 1M tokens

Output Price

$0.14 / 1M tokens

Feature Support

8/16 features supported

Reasoning #4

Model Overview

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.

Basic Information

Developer

qwen

Model Series

Qwen3

Release Date

2025-04-28

Context Length

128,000 tokens

Max Completion Tokens

20,000 tokens

Variant

standard
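The two limits listed above interact when sizing a request. A minimal sketch, assuming the 128,000-token context length is shared between prompt and completion (the listing does not state this explicitly); the function name is hypothetical:

```python
# Values taken from the listing above.
CONTEXT_LENGTH = 128_000
MAX_COMPLETION_TOKENS = 20_000


def max_prompt_tokens(requested_completion_tokens: int) -> int:
    """Return the largest prompt that still leaves room for the completion.

    Assumption: prompt and completion share one context window.
    """
    if not 0 < requested_completion_tokens <= MAX_COMPLETION_TOKENS:
        raise ValueError(
            f"completion must be between 1 and {MAX_COMPLETION_TOKENS} tokens"
        )
    return CONTEXT_LENGTH - requested_completion_tokens


if __name__ == "__main__":
    # Room left for the prompt when asking for the full completion budget.
    print(max_prompt_tokens(20_000))
```

Under that assumption, a request that asks for the full 20,000-token completion leaves 108,000 tokens for the prompt.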

Pricing Information

Prompt Tokens

$0.04 / 1M tokens

Completion Tokens

$0.14 / 1M tokens
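As a rough illustration of how the per-million-token prices above translate into a per-request cost, here is a minimal sketch. The function name is hypothetical, and it assumes billing is strictly linear in token count with no minimum charge or rounding, which the listing does not confirm.

```python
from decimal import Decimal

# Prices from the listing above, in USD per 1M tokens.
PROMPT_PRICE_PER_M = Decimal("0.04")
COMPLETION_PRICE_PER_M = Decimal("0.14")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (assumes linear billing)."""
    prompt_cost = Decimal(prompt_tokens) / TOKENS_PER_M * PROMPT_PRICE_PER_M
    completion_cost = (
        Decimal(completion_tokens) / TOKENS_PER_M * COMPLETION_PRICE_PER_M
    )
    return prompt_cost + completion_cost


if __name__ == "__main__":
    # 100,000 prompt tokens plus 20,000 completion tokens.
    print(estimate_cost(100_000, 20_000))
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding noise.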

Supported Features

Supported (8)

Top K

Seed

Frequency Penalty

Presence Penalty

Repetition Penalty

Min P

Logit Bias

Reasoning

Unsupported (8)

Image Input

Response Format

Tool Usage

Logprobs

Top Logprobs

Structured Outputs

Web Search Options

Top A
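The supported and unsupported lists above can be used to screen request parameters before sending them. The sketch below is illustrative only: the snake_case keys (for example `top_k`, `tools`) are hypothetical names derived from the feature labels, not confirmed parameter names of any particular API.

```python
# Hypothetical parameter keys derived from the feature labels above.
SUPPORTED = frozenset({
    "top_k", "seed", "frequency_penalty", "presence_penalty",
    "repetition_penalty", "min_p", "logit_bias", "reasoning",
})
UNSUPPORTED = frozenset({
    "image_input", "response_format", "tools", "logprobs",
    "top_logprobs", "structured_outputs", "web_search_options", "top_a",
})


def check_params(params: dict) -> tuple[dict, list[str], list[str]]:
    """Split params into accepted, listed-as-unsupported, and unlisted keys."""
    accepted = {k: v for k, v in params.items() if k in SUPPORTED}
    unsupported = sorted(k for k in params if k in UNSUPPORTED)
    unknown = sorted(
        k for k in params if k not in SUPPORTED and k not in UNSUPPORTED
    )
    return accepted, unsupported, unknown


if __name__ == "__main__":
    accepted, unsupported, unknown = check_params(
        {"top_k": 40, "seed": 7, "logprobs": True, "temperature": 0.7}
    )
    print(accepted, unsupported, unknown)
```

Keys that appear in neither list (such as `temperature` here) are reported separately rather than rejected, since the listing says nothing about them.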

Other Variants

Qwen3 8B (free)

Variant: free

Price: Free

Actual Usage Statistics

Usage Rank

#88 out of 353 total models

Total Tokens Last 30 Days

9.2B

Daily Average Usage

306.04M tokens

Weekly Usage Change

16%

[Chart: usage trend for the last 30 days]

Models by Same Author (qwen)

Qwen3 Next 80B A3B Thinking

262,144 tokens

$0.10 / $0.39

Qwen3 Next 80B A3B Instruct

262,144 tokens

$0.10 / $0.39

Qwen Plus 0728

1,000,000 tokens

$0.40 / $1.20

Qwen Plus 0728 (thinking)

1,000,000 tokens

$0.40 / $4.00
