GLM 4 32B Check detailed information and pricing for AI models

Context Length 32,000 tokens, thudm from provided

32,000
Context Tokens
$0.55
Prompt Price
$1.66
Output Price
7/16
Feature Support

Model Overview

GLM-4-32B-0414 is a 32B bilingual (Chinese-English) open-weight language model optimized for code generation, function calling, and agent-style tasks. Pretrained on 15T of high-quality and reasoning-heavy data, it was further refined using human preference alignment, rejection sampling, and reinforcement learning. The model excels in complex reasoning, artifact generation, and structured output tasks, achieving performance comparable to GPT-4o and DeepSeek-V3-0324 across several benchmarks.

Basic Information

Developer
thudm
Model Series
Other
Release Date
2025-04-17
Context Length
32,000 tokens
Max Completion Tokens
32,000 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.55 / 1M tokens
Completion Tokens
$1.66 / 1M tokens

Supported Features

Supported (7)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias

Unsupported (9)

Image Input
Response Format
Tool Usage
Logprobs
Top Logprobs
Structured Outputs
Reasoning
Web Search Options
Top A

Other Variants

Actual Usage Statistics

#271
Out of 353 total models
122.72M
Total Tokens Last 30 Days
4.09M
Daily Average Usage
448%
Weekly Usage Change

Usage Trend for the Last 30 Days

Models by Same Author (thudm)

GLM 4.1V 9B Thinking
65,536 tokens
$0.04 / $0.14
GLM Z1 Rumination 32B
32,000 tokens
$0.00 / $0.00
GLM Z1 9B
32,000 tokens
$0.00 / $0.00
GLM 4 9B
32,000 tokens
$0.00 / $0.00
GLM Z1 32B (free)
32,768 tokens
Free

Similar Price Range Models

Llama 3.1 Nemotron Ultra 253B v1
nvidia
131,072 tokens
$0.60 / $1.80
GLM 4.5V
z-ai
65,536 tokens
$0.50 / $1.80
Command R
cohere
128,000 tokens
$0.50 / $1.50
Magistral Small 2506
mistralai
40,000 tokens
$0.50 / $1.50
Command R (03-2024)
cohere
128,000 tokens
$0.50 / $1.50