GLM 4.5 Air Check detailed information and pricing for AI models

Context Length 131,072 tokens, z-ai from provided

131,072
Context Tokens
$0.14
Prompt Price
$0.86
Output Price
2/16
Feature Support
translation #10
programming #10
academia #10
trivia #10
roleplay #10
finance #10
science #10
health #10
seo #10
legal #10
technology #10
marketing #10

Model Overview

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)

Basic Information

Developer
z-ai
Model Series
Other
Release Date
2025-07-25
Context Length
131,072 tokens
Max Completion Tokens
131,072 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.14 / 1M tokens
Completion Tokens
$0.86 / 1M tokens

Supported Features

Supported (2)

Tool Usage
Reasoning

Unsupported (14)

Image Input
Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Response Format
Min P
Logit Bias
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A

Other Variants

Actual Usage Statistics

#83
Out of 353 total models
11.3B
Total Tokens Last 30 Days
490.28M
Daily Average Usage
38%
Weekly Usage Change

Usage Trend for the Last 30 Days

Models by Same Author (z-ai)

GLM 4.5V
65,536 tokens
$0.50 / $1.80
GLM 4.5
131,072 tokens
$0.33 / $1.32
GLM 4 32B
128,000 tokens
$0.10 / $0.10