GLM 4.5 Air Check detailed information and pricing for AI models
Context Length 131,072 tokens, z-ai from provided
131,072
Context Tokens
$0.14
Prompt Price
$0.86
Output Price
2/16
Feature Support
translation #10
programming #10
academia #10
trivia #10
roleplay #10
finance #10
science #10
health #10
seo #10
legal #10
technology #10
marketing #10
Model Overview
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)
Basic Information
Developer
z-ai
Model Series
Other
Release Date
2025-07-25
Context Length
131,072 tokens
Max Completion Tokens
131,072 tokens
Variant
standard
Pricing Information
Prompt Tokens
$0.14 / 1M tokens
Completion Tokens
$0.86 / 1M tokens
Supported Features
Supported (2)
Tool Usage
Reasoning
Unsupported (14)
Image Input
Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Response Format
Min P
Logit Bias
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A
Other Variants
Actual Usage Statistics
#83
Out of 353 total models
11.3B
Total Tokens Last 30 Days
490.28M
Daily Average Usage
38%
Weekly Usage Change