GLM 4.5V Check detailed information and pricing for AI models

Context Length 65,536 tokens, z-ai from provided

65,536
Context Tokens
$0.50
Prompt Price
$1.80
Output Price
10/16
Feature Support

Model Overview

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding, image Q&A, OCR, and document parsing, with strong gains in front-end web coding, grounding, and spatial reasoning. It offers a hybrid inference mode: a "thinking mode" for deep reasoning and a "non-thinking mode" for fast responses. Reasoning behavior can be toggled via the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)

Basic Information

Developer
z-ai
Model Series
Other
Release Date
2025-08-11
Context Length
65,536 tokens
Max Completion Tokens
65,536 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.50 / 1M tokens
Completion Tokens
$1.80 / 1M tokens

Supported Features

Supported (10)

Image Input
Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias
Tool Usage
Reasoning

Unsupported (6)

Response Format
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A

Actual Usage Statistics

#159
Out of 353 total models
1.3B
Total Tokens Last 30 Days
149.76M
Daily Average Usage
-
Weekly Usage Change

Usage Trend for the Last 30 Days

Models by Same Author (z-ai)

GLM 4.5
131,072 tokens
$0.33 / $1.32
GLM 4.5 Air (free)
131,072 tokens
Free
GLM 4.5 Air
131,072 tokens
$0.14 / $0.86
GLM 4 32B
128,000 tokens
$0.10 / $0.10

Similar Price Range Models

Llama 3.1 Nemotron Ultra 253B v1
nvidia
131,072 tokens
$0.60 / $1.80
GLM 4 32B
thudm
32,000 tokens
$0.55 / $1.66
Command R
cohere
128,000 tokens
$0.50 / $1.50
Magistral Small 2506
mistralai
40,000 tokens
$0.50 / $1.50
Command R (03-2024)
cohere
128,000 tokens
$0.50 / $1.50