Hermes 4 70B Check detailed information and pricing for AI models

Context Length 131,072 tokens, nousresearch from provided

131,072
Context Tokens
$0.09
Prompt Price
$0.37
Output Price
11/16
Feature Support

Model Overview

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either respond directly or generate explicit <think>...</think> reasoning traces before answering. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config) This 70B variant is trained with the expanded post-training corpus (~60B tokens) emphasizing verified reasoning data, leading to improvements in mathematics, coding, STEM, logic, and structured outputs while maintaining general assistant performance. It supports JSON mode, schema adherence, function calling, and tool use, and is designed for greater steerability with reduced refusal rates.

Basic Information

Developer
nousresearch
Model Series
Llama3
Release Date
2025-08-26
Context Length
131,072 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.09 / 1M tokens
Completion Tokens
$0.37 / 1M tokens

Data Policy

Terms of Service

학습 정책

1

Supported Features

Supported (11)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias
Tool Usage
Logprobs
Top Logprobs
Reasoning

Unsupported (5)

Image Input
Response Format
Structured Outputs
Web Search Options
Top A

Actual Usage Statistics

No recent usage data available.

Models by Same Author (nousresearch)

Hermes 4 405B
131,072 tokens
$0.20 / $0.80
DeepHermes 3 Mistral 24B Preview (free)
32,768 tokens
Free
DeepHermes 3 Mistral 24B Preview
32,768 tokens
$0.09 / $0.37
DeepHermes 3 Llama 3 8B Preview (free)
131,072 tokens
Free
Hermes 3 70B Instruct
131,072 tokens
$0.10 / $0.28

Similar Price Range Models

DeepHermes 3 Mistral 24B Preview
nousresearch
32,768 tokens
$0.09 / $0.37
Gemini 2.5 Flash Lite Preview 06-17
google
1,048,576 tokens
$0.10 / $0.40
GPT-4.1 Nano
openai
1,047,576 tokens
$0.10 / $0.40
Gemini 2.0 Flash
google
1,048,576 tokens
$0.10 / $0.40
Qwen2.5 VL 72B Instruct
qwen
32,768 tokens
$0.10 / $0.40