Llama 3.3 70B Instruct

Context length 131,072 tokens, provided by meta-llama

Context Tokens: 131,072
Prompt Price: $0.05 / 1M tokens
Output Price: $0.25 / 1M tokens
Feature Support: 8/16
Overall #19
trivia #7
legal #10
roleplay #12
finance #13
science #14
seo #15
marketing #16
technology #17
academia #18

Model Overview

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction-tuned generative model with 70B parameters (text in, text out). The instruction-tuned, text-only model is optimized for multilingual dialogue use cases and outperforms many of the available open-source and closed chat models on common industry benchmarks. Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. [Model Card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_3/MODEL_CARD.md)

Basic Information

Developer: meta-llama
Model Series: Llama3
Release Date: 2024-12-06
Context Length: 131,072 tokens
Max Completion Tokens: 16,384 tokens
Variant: standard
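
The two limits above interact when sizing a request. The following is a minimal sketch, assuming that prompt and completion tokens share the 131,072-token context window (the listing does not state this explicitly). The helper name `completion_budget` is hypothetical and only illustrates the arithmetic.

```python
# Limits taken from the listing above.
CONTEXT_LENGTH = 131_072
MAX_COMPLETION_TOKENS = 16_384


def completion_budget(prompt_tokens: int, requested: int) -> int:
    """Return how many completion tokens can be requested.

    Assumption: prompt and completion share one context window, so the
    completion is capped both by the per-request maximum and by whatever
    room the prompt leaves in the window.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    if remaining <= 0:
        return 0  # the prompt alone fills (or overflows) the window
    return min(requested, MAX_COMPLETION_TOKENS, remaining)


if __name__ == "__main__":
    print(completion_budget(1_000, 20_000))     # capped by the completion maximum
    print(completion_budget(120_000, 16_384))   # capped by the remaining window
```

Under this assumption, a short prompt is limited by the completion cap, while a very long prompt is limited by the space left in the window.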

Pricing Information

Prompt Tokens: $0.05 / 1M tokens
Completion Tokens: $0.25 / 1M tokens
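
As a rough illustration of how these per-million rates translate into a per-request cost, here is a small sketch. The function name `estimate_cost` is hypothetical, and the sketch assumes billing is strictly linear in token counts with no minimums, discounts, or rounding rules, none of which the listing specifies.

```python
from decimal import Decimal

# Rates from the listing, in USD per 1M tokens.
PROMPT_PRICE_PER_M = Decimal("0.05")
COMPLETION_PRICE_PER_M = Decimal("0.25")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    prompt_cost = Decimal(prompt_tokens) * PROMPT_PRICE_PER_M
    completion_cost = Decimal(completion_tokens) * COMPLETION_PRICE_PER_M
    return (prompt_cost + completion_cost) / TOKENS_PER_M


if __name__ == "__main__":
    # A 100,000-token prompt with a maximum-length 16,384-token completion.
    print(estimate_cost(100_000, 16_384))
```

`Decimal` is used instead of floats so that small per-request amounts do not pick up binary rounding noise when summed over many requests.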

Supported Features

Supported (8)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Response Format
Min P
Tool Usage

Unsupported (8)

Image Input
Logit Bias
Logprobs
Top Logprobs
Structured Outputs
Reasoning
Web Search Options
Top A
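
One way to use the two lists above is to strip unsupported options from a request before sending it. The sketch below is illustrative only: the snake_case parameter names (for example `top_k`, `min_p`, `tools`) are assumptions based on common chat-completion conventions, not names given by this listing, and `split_params` is a hypothetical helper. Image Input is omitted because it is a content type rather than a request parameter.

```python
# Assumed parameter names for the features the listing marks as unsupported.
UNSUPPORTED_PARAMS = {
    "logit_bias",
    "logprobs",
    "top_logprobs",
    "structured_outputs",
    "reasoning",
    "web_search_options",
    "top_a",
}


def split_params(params: dict) -> tuple[dict, list[str]]:
    """Split request parameters into those to send and those to drop.

    Anything not known to be unsupported is passed through unchanged,
    since the listing does not cover every possible parameter.
    """
    kept = {k: v for k, v in params.items() if k not in UNSUPPORTED_PARAMS}
    dropped = sorted(k for k in params if k in UNSUPPORTED_PARAMS)
    return kept, dropped


if __name__ == "__main__":
    kept, dropped = split_params(
        {"seed": 7, "top_a": 0.2, "logprobs": True, "min_p": 0.05}
    )
    print(kept)     # parameters that would still be sent
    print(dropped)  # parameters removed because the listing marks them unsupported
```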

Actual Usage Statistics

#16 out of 345 total models
Total Tokens Last 30 Days: 132.2B
Daily Average Usage: 4.4B
Weekly Usage Change: 10%
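
The daily average appears to follow directly from the 30-day total. A quick check, assuming the average is simply the total divided by 30 days (the listing does not say how it is computed):

```python
# Figures from the listing above.
total_tokens_30d = 132.2e9
days = 30

# Assumed definition: a plain mean over the 30-day window.
daily_average = total_tokens_30d / days
daily_average_b = round(daily_average / 1e9, 1)  # in billions, one decimal place

print(f"{daily_average_b}B tokens per day")
```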

Models by Same Author (meta-llama)

Llama 3.3 8B Instruct (free): 128,000 tokens, Free
Llama Guard 4 12B: 163,840 tokens, $0.05 / $0.05
Llama 4 Maverick (free): 128,000 tokens, Free
Llama 4 Maverick: 1,048,576 tokens, $0.15 / $0.60
Llama 4 Scout (free): 128,000 tokens, Free

Similar Price Range Models

Qwen3 14B (qwen): 40,960 tokens, $0.06 / $0.24
Nova Lite 1.0 (amazon): 300,000 tokens, $0.06 / $0.24
Qwen-Turbo (qwen): 1,000,000 tokens, $0.05 / $0.20