UI-TARS 7B Check detailed information and pricing for AI models

Context Length 128,000 tokens, bytedance from provided

128,000
Context Tokens
$0.10
Prompt Price
$0.20
Output Price
8/16
Feature Support

Model Overview

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement learning-based reasoning, enabling robust action planning and execution across virtual interfaces. This model achieves state-of-the-art results on a range of interactive and grounding benchmarks, including OSworld, WebVoyager, AndroidWorld, and ScreenSpot. It also demonstrates perfect task completion across diverse Poki games and outperforms prior models in Minecraft agent tasks. UI-TARS-1.5 supports thought decomposition during inference and shows strong scaling across variants, with the 1.5 version notably exceeding the performance of earlier 72B and 7B checkpoints.

Basic Information

Developer
bytedance
Model Series
Other
Release Date
2025-07-22
Context Length
128,000 tokens
Max Completion Tokens
2,048 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.10 / 1M tokens
Completion Tokens
$0.20 / 1M tokens

Supported Features

Supported (8)

Image Input
Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias

Unsupported (8)

Response Format
Tool Usage
Logprobs
Top Logprobs
Structured Outputs
Reasoning
Web Search Options
Top A

Actual Usage Statistics

#226
Out of 353 total models
323.98M
Total Tokens Last 30 Days
11.17M
Daily Average Usage
38%
Weekly Usage Change

Usage Trend for the Last 30 Days

Models by Same Author (bytedance)

Seed OSS 36B Instruct
131,072 tokens
$0.20 / $0.80

Similar Price Range Models

Molmo 7B D
allenai
4,096 tokens
$0.10 / $0.20
Mistral 7B Instruct v0.1
mistralai
2,824 tokens
$0.11 / $0.19