Phi 4 Reasoning 查看AI模型的詳細資訊和價格

上下文 長度 32,768 代幣, microsoft 來自 提供

32,768
上下文權杖
$0.00
提示價格
$0.00
輸出價格
0/16
功能支援

模型介紹

Phi-4-reasoning is a 14B parameter dense decoder-only transformer developed by Microsoft, fine-tuned from Phi-4 to enhance complex reasoning capabilities. It uses a combination of supervised fine-tuning on chain-of-thought traces and reinforcement learning, targeting math, science, and code reasoning tasks. With a 32k context window and high inference efficiency, it is optimized for structured responses in a two-part format: reasoning trace followed by a final solution. The model achieves strong results on specialized benchmarks such as AIME, OmniMath, and LiveCodeBench, outperforming many larger models in structured reasoning tasks. It is released under the MIT license and intended for use in latency-constrained, English-only environments requiring reliable step-by-step logic. Recommended usage includes ChatML prompts and structured reasoning format for best results.

基本資訊

開發商
microsoft
模型系列
Other
發布日期
2025-05-01
上下文長度
32,768 令牌
變體
standard

價格資訊

提示令牌
$0.00 / 1M 代幣
完成令牌
$0.00 / 1M 代幣

支援功能

不支援 (16)

圖像輸入
Top K
種子
頻率懲罰
存在懲罰
重複懲罰
回應格式
Min P
Logit偏置
工具使用
Logprobs
Top Logprobs
結構化輸出
推理
網路搜尋選項
Top A

其他變體

實際使用量統計

暫無最近使用量資料。

同作者模型 (microsoft)

Phi 4 Reasoning Plus (free)
32,768 令牌
免費
Phi 4 Reasoning Plus
32,768 令牌
$0.07 / $0.35
MAI DS R1 (free)
163,840 令牌
免費
MAI DS R1
163,840 令牌
$0.20 / $0.80
Phi 4 Multimodal Instruct
131,072 令牌
$0.05 / $0.10

相似價位模型

Jamba 1.5 Large
ai21
256,000 令牌
$0.00 / $0.00
R1 Distill Qwen 7B
deepseek
131,072 令牌
$0.00 / $0.00
Deepseek R1 0528 Qwen3 8B (free)
deepseek
131,072 令牌
$0.00 / $0.00
Gemma 1 2B
google
8,192 令牌
$0.00 / $0.00
R1 0528 (free)
deepseek
163,840 令牌
$0.00 / $0.00