Phi 4 Reasoning 查看AI模型的詳細資訊和價格

上下文長度 32,768 代幣， microsoft 來自提供

32,768

上下文權杖

$0.00

提示價格

$0.00

輸出價格

0/16

功能支援

模型介紹

Phi-4-reasoning is a 14B parameter dense decoder-only transformer developed by Microsoft, fine-tuned from Phi-4 to enhance complex reasoning capabilities. It uses a combination of supervised fine-tuning on chain-of-thought traces and reinforcement learning, targeting math, science, and code reasoning tasks. With a 32k context window and high inference efficiency, it is optimized for structured responses in a two-part format: reasoning trace followed by a final solution. The model achieves strong results on specialized benchmarks such as AIME, OmniMath, and LiveCodeBench, outperforming many larger models in structured reasoning tasks. It is released under the MIT license and intended for use in latency-constrained, English-only environments requiring reliable step-by-step logic. Recommended usage includes ChatML prompts and structured reasoning format for best results.

基本資訊

開發商

microsoft

模型系列

Other

發布日期

2025-05-01

上下文長度

32,768 令牌

變體

standard

價格資訊

提示令牌

$0.00 / 1M 代幣

完成令牌

$0.00 / 1M 代幣

支援功能

不支援 (16)

圖像輸入

Top K

種子

頻率懲罰

存在懲罰

重複懲罰

回應格式

Min P

Logit偏置

工具使用

Logprobs

Top Logprobs

結構化輸出

推理

網路搜尋選項

Top A

其他變體

Phi 4 Reasoning (free)

free

免費

實際使用量統計

暫無最近使用量資料。

同作者模型 (microsoft)

Phi 4 Reasoning Plus (free)

32,768 令牌

免費

查看詳情

Phi 4 Reasoning Plus

32,768 令牌

$0.07 / $0.35

查看詳情

MAI DS R1 (free)

163,840 令牌

免費

查看詳情

MAI DS R1

163,840 令牌

$0.25 / $1.00

查看詳情