Phi 4 Reasoning (free) 查看AI模型的詳細資訊和價格

上下文長度 32,768 代幣， microsoft 來自提供

32,768

上下文權杖

免費

提示價格

免費

輸出價格

10/16

功能支援

模型介紹

Phi-4-reasoning is a 14B parameter dense decoder-only transformer developed by Microsoft, fine-tuned from Phi-4 to enhance complex reasoning capabilities. It uses a combination of supervised fine-tuning on chain-of-thought traces and reinforcement learning, targeting math, science, and code reasoning tasks. With a 32k context window and high inference efficiency, it is optimized for structured responses in a two-part format: reasoning trace followed by a final solution. The model achieves strong results on specialized benchmarks such as AIME, OmniMath, and LiveCodeBench, outperforming many larger models in structured reasoning tasks. It is released under the MIT license and intended for use in latency-constrained, English-only environments requiring reliable step-by-step logic. Recommended usage includes ChatML prompts and structured reasoning format for best results.

基本資訊

開發商

microsoft

模型系列

Other

發布日期

2025-05-01

上下文長度

32,768 令牌

變體

free

價格資訊

此模型可免費使用

資料政策

使用條款

학습 정책

支援功能

支援 (10)

Top K

種子

頻率懲罰

存在懲罰

重複懲罰

Min P

Logit偏置

Logprobs

Top Logprobs

推理

不支援 (6)

圖像輸入

回應格式

工具使用

結構化輸出

網路搜尋選項

Top A

其他變體

Phi 4 Reasoning

standard

$0.00 / $0.00

實際使用量統計

暫無最近使用量資料。

同作者模型 (microsoft)

Phi 4 Reasoning Plus (free)

32,768 令牌

免費

查看詳情

Phi 4 Reasoning Plus

32,768 令牌

$0.07 / $0.35

查看詳情

MAI DS R1 (free)

163,840 令牌

免費

查看詳情

MAI DS R1

163,840 令牌

$0.25 / $1.00

查看詳情

Phi 4 Multimodal Instruct

131,072 令牌

$0.05 / $0.10

查看詳情