Vercel：DeepSeek的Token调用量超过OpenAI，成本仅占总支出1%

发布时间：2026-06-11T20:11:23+08:00 阅读：0 分类：tech

Vercel 发布 2026 年 6 月 AI Gateway 生产指数。报告显示，得益于 5 月上线 Vercel 网关的 DeepSeek V4 系列（含 Flash 与 Pro 模型）推动，DeepSeek 的 Token 流量份额单月内从不足 1% 飙升至 17%，超越 OpenAI（13%）位居第三。然而由于定价极低，所有用户使用 DeepSeek 的总成本之和仅占网关整体资金支出的 1% 左右。

价格是 DeepSeek 迅速爆发的主因。DeepSeek V4 Flash 百万 Token 输入与输出收费仅为 0.14 美元和 0.28 美元，较 Anthropic 同类前沿模型便宜 20 至 50 倍，较 Qwen 3.6 Plus 与 Kimi K2.6 也低 8 至 12 倍。评测表明 DeepSeek V4 性能达标，促使开发团队迅速在生产中部署。

尽管低成本模型流量暴涨，但在资金消耗上，前沿模型仍占主导。5 月 Anthropic 支出份额从 61% 增至 65%，在应用生成、后台智能体及编程等高难度场景占 70% 到 80% 支出。例如在编程智能体场景，DeepSeek 贡献了 49% 的 Token 流量，但仅占 4% 的费用，而 Anthropic 以 28% 的流量耗费了 70% 的资金。

开发团队正通过智能路由管理预算，将高频低风险任务分流至低成本模型，仅在关键环节使用前沿模型。对投资回报率（ROI）的考量也减缓了模型升级。例如谷歌 5 月推出的 Gemini 3.5 Flash 定价高于 3.0 版本，导致迁移缓慢，月底时 3.0 Flash 仍占 Flash 系列 90% 的流量，而 3.5 Flash 仅占 7%。同时，AI 智能体表现出极高 Token 消耗密度，以四分之一的请求量消耗了过半 Token。

Vercel

DeepSeek enters the fight for token volume, Anthropic continues to dominate...

The June 2026 AI Gateway production index: DeepSeek's token share jumped to 17% as low-cost models entered production, while Anthropic held 65% of all spend.

13 个帖子 - 11 位参与者

阅读完整话题

来源: LinuxDo 最新话题查看原文

DeepSeek enters the fight for token volume, Anthropic continues to dominate...

延伸阅读