What is the cheapest AI model?

Gemini 2.0 Flash at $0.10/1M input tokens is one of the cheapest capable AI models available via API.

Which AI model is best for coding?

Claude Opus 4.7 and DeepSeek V3 are consistently rated highest for coding tasks in 2026.

What is the largest context window?

Gemini 2.5 Pro and GPT-4.1 both support up to 1 million tokens of context window.
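
The answers above amount to simple lookups over a price list. As a minimal sketch (not part of the original page), the snippet below picks the cheapest model by input price, optionally restricted to a category and a minimum context window. The `Model` class, the `cheapest` helper, and the lowercase category labels are hypothetical names chosen for illustration; the figures are a hand-copied subset of this page's pricing table, with "1.0M" assumed to mean 1,000,000 tokens.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Model:
    name: str
    category: str            # illustrative label: "chat", "coding", "reasoning"
    input_usd_per_m: float   # USD per 1M input tokens
    context_tokens: int      # maximum context window, in tokens


# Hand-copied subset of this page's pricing table (illustrative, not live data).
MODELS = [
    Model("Gemini 2.0 Flash", "chat", 0.10, 1_000_000),
    Model("DeepSeek V3", "coding", 0.27, 128_000),
    Model("Moonshot Kimi K1.5", "reasoning", 0.50, 128_000),
    Model("DeepSeek R1", "reasoning", 0.55, 128_000),
    Model("Gemini 2.5 Pro", "reasoning", 1.25, 1_000_000),
    Model("GPT-4.1", "chat", 2.00, 1_000_000),
    Model("Claude Opus 4.7", "reasoning", 15.00, 200_000),
]


def cheapest(
    models: Iterable[Model],
    category: Optional[str] = None,
    min_context: int = 0,
) -> Optional[Model]:
    """Return the model with the lowest input price that matches the filters."""
    candidates = [
        m
        for m in models
        if (category is None or m.category == category)
        and m.context_tokens >= min_context
    ]
    # default=None avoids a ValueError when no model matches the filters.
    return min(candidates, key=lambda m: m.input_usd_per_m, default=None)
```

Calling `cheapest(MODELS)` ranks by input price only; a workload dominated by generated text may be better served by ranking on output price instead.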

Updated May 2026 · Tracking 20+ models

AI Model Comparison: Pricing and Performance

Explore the best AI models for coding, writing, image generation, video, reasoning, and API access, all in one place.

API Pricing Table

Prices are in USD per million tokens.

Model	Vendor	Category	Input / 1M tokens	Output / 1M tokens	Context
Gemini 2.0 Flash	Google	Chat	$0.10	$0.40	1.0M
DeepSeek V3 (popular)	DeepSeek	Coding	$0.27	$1.10	128K
Qwen 2.5 72B	Alibaba	Coding	$0.40	$1.20	128K
Moonshot Kimi K1.5	Moonshot	Reasoning	$0.50	$2.00	128K
DeepSeek R1 (popular)	DeepSeek	Reasoning	$0.55	$2.19	128K
Claude Haiku 4.5	Anthropic	Chat	$0.80	$4.00	200K
Gemini 2.5 Pro (popular)	Google	Reasoning	$1.25	$10.00	1.0M
GPT-4.1 (new)	OpenAI	Chat	$2.00	$8.00	1.0M
GPT-4o (popular)	OpenAI	Chat	$2.50	$10.00	128K
Claude Sonnet 4.6 (new)	Anthropic	Chat	$3.00	$15.00	200K
GPT-5 (popular, new)	OpenAI	Reasoning	$15.00	$60.00	1.0M
Claude Opus 4.7 (popular)	Anthropic	Reasoning	$15.00	$75.00	200K
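
Because input and output tokens are priced separately per million, the cost of one request is a weighted sum of the two rates. The sketch below works through that arithmetic for three rows of the table above; the `PRICES` dictionary and the `request_cost` function are hypothetical names introduced for illustration, and the token counts are made-up example values.

```python
# USD per 1M tokens as (input, output), copied from the pricing table above.
PRICES = {
    "Gemini 2.0 Flash": (0.10, 0.40),
    "DeepSeek V3": (0.27, 1.10),
    "Claude Opus 4.7": (15.00, 75.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from per-million-token prices."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example workload (an assumption): 10,000 input and 2,000 output tokens.
    for name in PRICES:
        print(f"{name}: ${request_cost(name, 10_000, 2_000):.4f}")
```

For this example workload the estimate comes to about $0.0018 for Gemini 2.0 Flash, $0.0049 for DeepSeek V3, and $0.30 for Claude Opus 4.7. The estimate covers list prices only and ignores any caching, batching, or tiered discounts a vendor may offer.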

💬

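Chat Models

Suited to customer service, everyday conversation, office assistance, content creation, and many other scenarios; the most general-purpose form of AI capability.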

Model	Vendor	Input / 1M tokens	Output / 1M tokens	Context	Description
GPT-4o (popular)	OpenAI	$2.50	$10.00	128K	Fast multimodal model for chat, coding, and vision tasks.
Body Builder (beta)	OpenRouter	Not billed per token	—	128K	General-purpose chat model; moderate speed; API access. Vendor note: Transform your natural language requests into structured OpenRouter API request objects.
IBM: Granite 4.0 Micro	IBM	$0.017	$0.11	131K	General-purpose chat model; moderate speed; API access. Vendor note: Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models.
Meta: Llama 3.1 8B Instruct	Meta	$0.02	$0.05	16K	General-purpose chat model; moderate speed; tool calling and API access. Vendor note: Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors.
Mistral: Mistral Nemo	Mistral AI	$0.02	$0.03	131K	General-purpose chat model; moderate speed; tool calling and API access. Vendor note: A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA.
Meta: Llama 3.2 1B Instruct	Meta	$0.027	$0.20	60K	General-purpose chat model; moderate speed; API access. Vendor note: Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis.
LiquidAI: LFM2-24B-A2B	Liquid AI	$0.03	$0.12	33K	General-purpose chat model; moderate speed; API access. Vendor note: LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for efficient on-device deployment.
OpenAI: gpt-oss-20b	OpenAI	$0.03	$0.14	131K	General-purpose chat model; moderate speed; tool calling and API access. Vendor note: gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license.
Qwen: Qwen-Turbo	Alibaba	$0.0325	$0.13	131K	General-purpose chat model; fast responses; tool calling and API access. Vendor note: Qwen-Turbo, based on Qwen2.5, is a 1M context model that provides fast speed and low cost, suitable for simple tasks.
Amazon: Nova Micro 1.0	Amazon	$0.035	$0.14	128K	General-purpose chat model; moderate speed; tool calling and API access. Vendor note: Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low cost.
Cohere: Command R7B (12-2024)	Cohere	$0.0375	$0.15	128K	General-purpose chat model; moderate speed; API access. Vendor note: Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024.
OpenAI: gpt-oss-120b	OpenAI	$0.039	$0.18	131K	General-purpose chat model; moderate speed; tool calling and API access. Vendor note: gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases.

View all 319 models →

💻

Coding Models

Optimized for software development; strong at code generation, bug hunting, code review, and technical documentation.

Model	Vendor	Input / 1M tokens	Output / 1M tokens	Context	Description
DeepSeek V3 (popular)	DeepSeek	$0.27	$1.10	128K	Highly competitive MoE model with strong coding ability at low cost.
Pareto Code Router	OpenRouter	Not billed per token	—	2.0M	Code-generation model; moderate speed; API access. Vendor note: The Pareto Router maintains a tiered shortlist of strong coding models, ranked by Artificial Analysis coding percentiles.
Qwen: Qwen3 Coder 30B A3B Instruct	Alibaba	$0.07	$0.27	160K	Code-generation model; moderate speed; tool calling and API access. Vendor note: Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use.
Qwen: Qwen3 Coder Next	Alibaba	$0.11	$0.80	262K	Code-generation model; moderate speed; tool calling and API access. Vendor note: Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows.
Qwen: Qwen3 Coder Flash	Alibaba	$0.195	$0.975	1.0M	Code-generation model; fast responses; tool calling and API access. Vendor note: Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus.
xAI: Grok Code Fast 1	xAI	$0.20	$1.50	256K	Code-generation model; fast responses; tool calling and API access. Vendor note: Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding.
Qwen: Qwen3 Coder 480B A35B	Alibaba	$0.22	$1.80	262K	Code-generation model; moderate speed; tool calling and API access. Vendor note: Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team.
Kwaipilot: KAT-Coder-Pro V2	Kwaipilot	$0.30	$1.20	256K	Code-generation model; moderate speed; tool calling and API access. Vendor note: KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration.
Mistral: Codestral 2508	Mistral AI	$0.30	$0.90	256K	Code-generation model; moderate speed; tool calling and API access. Vendor note: Mistral's cutting-edge language model for coding released end of July 2025.
Qwen 2.5 72B	Alibaba	$0.40	$1.20	128K	Alibaba's open-source flagship model, with strong multilingual and coding ability.
Arcee AI: Coder Large	Arcee AI	$0.50	$0.80	33K	Code-generation model; slower but more capable; API access. Vendor note: Coder‑Large is a 32 B‑parameter offspring of Qwen 2.5‑Instruct that has been further trained on permissively‑licensed GitHub, CodeSearchNet and synthetic bug‑fix corpora.
Qwen: Qwen3 Coder Plus	Alibaba	$0.65	$3.25	1.0M	Code-generation model; slower but more capable; tool calling and API access. Vendor note: Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B.

View all 14 models →

🧠

Reasoning Models

Tuned for logical reasoning; suited to mathematics, scientific research, complex analysis, and strategic planning.

Model	Vendor	Input / 1M tokens	Output / 1M tokens	Context	Description
DeepSeek R1 (popular)	DeepSeek	$0.55	$2.19	128K	Chain-of-thought reasoning model that rivals o1 at very low cost.
Gemini 2.5 Pro (popular)	Google	$1.25	$10.00	1.0M	Google's most capable model, with a native 1-million-token context window and state-of-the-art reasoning.
GPT-5 (popular, new)	OpenAI	$15.00	$60.00	1.0M	OpenAI's most capable model, combining advanced reasoning with multimodal understanding.
Claude Opus 4.7 (popular)	Anthropic	$15.00	$75.00	200K	Claude Opus is Anthropic's most intelligent model, strong at complex reasoning and agentic tasks.
Baidu: ERNIE 4.5 21B A3B Thinking	Baidu	$0.07	$0.28	131K	Reasoning model; moderate speed; API access. Vendor note: ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.
Qwen: Qwen3 30B A3B Thinking 2507	Alibaba	$0.08	$0.40	131K	Reasoning model; moderate speed; tool calling and API access. Vendor note: Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking.
Qwen: Qwen3 Next 80B A3B Thinking	Alibaba	$0.0975	$0.78	131K	Reasoning model; moderate speed; tool calling and API access. Vendor note: Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default.
Qwen: Qwen3 VL 8B Thinking	Alibaba	$0.117	$1.365	131K	Reasoning model; moderate speed; image understanding, tool calling, and API access. Vendor note: Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences.
Qwen: Qwen3 VL 30B A3B Thinking	Alibaba	$0.13	$1.56	131K	Reasoning model; moderate speed; image understanding, tool calling, and API access. Vendor note: Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos.
Qwen: Qwen3 235B A22B Thinking 2507	Alibaba	$0.1495	$1.495	131K	Reasoning model; moderate speed; tool calling and API access. Vendor note: Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks.
Arcee AI: Trinity Large Thinking	Arcee AI	$0.22	$0.85	262K	Reasoning model; slower but more capable; tool calling and API access. Vendor note: Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI.
Qwen: Qwen Plus 0728 (thinking)	Alibaba	$0.26	$0.78	1.0M	Reasoning model; slower but more capable; tool calling and API access. Vendor note: Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

View all 31 models →

🖼️

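Image Generation

Generates high-quality images from text descriptions; suited to design, advertising assets, illustration, and artistic work.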

Model	Vendor	Input / 1M tokens	Output / 1M tokens	Context	API	Description
FLUX.1 (popular)	Black Forest Labs	Not billed per token	—	—	Yes	State-of-the-art image generation model known for photorealism and prompt accuracy.
Auto Router	OpenRouter	Not billed per token	—	2.0M	Yes	Image-generation model; moderate speed; image understanding, tool calling, and API access. Vendor note: Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output.
Google: Nano Banana (Gemini 2.5 Flash Image)	Google	$0.30	$2.50	33K	Yes	Image-generation model; fast responses; image understanding and API access. Vendor note: Gemini 2.5 Flash Image, a.k.a.
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)	Google	$0.50	$3.00	66K	Yes	Image-generation model; fast responses; image understanding and API access. Vendor note: Gemini 3.1 Flash Image Preview, a.k.a.
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)	Google	$2.00	$12.00	66K	Yes	Image-generation model; fast responses; image understanding and API access. Vendor note: Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro.
OpenAI: GPT-5 Image Mini	OpenAI	$2.50	$2.00	400K	Yes	Image-generation model; fast responses; image understanding and API access. Vendor note: GPT-5 Image Mini combines OpenAI's advanced language capabilities, powered by GPT-5 Mini, with GPT Image 1 Mini for efficient image generation.
OpenAI: GPT-5.4 Image 2	OpenAI	$8.00	$15.00	272K	Yes	Image-generation model; moderate speed; image understanding and API access. Vendor note: GPT-5.4 Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2.
OpenAI: GPT-5 Image	OpenAI	$10.00	$10.00	400K	Yes	Image-generation model; moderate speed; image understanding and API access. Vendor note: GPT-5 Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities.
Midjourney v6	Midjourney	Not billed per token	—	—	—	Produces striking artistic images with outstanding aesthetic quality.
Stable Diffusion XL	Stability AI	Not billed per token	—	—	Yes	Leading open-source image generation model that supports local deployment.

🎬

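Video Generation

Generates video from text descriptions or images; suited to advertising concepts, short-form video content, and film production support.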

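Model	Vendor	Input / 1M tokens	Output / 1M tokens	Context	API	Description
Sora (popular)	OpenAI	Not billed per token	—	—	—	Generates realistic and imaginative videos from text prompts, up to 60 seconds long.
Kling AI (popular)	Kling AI	Not billed per token	—	—	Yes	Generates high-quality video with realistic motion and smooth transitions.
Veo 2	Google	Not billed per token	—	—	—	Google's advanced video generation model, with cinematic quality and an understanding of physics.
Runway Gen-3	Runway	Not billed per token	—	—	Yes	Runway Gen-3 Alpha is a powerful video generation model that offers API access to developers.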
