AI Model Rankings

Tracks the market performance of mainstream AI models based on model popularity, real-world usage trends, and capability metrics.

Top Models

Weekly usage trends for mainstream models

LLM Leaderboard

Query call volume for different models by time range

Hy3 preview

by tencent

2.85T tokens

↑41%

DeepSeek V4 Flash

by deepseek

2.76T tokens

↑102%

Claude Sonnet 4.6

by anthropic

1.56T tokens

↑3%

Claude Opus 4.7

by anthropic

1.41T tokens

↓8%

Owl Alpha

by MGEO

1.21T tokens

↑115%

Gemini 3 Flash Preview

by google

1.12T tokens

↓1%

DeepSeek V3.2

by deepseek

1.12T tokens

↑20%

DeepSeek V4 Pro

by deepseek

950B tokens

↑8%

Step 3.5 Flash

by stepfun

790B tokens

↑43%

10.

Kimi K2.6

by moonshotai

782B tokens

↓4%

11.

MiniMax M2.7

by minimax

637B tokens

↓17%

12.

Nemotron 3 Super (free)

by nvidia

589B tokens

↓2%

13.

Gemini 2.5 Flash

by google

566B tokens

↓1%

14.

Gemini 2.5 Flash Lite

by google

550B tokens

↓8%

15.

Claude Opus 4.6

by anthropic

542B tokens

↓10%

16.

Gemini 3.1 Pro Preview

by google

502B tokens

↑73%

17.

GPT-5.5

by openai

478B tokens

↑5%

18.

gpt-oss-120b

by openai

440B tokens

↑14%

19.

Gemini 3.1 Flash Lite Preview

by google

385B tokens

↑17%

20.

GLM 5.1

by zhipu

381B tokens

↑6%
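The volumes and percentage arrows above can be turned into numbers for comparison. The following is a minimal sketch, not part of the page itself: it assumes that each arrow is the change relative to the previous period of equal length (the page does not state the comparison window) and that `B` and `T` mean billion and trillion. The helper names `parse_volume` and `previous_volume` are hypothetical.

```python
import re

# Multipliers for the suffixes used on this page (assumed: B = billion, T = trillion).
_SUFFIX = {"K": 1e3, "M": 1e6, "B": 1e9, "T": 1e12}


def parse_volume(text: str) -> float:
    """Convert a string such as '2.85T tokens' into a plain number."""
    match = re.match(r"\s*([\d.]+)\s*([KMBT])\b", text)
    if match is None:
        raise ValueError(f"unrecognized volume: {text!r}")
    return float(match.group(1)) * _SUFFIX[match.group(2)]


def previous_volume(current: float, change_pct: float) -> float:
    """Back out the prior-period volume from the current volume and the % change.

    Assumes change_pct is measured against the prior period, so
    current = previous * (1 + change_pct / 100).
    """
    return current / (1 + change_pct / 100)


if __name__ == "__main__":
    current = parse_volume("2.76T tokens")          # DeepSeek V4 Flash
    print(f"{previous_volume(current, 102):.3e}")   # implied prior-period volume
```

Under these assumptions, DeepSeek V4 Flash at 2.76T tokens with ↑102% implies roughly 1.37T tokens in the prior period. Because the displayed figures are rounded, any back-computed value is approximate.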

Tool Calls

Compare tool-call usage across models

Claude Sonnet 4.6

by anthropic · tool usage

94.1B calls

↑18%

GPT-5.5

by openai · tool usage

88.5B calls

↑12%

Gemini 3 Flash Preview

by google · tool usage

76.4B calls

↑9%

DeepSeek V4 Flash

by deepseek · tool usage

70.2B calls

↑6%

Qwen3.6 35B A3B

by qwen · tool usage

62.9B calls

↓5%

Claude Opus 4.7

by anthropic · tool usage

58.6B calls

↓7%

MiniMax M2.7

by minimax · tool usage

50.2B calls

↑14%

Kimi K2.6

by moonshotai · tool usage

46.8B calls

gpt-oss-120b

by openai · tool usage

39.7B calls

↑11%

10.

GLM 5.1

by zhipu · tool usage

34.9B calls

↑3%

Benchmarks

Compare model performance by composite capability score

GPT-5.5 (xhigh)

by openai

66.8

Claude Opus 4.7 (Adaptive)

by anthropic

64.9

MiMo-V2.5-Pro

by xiaomi

63.7

Grok 4.3

by x-ai

61.4

Claude Sonnet 4.6

by anthropic

55.1

Qwen3.6 35B A3B (Reasoning)

by qwen

51.7

MiniMax-M2.1

by minimax

48.2

Mistral Medium 3.5

by mistralai

45.6

Grok 4.1 Fast (Reasoning)

by x-ai

43.1

10.

Gemini 3 Flash Preview

by google

40.5

Fastest models

Compare model throughput across providers

Highest throughput

gpt-oss-safeguard-20b

fastest on Groq

645 tok/s

$0.07/M

gpt-oss-20b

fastest on Groq

634 tok/s

$0.07/M

gpt-oss-120b

fastest on Cerebras

626 tok/s

$0.35/M

Mercury 2

fastest on Inception

355 tok/s

$0.25/M

Qwen3 32B

fastest on Cerebras

351 tok/s

$0.29/M

GLM 4.7

fastest on Cerebras

302 tok/s

$2.25/M

MiniMax M2.5

fastest on Mara

261 tok/s

$0.30/M

Llama 3.1 8B Instruct

fastest on Cerebras

230 tok/s

$0.10/M

Qwen3.6 35B A3B

fastest on WandB

165 tok/s

$0.25/M

10.

Nano Banana (Gemini 2.5)

fastest on Google

162 tok/s

$0.35/M
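To compare entries in this list, throughput and price can be converted into time and cost for a given workload. This is a rough sketch under stated assumptions: `tok/s` is treated as sustained output tokens per second, and `$/M` as US dollars per one million tokens. The page does not say whether the price covers input or output tokens, and the function names are hypothetical.

```python
def generation_seconds(tokens: int, tok_per_s: float) -> float:
    """Time to emit `tokens` at a sustained throughput (ignores latency to first token)."""
    if tok_per_s <= 0:
        raise ValueError("throughput must be positive")
    return tokens / tok_per_s


def generation_cost_usd(tokens: int, usd_per_million: float) -> float:
    """Cost of `tokens` at a price quoted per one million tokens."""
    return tokens / 1_000_000 * usd_per_million


if __name__ == "__main__":
    tokens = 100_000
    # gpt-oss-safeguard-20b on Groq: 645 tok/s at $0.07/M (figures from the list above)
    print(round(generation_seconds(tokens, 645), 1), "seconds")
    print(round(generation_cost_usd(tokens, 0.07), 4), "USD")
```

With these assumptions, 100,000 tokens at 645 tok/s take about 155 seconds and cost about $0.007. Real workloads also include queueing and time to first token, which this sketch leaves out.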

Context Length

Compare model usage by context window


Gemini 3 Flash Preview

by google · 1.12T tokens

1M context

↑17%

Gemini 2.5 Flash

by google · 566B tokens

1M context

↓1%

Claude Sonnet 4.6

by anthropic · 1.56T tokens

200K context

↑3%

Claude Opus 4.7

by anthropic · 1.41T tokens

200K context

↓8%

GPT-5.5

by openai · 478B tokens

400K context

↑5%

Qwen3.6 35B A3B

by qwen · 600B tokens

256K context

↑9%

DeepSeek V4 Pro

by deepseek · 950B tokens

128K context

↑8%

MiniMax M2.7

by minimax · 637B tokens

1M context

↓17%

Kimi K2.6

by moonshotai · 782B tokens

256K context

↓4%

10.

Nemotron 3 Super

by nvidia · 589B tokens

128K context

↓2%

Languages

Compare model usage by natural language

DeepSeek V4 Flash

by deepseek · Chinese

2.38T tokens

↑66%

Hy3 preview

by tencent · Chinese

2.12T tokens

↑38%

Qwen3.6 35B A3B

by qwen · Chinese

1.76T tokens

↑21%

Kimi K2.6

by moonshotai · Chinese

1.42T tokens

↓9%

GLM 5.1

by zhipu · Chinese

1.08T tokens

↑18%

MiniMax M2.7

by minimax · Chinese

820B tokens

↓7%

DeepSeek V3.2

by deepseek · Chinese

790B tokens

↑12%

Claude Sonnet 4.6

by anthropic · Chinese

520B tokens

↑4%

Gemini 3 Flash Preview

by google · Chinese

470B tokens

↓3%

10.

GPT-5.5

by openai · Chinese

390B tokens

Programming

Compare model usage by programming language

Python

Claude Sonnet 4.6

by anthropic · Python

1.18T tokens

↑3%

DeepSeek V4 Flash

by deepseek · Python

1.06T tokens

↑102%

GPT-5.5

by openai · Python

620B tokens

↑5%

Qwen3.6 35B A3B

by qwen · Python

540B tokens

↑19%

Gemini 3 Flash Preview

by google · Python

502B tokens

↓1%

Kimi K2.6

by moonshotai · Python

440B tokens

↓4%

DeepSeek V4 Pro

by deepseek · Python

390B tokens

↑8%

gpt-oss-120b

by openai · Python

360B tokens

↑14%

GLM 5.1

by zhipu · Python

306B tokens

↑6%

10.

Mistral Medium 3.5

by mistralai · Python

240B tokens

Images

Cumulative trend of image tasks processed by models

Nano Banana (Gemini 2.5)

by google · image processing

48.2M images

↑22%

Gemini 2.5 Flash Image

by google · image processing

41.8M images

↑13%

GPT Image 1

by openai · image processing

30.6M images

↑6%

Claude Sonnet 4.6

by anthropic · image input

24.7M images

↑3%

Qwen Image

by qwen · image processing

21.9M images

↑11%

MiniMax M2.7

by minimax · image input

17.1M images

↓4%

Gemini 3 Flash Preview

by google · image input

15.8M images

↓1%

Mistral Medium 3.5

by mistralai · image input

13.2M images

GLM 5.1

by zhipu · image input

10.9M images

↑6%

10.

DeepSeek V4 Pro

by deepseek · image input

8.6M images

↑8%

Audio Input

Cumulative trend of audio inputs processed by models

GPT-4o Transcribe

by openai · audio input

26.4M prompts

↑16%

Gemini 2.5 Flash

by google · audio input

21.9M prompts

↓1%

Whisper Large V3

by openai · audio input

18.2M prompts

↓4%

MiniMax Speech 2.5

by minimax · audio input

14.7M prompts

↑12%

Gemini 3 Flash Preview

by google · audio input

12.6M prompts

↑7%

Claude Sonnet 4.6

by anthropic · audio input

10.1M prompts

↑3%

Qwen Audio

by qwen · audio input

8.8M prompts

↑9%

Mistral Voxtral

by mistralai · audio input

7.4M prompts

GLM Audio

by zhipu · audio input

5.9M prompts

↑5%

10.

Llama 3.1 Audio

by meta · audio input

4.8M prompts

↓6%

Top Apps

Model adoption by application and agent scenario

Hermes Agent

by nousresearch

353B tokens

OpenClaw

by openclaw

195B tokens

Kilo Code

by kilocode

166B tokens

Claude Code

by anthropic

70.5B tokens

CSS AI Pro

by css

66.7B tokens

Descript

by descript

62.7B tokens

by inflection

39.9B tokens

Janitor AI

by janitorai

27.7B tokens

ISEKAI ZERO

by isekai

25B tokens

10.

Roo Code

by roocode

22.8B tokens
