AI Model Rankings

Tracks the market performance of mainstream AI models based on model popularity, real-world call trends, and capability metrics.

Top Models

Weekly usage trends for mainstream models

LLM Leaderboard

Query call volume for different models by time range

DeepSeek V4 Flash

by deepseek

5.35T tokens

↑14%

MiMo-V2.5

by xiaomi

4.30T tokens

↓5%

MiniMax M3

by minimax

4.02T tokens

↑7%

Hy3 preview

by tencent

3.37T tokens

↑3%

GLM 5.2

by z-ai

2.54T tokens

↑28%

DeepSeek V4 Pro

by deepseek

2.25T tokens

↑10%

Claude Opus 4.8

by anthropic

2.08T tokens

↑6%

Claude Opus 4.7

by anthropic

2.02T tokens

↓14%

Step 3.7 Flash

by stepfun

1.56T tokens

↑5%

10.

Claude Sonnet 4.6

by anthropic

1.47T tokens

↓2%

11.

owl-alpha

by MGEO

1.31T tokens

↓61%

12.

GPT-5.5

by openai

1.18T tokens

↑21%

13.

nemotron-3-ultra-550b-a55b-20260604:free

by nvidia

958B tokens

↑45%

14.

Gemini 3 Flash Preview

by google

935B tokens

↑4%

15.

laguna-m.1-20260312:free

by poolside

768B tokens

↑38%

16.

Gemini 2.5 Flash Lite

by google

612B tokens

↓3%

17.

Gemini 2.5 Flash

by google

596B tokens

↑3%

18.

MiMo-V2.5-Pro

by xiaomi

555B tokens

↑16%

19.

gpt-oss-120b

by openai

474B tokens

↑20%

20.

DeepSeek V3.2

by deepseek

461B tokens

↓41%

Tool Calls

Compare tool-call usage across models

Others

by others · tool usage

240M calls

↑3%

DeepSeek V4 Flash

by deepseek · tool usage

62M calls

↑20%

MiMo-V2.5

by xiaomi · tool usage

39M calls

↓22%

MiniMax M3

by minimax · tool usage

35M calls

↑5%

Hy3 preview

by tencent · tool usage

32M calls

↓16%

GLM 5.2

by z-ai · tool usage

29M calls

↑25%

DeepSeek V4 Pro

by deepseek · tool usage

22M calls

↑3%

GPT-4.1 Mini

by openai · tool usage

21M calls

Gemini 3 Flash Preview

by google · tool usage

18M calls

↓6%

10.

Claude Sonnet 4.6

by anthropic · tool usage

17M calls

Benchmarks

Compare model performance by composite capability metrics

GPT-5.5 (xhigh)

by openai

66.8

Claude Opus 4.7 (Adaptive)

by anthropic

64.9

MiMo-V2.5-Pro

by xiaomi

63.7

Grok 4.3

by x-ai

61.4

Claude Sonnet 4.6

by anthropic

55.1

Qwen3.6 35B A3B (Reasoning)

by qwen

51.7

MiniMax-M2.1

by minimax

48.2

Mistral Medium 3.5

by mistralai

45.6

Grok 4.1 Fast (Reasoning)

by x-ai

43.1

10.

Gemini 3 Flash Preview

by google

40.5

Fastest models

Compare model throughput across providers

Highest throughput

gpt-oss-safeguard-20b

fastest on Groq

645 tok/s

$0.07/M

gpt-oss-20b

fastest on Groq

634 tok/s

$0.07/M

gpt-oss-120b

fastest on Cerebras

626 tok/s

$0.35/M

Mercury 2

fastest on Inception

355 tok/s

$0.25/M

Qwen3 32B

fastest on Cerebras

351 tok/s

$0.29/M

GLM 4.7

fastest on Cerebras

302 tok/s

$2.25/M

MiniMax M2.5

fastest on Mara

261 tok/s

$0.30/M

Llama 3.1 8B Instruct

fastest on Cerebras

230 tok/s

$0.10/M

Qwen3.6 35B A3B

fastest on WandB

165 tok/s

$0.25/M

10.

Nano Banana (Gemini 2.5)

fastest on Google

162 tok/s

$0.35/M

Context Length

Compare model usage by context window

10K

Others

by others

655M requests

↑1%

DeepSeek V4 Flash

by deepseek

243M requests

↓1%

Gemini 2.5 Flash Lite

by google

102M requests

↓5%

Gemini 2.5 Flash

by google

87M requests

↓1%

gpt-oss-120b

by openai

65M requests

↑22%

Gemini 3 Flash Preview

by google

63M requests

↓0%

Mistral Nemo

by mistralai

58M requests

↑3%

Gemini 3.1 Flash Lite

by google

57M requests

↓5%

Gemma 4 26B A4B

by google

44M requests

↓1%

10.

DeepSeek V3.2

by deepseek

34M requests

↓13%

Languages

Compare model performance by natural-language usage volume

Others

by others

2.51T tokens

↓5%

DeepSeek V4 Flash

by deepseek

863B tokens

↑1%

MiMo-V2.5

by xiaomi

660B tokens

↑22%

MiniMax M3

by minimax

622B tokens

↓1%

GLM 5.2

by z-ai

439B tokens

↑5%

Hy3 preview

by tencent

332B tokens

↑0%

DeepSeek V4 Pro

by deepseek

324B tokens

↑4%

Step 3.7 Flash

by stepfun

212B tokens

↑4%

GPT-5.5

by openai

163B tokens

↓4%

10.

Claude Sonnet 4.6

by anthropic

155B tokens

Programming

Compare model performance by programming-language usage volume

Python

Others

by others

652B tokens

↓4%

DeepSeek V4 Flash

by deepseek

216B tokens

↑0%

MiMo-V2.5

by xiaomi

142B tokens

↑14%

MiniMax M3

by minimax

137B tokens

↓6%

GLM 5.2

by z-ai

107B tokens

↑7%

DeepSeek V4 Pro

by deepseek

88.0B tokens

↑5%

Hy3 preview

by tencent

88.0B tokens

↓3%

Step 3.7 Flash

by stepfun

61.1B tokens

↑24%

Claude Opus 4.7

by anthropic

37.4B tokens

↑9%

10.

nemotron-3-ultra-550b-a55b-20260604:free

by nvidia

34.5B tokens

Images

Cumulative trend of image tasks processed by models

Others

by others

224M requests

↑15%

Gemini 2.5 Flash Lite

by google

176M requests

↓27%

Gemini 2.5 Flash

by google

63M requests

↑83%

Gemini 3 Flash Preview

by google

38M requests

↓2%

Qwen3 VL 235B A22B Instruct

by qwen

36M requests

↑138%

Claude Sonnet 4.6

by anthropic

28M requests

↓15%

MiMo-V2.5

by xiaomi

26M requests

↓10%

GPT-5.5

by openai

22M requests

↑4%

Qwen3.6 Plus

by qwen

18M requests

↓32%

10.

Claude Opus 4.8

by anthropic

17M requests

Audio Input

Cumulative trend of audio input processed by models

GPT-4o Transcribe

by openai · audio input

26.4M prompts

↑16%

Gemini 2.5 Flash

by google · audio input

21.9M prompts

↓1%

Whisper Large V3

by openai · audio input

18.2M prompts

↓4%

MiniMax Speech 2.5

by minimax · audio input

14.7M prompts

↑12%

Gemini 3 Flash Preview

by google · audio input

12.6M prompts

↑7%

Claude Sonnet 4.6

by anthropic · audio input

10.1M prompts

↑3%

Qwen Audio

by qwen · audio input

8.8M prompts

↑9%

Mistral Voxtral

by mistralai · audio input

7.4M prompts

GLM Audio

by zhipu · audio input

5.9M prompts

↑5%

10.

Llama 3.1 Audio

by meta · audio input

4.8M prompts

↓6%

Top Apps

Observe model adoption across application and agent scenarios

Hermes Agent

by nousresearch

353B tokens

OpenClaw

by openclaw

195B tokens

Kilo Code

by kilocode

166B tokens

Claude Code

by anthropic

70.5B tokens

CSS AI Pro

by css

66.7B tokens

Descript

by descript

62.7B tokens

by inflection

39.9B tokens

Janitor AI

by janitorai

27.7B tokens

ISEKAI ZERO

by isekai

25B tokens

10.

Roo Code

by roocode

22.8B tokens
