deepseek-v4-flash | 158B | 1048576 | FP8 | 0.59 (3.33) | 37.94 (30.78) | —
deepseek-v4-pro | 1T | 1048576 | FP8 | 9.96 (3.53) | 13.18 (30.01) | —
glm-5.1 | 756B | 202752 | FP8 | 3.14 (1.73) | 15.32 (31.26) | —
gemma4:31b | 32B | 262144 | BF16 | 0.39 (1.13) | 127.90 (48.92) | —
kimi-k2.6 | 1T | 262144 | INT4 | n/a (1.85) | n/a (39.66) | Server error '503 Service Unavailable' for url 'https://ollama.com/api/chat' (for more information check: https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/503)
minimax-m2.5 | 230B | 196608 | FP8 | 0.35 (1.11) | 37.25 (35.20) | —
[Per-row history charts omitted. The chart after the first metric had axis marks 1s and 10s; the chart after the second had axis marks 10 and 100; both had time marks at -6h, -12h, and -24h.]