Model                      Category      Tag                                     Size   Speed
Phi-3 Mini 3.8B            general       phi3:mini                               3 GB   ~180 tok/s
Mistral 7B                 general       mistral:7b                              5 GB   ~138 tok/s
Mistral 7B Abliterated     general       mannix/mistral-7b-instruct-abliterated  5 GB   ~138 tok/s
Llama 3.1 8B               general       llama3.1:8b                             6 GB   ~128 tok/s
Qwen 2.5 7B                multilingual  qwen2.5:7b                              5 GB   ~132 tok/s
Dolphin Mistral 7B         uncensored    dolphin-mistral                         5 GB   ~128 tok/s
DeepSeek R1 7B             reasoning     deepseek-r1:7b                          5 GB   ~120 tok/s
DeepSeek R1 14B            reasoning     deepseek-r1:14b                         10 GB  ~86 tok/s
Mathstral 7B               math          mathstral:7b                            5 GB   ~138 tok/s
Phi-3 Medium 14B           coding        phi3:medium                             9 GB   ~68 tok/s
Qwen 2.5 Coder 7B          coding        qwen2.5-coder:7b                        5 GB   ~132 tok/s
Qwen 2.5 Coder 14B         coding        qwen2.5-coder:14b                       10 GB  ~80 tok/s
Qwen 2.5 Coder 32B         coding        qwen2.5-coder:32b                       20 GB  ~44 tok/s
DeepSeek R1 32B            reasoning     deepseek-r1:32b                         20 GB  ~44 tok/s
SOLAR 10.7B                reasoning     solar:10.7b                             7 GB   ~98 tok/s
Nous Hermes 2              general       nous-hermes2                            5 GB   ~130 tok/s
Llama 3.1 70B              general       llama3.1:70b                            40 GB  ~21 tok/s
Llama 3.1 70B Abliterated  uncensored    huihui_ai/llama3.1-abliterated:70b      40 GB  ~21 tok/s
Meditron 7B                medical       meditron:7b                             5 GB   ~130 tok/s
Nomic Embed Text           embedding     nomic-embed-text                        1 GB
MxBAI Embed Large          embedding     mxbai-embed-large                       1 GB
LLaVA 7B                   vision        llava:7b                                6 GB   ~80 tok/s