DefiledAI Research

MODEL DATABASE

Open-weight models catalogued by family, with quantization options, context windows, and minimum VRAM requirements for local inference.

Llamaby Meta
ACTIVE
ModelParamsContextMin VRAMLicenseQuants Available
Llama 3.1 8B8B128K6GBLlama 3
Q4_K_MQ5_K_MQ8_0F16
Llama 3.1 70B70B128K40GBLlama 3
Q2_KQ4_K_MQ5_K_MIQ3_M
Llama 3.1 405B405B128K240GBLlama 3
Q2_KIQ1_M
Qwenby Alibaba
ACTIVE
ModelParamsContextMin VRAMLicenseQuants Available
Qwen 3 7B7B32K6GBApache 2.0
Q4_K_MQ5_K_MQ8_0
Qwen 3 14B14B32K10GBApache 2.0
Q4_K_MQ5_K_M
Qwen 3 72B72B32K40GBApache 2.0
Q2_KQ4_K_MQ5_K_M
DeepSeekby DeepSeek
ACTIVE
ModelParamsContextMin VRAMLicenseQuants Available
DeepSeek R1 7B7B32K6GBMIT
Q4_K_MQ8_0
DeepSeek R1 70B70B32K40GBMIT
Q4_K_MIQ3_M
DeepSeek V3671B MoE128KMulti-GPUMIT
Q2_KIQ1_M
Mistralby Mistral AI
ACTIVE
ModelParamsContextMin VRAMLicenseQuants Available
Mistral 7B v0.37B32K6GBApache 2.0
Q4_K_MQ5_K_MQ8_0F16
Mixtral 8x7B56B MoE32K24GBApache 2.0
Q4_K_MQ5_K_M
Mixtral 8x22B141B MoE64K48GBApache 2.0
Q2_KQ4_K_M
Gemmaby Google
ACTIVE
ModelParamsContextMin VRAMLicenseQuants Available
Gemma 2 2B2B8K2GBGemma
Q4_K_MQ8_0F16
Gemma 2 9B9B8K6GBGemma
Q4_K_MQ5_K_MQ8_0
Gemma 2 27B27B8K16GBGemma
Q4_K_MQ5_K_M
Phiby Microsoft
ACTIVE
ModelParamsContextMin VRAMLicenseQuants Available
Phi-3 Mini3.8B128K3GBMIT
Q4_K_MQ8_0F16
Phi-3 Medium14B128K10GBMIT
Q4_K_MQ5_K_M
Missing a model? Request it on the forum.