TOOLS
26 interactive tools for local AI inference. No sign-up. Everything runs in your browser.
Local MoE Pipeline Builder
NEW & UNIQUEDesign a macro-scale Mixture-of-Experts pipeline from independent local models. Router + domain experts + conditional synthesizer. Generates Python code and YAML config.
Hardware Simulator
NEW & UNIQUEBeyond basic VRAM calculator: Input your exact setup (e.g., 2× RTX 4090 + 128GB RAM, or 1× 5090 + CPU offload) and get realistic estimates
Abliteration Test Suite
NEW & UNIQUEAbliteration test suite with publishable results score-card
Multi-Model Planner
NEW & UNIQUEGenerate code to run multiple models in a variety of ways
Prompt Tester
NEW & UNIQUETest prompt effectiveness
Quant Quality Estimator
NEW & UNIQUEVisual representation for Quant quality V baseline
True VRAM Calculator
NEW & UNIQUETrue VRAM Calculator based on all metrics
Model Compatibility Checker
MOST USEFULSelect your GPU — see every uncensored and abliterated model that fits with estimated tok/s and HuggingFace links.
Can I Run It?
EMBEDDABLEQuick GPU check for uncensored models. Embeddable iframe for Discord and websites.
Inference Speed Estimator
POPULARPredict tokens per second before downloading. 30+ GPUs, all quants, 6 backends.
Inference Profiler
ADVANCEDDetailed profile: throughput, time-to-first-token, bandwidth utilisation, CPU offload analysis. Compare two configs side-by-side.
GPU Price / Performance
UPDATED WEEKLYCurrent GPU rankings by inference value. Tok/s per dollar, sortable by metric. Updated weekly.
Benchmark Compare
VISUALSide-by-side GPU comparison across 7B, 13B, and 70B model sizes with visual bar charts.
Hardware Advisor
NEW4-question wizard giving specific GPU and build recommendations. Budget-aware, use-case aware.
VRAM Calculator
PRECISEExact VRAM for any model size, quant, and context length including KV cache breakdown.
Context Length Calculator
UNIQUEFind your maximum context window given VRAM, model, and KV cache quantization.
Token Budget Calculator
NEWPlan context usage, generation time, and API cost. Works for local and cloud models.
Quant Picker
BEGINNERAnswer 3 questions — get the right quantization format with a clear explanation.
Backend Picker
PRACTICAL4 questions to find the right inference backend for your GPU, OS, and use case.
Abliteration Quality Scorer
UNIQUECompare base vs abliterated benchmark scores. Grade retention S–D. 14 known models.
Model Diff
RESEARCHSide-by-side comparison of base vs abliterated outputs with real examples.
HuggingFace Tracker
CURATEDCurated list of abliterated, uncensored, and Dolphin uploads. Updated weekly.