Inference Calculator
Estimate out-of-memory (OOM) risk and memory requirements. Compare baseline engines against S88 (expected).
Inputs
Use presets or enter custom details. Estimates are directional.
Uses the standard KV-cache sizing formula (2 for keys and values × layers × KV heads × head dim × sequence length × batch × bytes per element of the dtype), plus conservative overhead ranges. Validate with a real benchmark before production deployment.
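As a rough illustration of the sizing formula above, here is a minimal Python sketch. It is not the calculator's actual implementation: the names `KVCacheConfig`, `kv_cache_bytes`, and `with_overhead` are hypothetical, and the 10% to 30% overhead range is an assumed placeholder, since the page does not state the ranges it uses. The leading factor of 2 accounts for storing both keys and values.

```python
from dataclasses import dataclass

# Bytes per element for a few common KV-cache dtypes.
DTYPE_BYTES = {"fp32": 4, "fp16": 2, "bf16": 2, "fp8": 1, "int8": 1}


@dataclass
class KVCacheConfig:
    """Hypothetical container for the formula's inputs."""
    layers: int
    kv_heads: int   # KV heads, which can be fewer than attention heads under GQA/MQA
    head_dim: int
    seq_len: int
    batch: int
    dtype: str = "fp16"


def kv_cache_bytes(cfg: KVCacheConfig) -> int:
    """KV-cache size in bytes: 2 (K and V) x layers x KV heads x head dim x seq len x batch x dtype bytes."""
    return (2 * cfg.layers * cfg.kv_heads * cfg.head_dim
            * cfg.seq_len * cfg.batch * DTYPE_BYTES[cfg.dtype])


def with_overhead(base_bytes: float, low: float = 0.10, high: float = 0.30):
    """Return (low, high) estimates in bytes. The default fractions are assumptions, not measured values."""
    return base_bytes * (1 + low), base_bytes * (1 + high)


if __name__ == "__main__":
    # Example inputs chosen for illustration only.
    cfg = KVCacheConfig(layers=32, kv_heads=8, head_dim=128, seq_len=8192, batch=1)
    base = kv_cache_bytes(cfg)
    lo, hi = with_overhead(base)
    gib = 1024 ** 3
    print(f"KV cache: {base / gib:.2f} GiB, with overhead: {lo / gib:.2f} to {hi / gib:.2f} GiB")
```

With these example inputs (32 layers, 8 KV heads, head dim 128, 8192 tokens, batch 1, fp16), the cache alone works out to exactly 1 GiB. Model weights, activations, and allocator fragmentation come on top of that, which is why a real benchmark is still needed.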