Calculator

Inference Calculator

Estimate OOM risk and memory requirements. Compare baseline engines vs S88 (expected).

Inputs

Use presets or enter custom details. Estimates are directional.

Serving setup

Shared prefix KV: 0%

Business impact (optional)

These are simple inputs for ROI / Opex / Capex math. Leave blank to ignore.

No data leaves your browser.

Uses the standard KV-cache sizing formula (layers × KV heads × head dim × sequence length × batch × dtype), plus conservative overhead ranges. Validate with a real benchmark before production deployment.