AI Cost Tools
Back to Calculator

AI Token Cost Calculation Methodology

This page explains how the calculator turns token usage into estimated API cost. It is meant to make the assumptions visible so users can verify the result instead of treating it as a billing statement.

Core formulas

Total cost = input cost + cached input cost + output cost Input cost = input tokens x calls x input price per 1M tokens / 1,000,000 Cached input cost = cached input tokens x calls x cached input price per 1M tokens / 1,000,000 Output cost = output tokens x calls x output price per 1M tokens / 1,000,000

Pricing fields

FieldMeaningCommon mistake
Input pricePrice for new prompt or context tokens sent to the model.Including cached tokens here again.
Cached input pricePrice for repeated context when the provider bills a cache hit separately.Assuming every repeated prompt is automatically cached.
Output pricePrice for generated tokens returned by the model.Forgetting that output is often priced higher than input.
CallsNumber of model requests in the selected period.Counting user tasks but not internal agent calls.

Data sources and review

Pricing entries are compiled from official provider pricing pages for OpenAI, Anthropic Claude, Google Gemini, and DeepSeek. The pricing table links to official sources so users can verify each provider directly.

The site is reviewed when official pricing changes are known, when users report a mismatch, or when new supported models are added. The calculator does not automatically sync with provider billing systems.

What is not included

Why results are estimates

Real bills depend on exact tokenizer output, provider-side rounding, model settings, retries, failed requests, feature-specific charges, and account-level terms. Use the calculator to compare scenarios and set a budget range, then verify with provider dashboards after launch.

Correction policy

If a pricing entry appears wrong, send the model name, official pricing URL, and the expected input, cached input, or output price to contact@aitokencost.net. Corrections are prioritized when they affect visible calculator results.

FAQ

Does the calculator use official prices?

It uses official provider pages as references, but users should verify current prices before production budgeting.

Why are cached input tokens separate?

Some providers bill repeated context differently. Separating cached input prevents users from mixing cache-hit and cache-miss prices.

Can I rely on this for invoices?

No. Use it for planning and comparison. The provider dashboard and invoice are the source of truth for billing.