Updated monthly

TOLVYN AI Cost Index

Real-world AI API costs from production workloads, updated monthly from anonymized, aggregated data across TOLVYN customers. No sponsored content. No vendor-submitted data.


Top models by volume

Production cost benchmarks

Average cost per API request, based on the last 30 days of production traffic.


Last 90 days

Model comparison



30-day trend

Avg cost per request over time

Top 5 models by request volume. Costs in USD per request.


Methodology

  • Source: Data is collected nightly from TOLVYN customers who have opted into data sharing.
  • What we collect: Provider name, model ID, token counts, cost per request, request timestamp.
  • What we never collect: Request content, prompts, completions, user data, company names, or any personally identifiable information.
  • Minimum sample size: Each data point requires contributions from at least 3 companies. Data points with fewer contributing companies are suppressed entirely.
  • How we calculate cost: Using the provider's published pricing at the time of each request, applied to the reported input and output token counts.
  • Opt-out: TOLVYN customers can opt out of data sharing at any time from their account settings. Data from opted-out customers is excluded starting with the next nightly collection.
  • Independence: TOLVYN is not affiliated with OpenAI, Anthropic, Google, or any AI provider. This index is published for informational purposes only.
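
To make the cost calculation and the minimum-sample-size rule concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not TOLVYN's actual pipeline: the `Request` type, the `PRICES` table, the model names, and the prices are all hypothetical, and the `company` field stands in for an opaque contributor ID used only to count distinct contributors. It also uses a single static price table, whereas the methodology applies the pricing in effect at the time of each request.

```python
from collections import defaultdict
from dataclasses import dataclass

# From the methodology: data points with fewer than 3 contributing
# companies are suppressed.
MIN_COMPANIES = 3

# Hypothetical prices in USD per 1M tokens as (input, output).
# These are placeholders, not real provider pricing.
PRICES = {
    "model-a": (1.00, 2.00),
    "model-b": (5.00, 15.00),
}


@dataclass(frozen=True)
class Request:
    company: str        # opaque contributor ID (assumption), used only for counting
    model: str
    input_tokens: int
    output_tokens: int


def request_cost(req, prices=PRICES):
    """Cost in USD of one request: token counts times per-token prices."""
    in_price, out_price = prices[req.model]
    return (req.input_tokens * in_price + req.output_tokens * out_price) / 1_000_000


def avg_cost_per_request(requests, min_companies=MIN_COMPANIES):
    """Average cost per request by model, suppressing under-sampled models."""
    costs = defaultdict(list)
    companies = defaultdict(set)
    for req in requests:
        costs[req.model].append(request_cost(req))
        companies[req.model].add(req.company)
    return {
        model: sum(model_costs) / len(model_costs)
        for model, model_costs in costs.items()
        if len(companies[model]) >= min_companies
    }


sample = [
    Request("c1", "model-a", 1000, 500),
    Request("c2", "model-a", 2000, 1000),
    Request("c3", "model-a", 3000, 1500),
    Request("c1", "model-b", 1000, 1000),  # model-b has only 2 contributors
    Request("c2", "model-b", 1000, 1000),
]
index = avg_cost_per_request(sample)
```

In this sample, `model-a` has three contributors and appears in `index`, while `model-b` has only two and is dropped entirely rather than reported with a caveat.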

Want to track your own AI costs?

TOLVYN gives you per-request cost tracking, budget enforcement, and your own cost benchmarks, compared against the index.

Start free — 10,000 requests View docs