LLM Cost Estimation Made Simple
Estimate standard API costs for current OpenAI, Anthropic, and Google models. One API call, instant results.
-H "Authorization: Bearer sk_..." \
-H "Content-Type: application/json" \
-d '{
"model": "claude-fable-5",
"metrics": '{
"input_tokens": 1024,
"output_tokens": 512
}
}'
Why ModelPricing.ai?
Everything you need to predict and control your AI spend.
Accurate Pricing
Current standard rates for OpenAI, Anthropic, and Google — including cache, multimodal, and breakpoint pricing where supported.
Lightning Fast
Low-latency responses powered by a global edge network. No cold starts, no waiting.
Simple Integration
One REST endpoint. Multiple client SDKs included. Get cost estimates in three lines of code.
LLM Pricing for Every Major Provider
We track per-token costs across Anthropic, OpenAI, and Google so you can compare AI model pricing in one place.
Anthropic Claude
Fable 5, Mythos 5, Opus 4.8
Compare current Claude API rates across Haiku, Sonnet, Opus, Fable, and Mythos, including prompt caching and 1M-context flagship models.
View pricingOpenAI
GPT-5.5, GPT-5.4, o3, o4-mini
Full pricing breakdown for GPT-5.x, GPT-4.1/4o, Codex, and o-series reasoning models, including long-context breakpoints where they apply.
View pricingGoogle Gemini
Gemini 3.x, Flash, Pro
Explore Gemini costs across Flash, Pro, image, audio, and preview models, with breakpoint and multimodal pricing called out.
View pricingNeed a side-by-side view? Compare all models or try our cost calculator.
How the LLM Pricing API Works
Estimate your AI costs programmatically in three steps. No spreadsheets, no manual lookups.
Create a Free API Key
Sign up and generate an API key in seconds. No credit card required — the free tier includes 2,000 requests per day.
Send a Model and Token Count
POST to /v1/estimate with the model name (e.g. claude-fable-5, gpt-5.5) and your input/output token counts.
Get an Instant Cost Estimate
Receive a detailed cost breakdown in milliseconds — including per-token rates, total cost, and pricing tier information.
Simple, Transparent Pricing
Start free, scale as you grow. No hidden fees.
For testing and personal projects
Always free
- API Requests per Minute 10
- API Requests per Day 2,000
- API Keys 2
- Data Retention 7 days
For small teams getting started
Only billed monthly
- API Requests per Minute 60
- API Requests per Day 50,000
- API Keys 10
- Data Retention 30 days
For scaling teams and production use
Only billed monthly
- API Requests per Minute 300
- API Requests per Day 200,000
- API Keys Unlimited
- Data Retention 90 days
Frequently Asked Questions
Common questions about LLM pricing and the ModelPricing.ai API.
What is ModelPricing.ai?
ModelPricing.ai is a REST API that returns real-time cost estimates for large language model (LLM) API calls. Send a model name and token count, and get back an instant price breakdown — no spreadsheets or manual lookups required.
Which LLM providers do you support?
We support the current standard-rate tables for Anthropic Claude (Haiku 4.5, Sonnet 4.6, Opus 4.8, Fable 5, and Mythos 5), OpenAI (GPT-5.5, GPT-5.4, GPT-5, GPT-4.1, GPT-4o, Codex, and o-series reasoning models), and Google Gemini (Flash, Flash Lite, Pro, image, audio, and preview models).
How much does the API cost?
The API is free for up to 2,000 requests per day with no credit card required. Paid plans start at $5/month for 50,000 requests/day and go up to $10/month for 200,000 requests/day with unlimited API keys.
How is LLM pricing calculated?
LLM providers charge per token — typically with separate rates for input tokens (your prompt) and output tokens (the model response). Some models also use breakpoint pricing where the per-token rate increases beyond a context-length threshold (e.g. 200K tokens).
What is breakpoint pricing?
Breakpoint pricing means the per-token cost changes at certain context-length thresholds. Gemini Pro models commonly switch at 200K input tokens, while GPT-5.4 and GPT-5.5 use a 272K-token threshold. Our API applies the matching rate tier from the model table.
Can I compare pricing across different LLM providers?
Yes. Our pricing comparison page lets you view per-token costs side-by-side across Anthropic, OpenAI, and Google models. You can also use the cost calculator to estimate spend for a specific workload across multiple models at once.
How do I get started?
Sign up for a free account, generate an API key, and POST to /v1/estimate with a model name and token counts. You will receive a cost breakdown in milliseconds. Python and JavaScript client libraries are available.