AICredits logo

AI Models

Browse and compare 438 available models by supported API, capability, and price.

Model availability subject to change. See Terms.

438Total
414Chat / Text
162Vision
23Embedding
17Images API
27Audio / TTS
438 models
ai21/jamba-large-1.7
Context256K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$8.00794.66
aion-labs/aion-1.0
Context131K
Input / 1M$4.00397.33
Cached / 1M$0.40039.73
Output / 1M$8.00794.66
aion-labs/aion-1.0-mini
Context131K
Input / 1M$0.70069.53
Cached / 1M$0.0706.95
Output / 1M$1.40139.06
aion-labs/aion-2.0
Context131K
Input / 1M$0.80079.47
Cached / 1M$0.0807.95
Output / 1M$1.60158.93
aion-labs/aion-rp-llama-3.1-8b
Context33K
Input / 1M$0.80079.47
Cached / 1M$0.0807.95
Output / 1M$1.60158.93
alfredpros/codellama-7b-instruct-solidity
Context4K
Input / 1M$0.80079.47
Cached / 1M$0.0807.95
Output / 1M$1.20119.20
allenai/olmo-3-32b-think
Context66K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.50049.67
Amazon: Nova 2 LiteChat APIVision
amazon/nova-2-lite-v1
Context1.0M
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$2.50248.33
Amazon: Nova Lite 1.0Chat APIVision
amazon/nova-lite-v1
Context300K
Input / 1M$0.0605.96
Cached / 1M$0.00600.60
Output / 1M$0.24023.84
amazon/nova-micro-v1
Context128K
Input / 1M$0.0353.48
Cached / 1M$0.00350.35
Output / 1M$0.14013.91
amazon/nova-premier-v1
Context1.0M
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$12.501241.65
Amazon: Nova Pro 1.0Chat APIVision
amazon/nova-pro-v1
Context300K
Input / 1M$0.80079.47
Cached / 1M$0.0807.95
Output / 1M$3.20317.86
Anthropic: Claude 3 HaikuMessages APIChat APIVision
anthropic/claude-3-haiku
Context200K
Input / 1M$0.25024.83
Cached / 1M$0.0252.48
Output / 1M$1.25124.17
Anthropic: Claude 3.5 HaikuMessages APIChat APIVision
anthropic/claude-3.5-haiku
Context200K
Input / 1M$0.80079.47
Cached / 1M$0.0807.95
Output / 1M$4.00397.33
Anthropic: Claude Haiku 4.5Messages APIChat APIVision
anthropic/claude-haiku-4.5
Context200K
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$5.00496.66
Anthropic: Claude Opus 4Messages APIChat APIVision
anthropic/claude-opus-4
Context200K
Input / 1M$15.001489.98
Cached / 1M$1.50149.00
Output / 1M$75.007449.91
Anthropic: Claude Opus 4.1Messages APIChat APIVision
anthropic/claude-opus-4.1
Context200K
Input / 1M$15.001489.98
Cached / 1M$1.50149.00
Output / 1M$75.007449.91
Anthropic: Claude Opus 4.5Messages APIChat APIVision
anthropic/claude-opus-4.5
Context200K
Input / 1M$5.00496.66
Cached / 1M$0.50049.67
Output / 1M$25.002483.30
Anthropic: Claude Opus 4.6Messages APIChat APIVision
anthropic/claude-opus-4.6
Context1.0M
Input / 1M$5.00496.66
Cached / 1M$0.50049.67
Output / 1M$25.002483.30
Anthropic: Claude Opus 4.6 (Fast)Messages APIChat APIVision
anthropic/claude-opus-4.6-fast
Context1.0M
Input / 1M$30.002979.96
Cached / 1M$3.00298.00
Output / 1M$150.0014899.82
Anthropic: Claude Opus 4.7Messages APIChat APIVision
anthropic/claude-opus-4.7
Context1.0M
Input / 1M$5.00496.66
Cached / 1M$0.50049.67
Output / 1M$25.002483.30
Anthropic: Claude Opus 4.7 (Fast)Messages APIChat APIVision
anthropic/claude-opus-4.7-fast
Context1.0M
Input / 1M$30.002979.96
Cached / 1M$3.00298.00
Output / 1M$150.0014899.82
Anthropic: Claude Opus 4.8Messages APIChat APIVision
anthropic/claude-opus-4.8
Context1.0M
Input / 1M$5.00496.66
Output / 1M$25.002483.30
Anthropic: Claude Opus 4.8 (Fast)Messages APIChat APIVision
anthropic/claude-opus-4.8-fast
Context1.0M
Input / 1M$10.00993.32
Output / 1M$50.004966.61
Anthropic: Claude Sonnet 4Messages APIChat APIVision
anthropic/claude-sonnet-4
Context1.0M
Input / 1M$3.00298.00
Cached / 1M$0.30029.80
Output / 1M$15.001489.98
Anthropic: Claude Sonnet 4.5Messages APIChat APIVision
anthropic/claude-sonnet-4.5
Context1.0M
Input / 1M$3.00298.00
Cached / 1M$0.30029.80
Output / 1M$15.001489.98
Anthropic: Claude Sonnet 4.6Messages APIChat APIVision
anthropic/claude-sonnet-4.6
Context1.0M
Input / 1M$3.00298.00
Cached / 1M$0.30029.80
Output / 1M$15.001489.98
arcee-ai/coder-large
Context33K
Input / 1M$0.50049.67
Cached / 1M$0.0504.97
Output / 1M$0.80079.47
arcee-ai/maestro-reasoning
Context131K
Input / 1M$0.90089.40
Cached / 1M$0.0908.94
Output / 1M$3.30327.80
Arcee AI: SpotlightChat APIVision
arcee-ai/spotlight
Context131K
Input / 1M$0.18017.88
Cached / 1M$0.0181.79
Output / 1M$0.18017.88
arcee-ai/trinity-large-thinking
Context262K
Input / 1M$0.22021.85
Cached / 1M$0.0222.19
Output / 1M$0.85084.43
arcee-ai/trinity-mini
Context131K
Input / 1M$0.0454.47
Cached / 1M$0.00450.45
Output / 1M$0.15014.90
arcee-ai/virtuoso-large
Context131K
Input / 1M$0.75074.50
Cached / 1M$0.0757.45
Output / 1M$1.20119.20
baai/bge-base-en-v1.5Embeddings APIEmbedding
baai/bge-base-en-v1.5
Context512
Input / 1M$0.00500.50
baai/bge-en-iclEmbeddings APIEmbedding
baai/bge-en-icl
Context8K
Input / 1M$0.0100.99
baai/bge-large-en-v1.5Embeddings APIEmbedding
baai/bge-large-en-v1.5
Context512
Input / 1M$0.0100.99
baai/bge-m3Embeddings APIEmbedding
baai/bge-m3
Context8K
Input / 1M$0.0100.99
baidu/ernie-4.5-vl-28b-a3b
Context131K
Input / 1M$0.14013.91
Cached / 1M$0.0141.39
Output / 1M$0.56055.63
baidu/ernie-4.5-vl-424b-a47b
Context131K
Input / 1M$0.42041.72
Cached / 1M$0.0424.17
Output / 1M$1.25124.17
baidu/ernie-4.5-21b-a3b
Context131K
Input / 1M$0.0706.95
Cached / 1M$0.00700.70
Output / 1M$0.28027.81
baidu/ernie-4.5-21b-a3b-thinking
Context131K
Input / 1M$0.0706.95
Cached / 1M$0.00700.70
Output / 1M$0.28027.81
baidu/ernie-4.5-300b-a47b
Context131K
Input / 1M$0.28027.81
Cached / 1M$0.0282.78
Output / 1M$1.10109.27
baidu/qianfan-ocr-fast
Context66K
Input / 1M$0.68067.55
Cached / 1M$0.0686.75
Output / 1M$2.81279.12
black-forest-labs/flux-1-devImages APIImage Generation
black-forest-labs/flux-1-dev
Per Image~$0.009/img₹0.89/img
black-forest-labs/flux-1-kontext-devImages APIImage Generation
black-forest-labs/flux-1-kontext-dev
Per Image~$0.010/img₹0.99/img
black-forest-labs/flux-1-redux-devImages APIImage Generation
black-forest-labs/flux-1-redux-dev
Per Image~$0.012/img₹1.19/img
black-forest-labs/flux-1-schnellImages APIImage Generation
black-forest-labs/flux-1-schnell
Per Image~$0.002/img₹0.20/img
black-forest-labs/flux-1.1-proImages APIImage Generation
black-forest-labs/flux-1.1-pro
Per Image$0.040/img₹3.97/img
black-forest-labs/flux-2-devImages APIImage Generation
black-forest-labs/flux-2-dev
Per Image~$0.010/img₹0.99/img
black-forest-labs/flux-2-klein-4bImages APIImage Generation
black-forest-labs/flux-2-klein-4b
Per Image~$0.014/img₹1.39/img
black-forest-labs/flux-2-klein-9bImages APIImage Generation
black-forest-labs/flux-2-klein-9b
Per Image~$0.015/img₹1.49/img
black-forest-labs/flux-2-maxImages APIImage Generation
black-forest-labs/flux-2-max
Per Image$0.070/img₹6.95/img
black-forest-labs/flux-2-proImages APIImage Generation
black-forest-labs/flux-2-pro
Per Image$0.015/img₹1.49/img
black-forest-labs/flux-proImages APIImage Generation
black-forest-labs/flux-pro
Per Image$0.050/img₹4.97/img
bytedance-seed/seed-1.6
Context262K
Input / 1M$0.25024.83
Cached / 1M$0.0252.48
Output / 1M$2.00198.66
bytedance-seed/seed-1.6-flash
Context262K
Input / 1M$0.0757.45
Cached / 1M$0.00750.74
Output / 1M$0.30029.80
bytedance-seed/seed-2.0-lite
Context262K
Input / 1M$0.25024.83
Cached / 1M$0.0252.48
Output / 1M$2.00198.66
bytedance-seed/seed-2.0-mini
Context262K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.40039.73
bytedance/ui-tars-1.5-7b
Context128K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.20019.87
chatgpt-image-latestChat APIResponses APIImage Output via Chat API
openai/chatgpt-image-latest
Context8K
Input / 1M$5.00496.66
Output / 1M$10.00993.32
Claude Haiku LatestMessages APIChat APIVisionLatest
anthropic/claude-haiku-latest
Context200K
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$5.00496.66
Claude Opus LatestMessages APIChat APIVisionLatest
anthropic/claude-opus-latest
Context1.0M
Input / 1M$5.00496.66
Cached / 1M$0.50049.67
Output / 1M$25.002483.30
Claude Sonnet LatestMessages APIChat APIVisionLatest
anthropic/claude-sonnet-latest
Context1.0M
Input / 1M$3.00298.00
Cached / 1M$0.30029.80
Output / 1M$15.001489.98
codex-mini-latestChat APIResponses API
openai/codex-mini-latest
Context8K
Input / 1M$1.50149.00
Output / 1M$6.00595.99
cohere/command-a
Context256K
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$10.00993.32
cohere/command-r-08-2024
Context128K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.60059.60
cohere/command-r-plus-08-2024
Context128K
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$10.00993.32
cohere/command-r7b-12-2024
Context128K
Input / 1M$0.0373.72
Cached / 1M$0.00370.37
Output / 1M$0.15014.90
dall-e-2Images APIImage GenerationFree
dall-e-2
Context8K
Per Image$0.016/img₹1.59/img
dall-e-2Images APIImage GenerationFree
openai/dall-e-2
Context8K
Per Image$0.016/img₹1.59/img
dall-e-3Images APIImage GenerationFree
dall-e-3
Context8K
Per Image$0.040/img₹3.97/img
dall-e-3Images APIImage GenerationFree
openai/dall-e-3
Context8K
Per Image$0.040/img₹3.97/img
deepcogito/cogito-v2.1-671b
Context128K
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$1.25124.17
DeepSeek LatestChat APILatest
deepseek/deepseek-latest
Context1.0M
Input / 1M$0.43543.21
Cached / 1M$0.00700.70
Output / 1M$0.87086.42
deepseek/deepseek-chat
Context131K
Input / 1M$0.22922.73
Cached / 1M$0.0232.27
Output / 1M$0.91490.83
deepseek/deepseek-chat-v3-0324
Context164K
Input / 1M$0.20019.87
Cached / 1M$0.0201.99
Output / 1M$0.77076.49
deepseek/deepseek-chat-v3.1
Context164K
Input / 1M$0.21020.86
Cached / 1M$0.0212.09
Output / 1M$0.79078.47
deepseek/deepseek-v3.1-terminus
Context164K
Input / 1M$0.27026.82
Cached / 1M$0.0272.68
Output / 1M$0.95094.37
deepseek/deepseek-v3.2
Context131K
Input / 1M$0.26025.83
Cached / 1M$0.0252.50
Output / 1M$0.38037.75
deepseek/deepseek-v3.2-exp
Context164K
Input / 1M$0.27026.82
Cached / 1M$0.0272.68
Output / 1M$0.41040.73
deepseek/deepseek-v4-flash
Context1.0M
Input / 1M$0.14013.91
Cached / 1M$0.0100.99
Output / 1M$0.28027.81
deepseek/deepseek-v4-pro
Context1.0M
Input / 1M$0.43543.21
Cached / 1M$0.00700.70
Output / 1M$0.87086.42
DeepSeek: R1Chat API
deepseek/deepseek-r1
Context164K
Input / 1M$0.70069.53
Cached / 1M$0.0706.95
Output / 1M$2.50248.33
deepseek/deepseek-r1-0528
Context164K
Input / 1M$0.50049.67
Cached / 1M$0.0504.97
Output / 1M$2.15213.56
deepseek/deepseek-r1-distill-llama-70b
Context128K
Input / 1M$0.70069.53
Cached / 1M$0.0706.95
Output / 1M$0.80079.47
deepseek/deepseek-r1-distill-qwen-32b
Context128K
Input / 1M$0.29028.81
Cached / 1M$0.0292.88
Output / 1M$0.29028.81
deepseek/deepseek-v3.1
Context164K
Input / 1M$0.21020.86
Cached / 1M$0.13012.91
Output / 1M$0.79078.47
deepseek/deepseek-v3.2-speciale
Context164K
Input / 1M$0.28728.51
Cached / 1M$0.0292.85
Output / 1M$0.43142.81
essentialai/rnj-1-instruct
Context33K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.15014.90
Gemini Flash LatestTranscriptions APIVisionTranscriptionLatest
google/gemini-flash-latest
Context1.0M
PricingPer minute
Gemini Pro LatestTranscriptions APIVisionTranscriptionLatest
google/gemini-pro-latest
Context1.0M
PricingPer minute
google/gemini-2.0-flash
Context8K
Input / 1M$0.1009.93
Output / 1M$0.40039.73
google/gemini-2.0-flash-lite
Context8K
Input / 1M$0.0757.45
Output / 1M$0.30029.80
google/gemini-2.5-flash-lite-preview
Context8K
Input / 1M$0.1009.93
Output / 1M$0.40039.73
google/gemini-2.5-flash-native-audio-preview-12-2025
Context8K
Input / 1M$0.50049.67
Output / 1M$2.00198.66
google/gemini-2.5-flash-preview
Context8K
Input / 1M$0.30029.80
Output / 1M$2.50248.33
google/gemini-3.1-flash-live-preview
Context8K
Input / 1M$0.75074.50
Output / 1M$4.50446.99
gemini-embedding-001Embeddings APIEmbedding
google/gemini-embedding-001
Context2K
Input / 1M$0.15014.90
gemini-embedding-2-previewEmbeddings APIEmbedding
google/gemini-embedding-2-preview
Context8K
Input / 1M$0.20019.87
gemma2-9b-itChat API
groq/gemma2-9b-it
Context8K
Input / 1M$0.20019.87
Output / 1M$0.20019.87
Google: Gemini 2.5 FlashTranscriptions APIVisionTranscription
google/gemini-2.5-flash
Context1.0M
PricingPer minute
Google: Gemini 2.5 Flash LiteTranscriptions APIVisionTranscription
google/gemini-2.5-flash-lite
Context1.0M
PricingPer minute
Google: Gemini 2.5 Flash Lite Preview 09-2025Transcriptions APIVisionTranscription
google/gemini-2.5-flash-lite-preview-09-2025
Context1.0M
PricingPer minute
Google: Gemini 2.5 ProTranscriptions APIVisionTranscription
google/gemini-2.5-pro
Context1.0M
PricingPer minute
Google: Gemini 2.5 Pro Preview 05-06Transcriptions APIVisionTranscription
google/gemini-2.5-pro-preview-05-06
Context1.0M
PricingPer minute
Google: Gemini 2.5 Pro Preview 06-05Transcriptions APIVisionTranscription
google/gemini-2.5-pro-preview
Context1.0M
PricingPer minute
Google: Gemini 3 Flash PreviewTranscriptions APIVisionTranscription
google/gemini-3-flash-preview
Context1.0M
PricingPer minute
Google: Gemini 3.1 Flash LiteTranscriptions APIVisionTranscription
google/gemini-3.1-flash-lite
Context1.0M
PricingPer minute
Google: Gemini 3.1 Flash Lite PreviewTranscriptions APIVisionTranscription
google/gemini-3.1-flash-lite-preview
Context1.0M
PricingPer minute
Google: Gemini 3.1 Pro PreviewTranscriptions APIVisionTranscription
google/gemini-3.1-pro-preview
Context1.0M
PricingPer minute
Google: Gemini 3.1 Pro Preview Custom ToolsTranscriptions APIVisionTranscription
google/gemini-3.1-pro-preview-customtools
Context1.0M
PricingPer minute
Google: Gemini 3.5 FlashTranscriptions APIVisionTranscription
google/gemini-3.5-flash
Context1.0M
PricingPer minute
google/gemma-2-27b-it
Context8K
Input / 1M$0.65064.57
Cached / 1M$0.0656.46
Output / 1M$0.65064.57
Google: Gemma 3 12BChat APIVision
google/gemma-3-12b-it
Context131K
Input / 1M$0.0504.97
Cached / 1M$0.00400.40
Output / 1M$0.15014.90
Google: Gemma 3 27BChat APIVision
google/gemma-3-27b-it
Context131K
Input / 1M$0.0807.95
Cached / 1M$0.00800.79
Output / 1M$0.16015.89
Google: Gemma 3 4BChat APIVision
google/gemma-3-4b-it
Context131K
Input / 1M$0.0504.97
Cached / 1M$0.00400.40
Output / 1M$0.1009.93
google/gemma-3n-e4b-it
Context33K
Input / 1M$0.0605.96
Cached / 1M$0.00600.60
Output / 1M$0.12011.92
google/gemma-4-26b-a4b-it
Context262K
Input / 1M$0.0706.95
Cached / 1M$0.00600.60
Output / 1M$0.34033.77
Google: Gemma 4 31BChat APIVision
google/gemma-4-31b-it
Context262K
Input / 1M$0.13012.91
Cached / 1M$0.0121.19
Output / 1M$0.38037.75
Google: Lyria 3 Clip PreviewTranscriptions APIVisionTranscriptionFree
google/lyria-3-clip-preview
Context1.0M
PricingPer minute
Google: Lyria 3 Pro PreviewTranscriptions APIVisionTranscriptionFree
google/lyria-3-pro-preview
Context1.0M
PricingPer minute
Google: Nano Banana (Gemini 2.5 Flash Image)Chat APIVisionImage Output via Chat API
google/gemini-2.5-flash-image
Context33K
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$2.50248.33
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)Chat APIVisionImage Output via Chat API
google/gemini-3.1-flash-image-preview
Context131K
Input / 1M$0.50049.67
Cached / 1M$0.0504.97
Output / 1M$3.00298.00
Google: Nano Banana 2 (Gemini 3.1 Flash Image)Chat APIVisionImage Output via Chat API
google/gemini-3.1-flash-image
Context131K
Input / 1M$0.50049.67
Output / 1M$3.00298.00
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)Chat APIVisionImage Output via Chat API
google/gemini-3-pro-image-preview
Context66K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$12.001191.99
Google: Nano Banana Pro (Gemini 3 Pro Image)Chat APIVisionImage Output via Chat API
google/gemini-3-pro-image
Context66K
Input / 1M$2.00198.66
Output / 1M$12.001191.99
google/gemini-2.0-flash-001Transcriptions APIVisionTranscription
google/gemini-2.0-flash-001
Context1.0M
PricingPer minute
google/gemini-2.0-flash-lite-001Transcriptions APIVisionTranscription
google/gemini-2.0-flash-lite-001
Context1.0M
PricingPer minute
google/gemma-4-31b-it-turbo
Context262K
Input / 1M$0.12011.92
Cached / 1M$0.0121.19
Output / 1M$0.37036.75
GPT LatestChat APIResponses APIVisionLatest
openai/gpt-latest
Context1.1M
Input / 1M$5.00496.66
Cached / 1M$0.50049.67
Output / 1M$30.002979.96
GPT Mini LatestChat APIResponses APIVisionLatest
openai/gpt-mini-latest
Context400K
Input / 1M$0.75074.50
Cached / 1M$0.0757.45
Output / 1M$4.50446.99
gpt-4oChat APIResponses API
gpt-4o
Context8K
Input / 1M$2.50248.33
Output / 1M$10.00993.32
gpt-4o-miniChat APIResponses API
gpt-4o-mini
Context8K
Input / 1M$0.15014.90
Output / 1M$0.60059.60
gpt-4o-mini-realtime-previewChat APIResponses API
gpt-4o-mini-realtime-preview
Context8K
Input / 1M$0.60059.60
Output / 1M$2.40238.40
gpt-4o-mini-realtime-previewChat APIResponses API
openai/gpt-4o-mini-realtime-preview
Context8K
Input / 1M$0.60059.60
Output / 1M$2.40238.40
gpt-4o-realtime-previewChat APIResponses API
gpt-4o-realtime-preview
Context8K
Input / 1M$5.00496.66
Output / 1M$20.001986.64
gpt-4o-realtime-previewChat APIResponses API
openai/gpt-4o-realtime-preview
Context8K
Input / 1M$5.00496.66
Output / 1M$20.001986.64
gpt-5Chat APIResponses API
gpt-5
Context8K
Input / 1M$1.25124.17
Output / 1M$10.00993.32
gpt-5-chat-latestChat APIResponses API
openai/gpt-5-chat-latest
Context8K
Input / 1M$1.25124.17
Output / 1M$10.00993.32
gpt-5-codexChat APIResponses API
gpt-5-codex
Context8K
Input / 1M$1.25124.17
Output / 1M$10.00993.32
gpt-5-miniChat APIResponses API
gpt-5-mini
Context8K
Input / 1M$0.25024.83
Output / 1M$2.00198.66
gpt-5-nanoChat APIResponses API
gpt-5-nano
Context8K
Input / 1M$0.0504.97
Output / 1M$0.40039.73
gpt-5-search-apiChat APIResponses API
gpt-5-search-api
Context8K
Input / 1M$1.25124.17
Output / 1M$10.00993.32
gpt-5-search-apiChat APIResponses API
openai/gpt-5-search-api
Context8K
Input / 1M$1.25124.17
Output / 1M$10.00993.32
gpt-image-1Images APIImage Generation
gpt-image-1
Context8K
Per ImageFormula
gpt-image-1Images APIImage Generation
openai/gpt-image-1
Context8K
Per ImageFormula
gpt-image-1-miniChat APIResponses APIImage Output via Chat API
gpt-image-1-mini
Context8K
Input / 1M$2.00198.66
Output / 1MFree
gpt-image-1-miniChat APIResponses APIImage Output via Chat API
openai/gpt-image-1-mini
Context8K
Input / 1M$2.00198.66
Output / 1MFree
gpt-image-1.5Chat APIResponses APIImage Output via Chat API
gpt-image-1.5
Context8K
Input / 1M$8.00794.66
Output / 1M$32.003178.63
gpt-image-1.5Chat APIResponses APIImage Output via Chat API
openai/gpt-image-1.5
Context8K
Input / 1M$8.00794.66
Output / 1M$32.003178.63
gpt-realtimeChat APIResponses API
gpt-realtime
Context8K
Input / 1M$4.00397.33
Output / 1M$16.001589.31
gpt-realtimeChat APIResponses API
openai/gpt-realtime
Context8K
Input / 1M$4.00397.33
Output / 1M$16.001589.31
gpt-realtime-miniChat APIResponses API
gpt-realtime-mini
Context8K
Input / 1M$0.60059.60
Output / 1M$2.40238.40
gpt-realtime-miniChat APIResponses API
openai/gpt-realtime-mini
Context8K
Input / 1M$0.60059.60
Output / 1M$2.40238.40
Grok LatestChat APIVisionLatest
x-ai/grok-latest
Context1.0M
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$2.50248.33
ibm-granite/granite-4.0-h-micro
Context131K
Input / 1M$0.0171.69
Cached / 1M$0.00170.17
Output / 1M$0.11211.13
ibm-granite/granite-4.1-8b
Context131K
Input / 1M$0.0504.97
Cached / 1M$0.00500.50
Output / 1M$0.1009.93
inception/mercury-2
Context128K
Input / 1M$0.25024.83
Cached / 1M$0.0252.48
Output / 1M$0.75074.50
inclusionai/ling-2.6-1t
Context262K
Input / 1M$0.0757.45
Cached / 1M$0.00750.74
Output / 1M$0.62562.08
inclusionai/ling-2.6-flash
Context262K
Input / 1M$0.0100.99
Cached / 1M$0.00100.10
Output / 1M$0.0302.98
inclusionai/ring-2.6-1t
Context262K
Input / 1M$0.0757.45
Cached / 1M$0.00750.74
Output / 1M$0.62562.08
inflection/inflection-3-pi
Context8K
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$10.00993.32
inflection/inflection-3-productivity
Context8K
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$10.00993.32
intfloat/e5-base-v2Embeddings APIEmbedding
intfloat/e5-base-v2
Context512
Input / 1M$0.00500.50
intfloat/e5-large-v2Embeddings APIEmbedding
intfloat/e5-large-v2
Context512
Input / 1M$0.0100.99
intfloat/multilingual-e5-largeEmbeddings APIEmbedding
intfloat/multilingual-e5-large
Context512
Input / 1M$0.0100.99
intfloat/multilingual-e5-large-instruct
Context512
Input / 1M$0.0100.99
kwaipilot/kat-coder-pro-v2
Context256K
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$1.20119.20
liquid/lfm-2-24b-a2b
Context128K
Input / 1M$0.0302.98
Cached / 1M$0.00300.30
Output / 1M$0.12011.92
meta-llama/llama-guard-3-8b
Context131K
Input / 1M$0.48548.18
Cached / 1M$0.0484.81
Output / 1M$0.0302.98
groq/llama-3.1-70b-versatile
Context8K
Input / 1M$0.59058.61
Output / 1M$0.79078.47
groq/llama-3.1-8b-instant
Context8K
Input / 1M$0.0504.97
Output / 1M$0.0807.95
groq/llama-3.3-70b-versatile
Context8K
Input / 1M$0.59058.61
Output / 1M$0.79078.47
groq/llama3-70b-8192
Context8K
Input / 1M$0.59058.61
Output / 1M$0.79078.47
groq/llama3-8b-8192
Context8K
Input / 1M$0.0504.97
Output / 1M$0.0807.95
mancer/weaver
Context8K
Input / 1M$0.75074.50
Cached / 1M$0.0757.45
Output / 1M$1.0099.33
meta-llama/llama-3.2-90b-vision-instruct
Context131K
Input / 1M$0.72071.52
Cached / 1M$0.0727.15
Output / 1M$0.72071.52
meta-llama/llama-3-70b-instruct
Context8K
Input / 1M$0.51050.66
Cached / 1M$0.0515.07
Output / 1M$0.74073.51
meta-llama/llama-3-8b-instruct
Context8K
Input / 1M$0.0403.97
Cached / 1M$0.00400.40
Output / 1M$0.0403.97
meta-llama/llama-3.1-70b-instruct
Context131K
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$0.40039.73
meta-llama/llama-3.1-8b-instruct
Context131K
Input / 1M$0.0201.99
Cached / 1M$0.00200.20
Output / 1M$0.0504.97
meta-llama/llama-3.2-11b-vision-instruct
Context131K
Input / 1M$0.34534.27
Cached / 1M$0.0252.43
Output / 1M$0.34534.27
meta-llama/llama-3.2-1b-instruct
Context131K
Input / 1M$0.0272.68
Cached / 1M$0.00270.27
Output / 1M$0.20119.97
meta-llama/llama-3.2-3b-instruct
Context131K
Input / 1M$0.0515.06
Cached / 1M$0.00510.51
Output / 1M$0.33533.28
meta-llama/llama-3.3-70b-instruct
Context131K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.32031.79
meta-llama/llama-4-maverick
Context1.0M
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.60059.60
Meta: Llama 4 ScoutChat APIVision
meta-llama/llama-4-scout
Context10.0M
Input / 1M$0.0807.95
Cached / 1M$0.00800.79
Output / 1M$0.30029.80
meta-llama/llama-guard-4-12b
Context164K
Input / 1M$0.18017.88
Cached / 1M$0.0181.79
Output / 1M$0.18017.88
microsoft/phi-4
Context16K
Input / 1M$0.0706.95
Cached / 1M$0.00650.65
Output / 1M$0.14013.91
microsoft/phi-4-mini-instruct
Context131K
Input / 1M$0.0807.95
Cached / 1M$0.00800.79
Output / 1M$0.35034.77
minimax/minimax-m1
Context1.0M
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$2.20218.53
minimax/minimax-m2
Context205K
Input / 1M$0.25525.33
Cached / 1M$0.0252.53
Output / 1M$1.0099.33
minimax/minimax-m2-her
Context66K
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$1.20119.20
minimax/minimax-m2.1
Context205K
Input / 1M$0.29028.81
Cached / 1M$0.0292.88
Output / 1M$0.95094.37
minimax/minimax-m2.5
Context205K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$1.15114.23
minimax/minimax-m2.7
Context205K
Input / 1M$0.27927.71
Cached / 1M$0.0282.77
Output / 1M$1.20119.20
MiniMax: MiniMax M3Chat APIVision
minimax/minimax-m3
Context1.0M
Input / 1M$0.30029.80
Output / 1M$1.20119.20
MiniMax: MiniMax-01Chat APIVision
minimax/minimax-01
Context1.0M
Input / 1M$0.20019.87
Cached / 1M$0.0201.99
Output / 1M$1.10109.27
mistralai/mistral-large
Context128K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$6.00595.99
mistralai/mistral-large-2407
Context131K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$6.00595.99
mistralai/codestral-2508
Context256K
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$0.90089.40
mistralai/devstral-2512
Context262K
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$2.00198.66
mistralai/ministral-14b-2512
Context262K
Input / 1M$0.20019.87
Cached / 1M$0.0201.99
Output / 1M$0.20019.87
mistralai/ministral-3b-2512
Context131K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.1009.93
mistralai/ministral-8b-2512
Context262K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.15014.90
mistralai/mistral-large-2512
Context262K
Input / 1M$0.50049.67
Cached / 1M$0.0504.97
Output / 1M$1.50149.00
mistralai/mistral-medium-3
Context131K
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$2.00198.66
mistralai/mistral-medium-3.1
Context131K
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$2.00198.66
mistralai/mistral-medium-3-5
Context262K
Input / 1M$1.50149.00
Cached / 1M$0.15014.90
Output / 1M$7.50744.99
mistralai/mistral-nemo
Context131K
Input / 1M$0.0201.99
Cached / 1M$0.00200.20
Output / 1M$0.0302.98
mistralai/mistral-small-24b-instruct-2501
Context33K
Input / 1M$0.0504.97
Cached / 1M$0.00500.50
Output / 1M$0.0807.95
mistralai/mistral-small-3.1-24b-instruct
Context128K
Input / 1M$0.35134.87
Cached / 1M$0.0353.49
Output / 1M$0.55555.13
mistralai/mistral-small-3.2-24b-instruct
Context128K
Input / 1M$0.0757.45
Cached / 1M$0.00750.74
Output / 1M$0.20019.87
mistralai/mistral-small-2603
Context262K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.60059.60
mistralai/mixtral-8x22b-instruct
Context66K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$6.00595.99
mistralai/mistral-saba
Context33K
Input / 1M$0.20019.87
Cached / 1M$0.0201.99
Output / 1M$0.60059.60
Mistral: Voxtral Small 24B 2507Transcriptions APITranscription
mistralai/voxtral-small-24b-2507
Context32K
PricingPer minute
mistralai/devstral-medium
Context131K
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$2.00198.66
mistralai/devstral-small
Context131K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.30029.80
mistralai/mistral-7b-instruct-v0.1
Context4K
Input / 1M$0.11010.93
Cached / 1M$0.0111.09
Output / 1M$0.19018.87
mistralai/mistral-large-2411
Context131K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$6.00595.99
mistralai/pixtral-large-2411
Context131K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$6.00595.99
mistralai/voxtral-mini-3b-2507Transcriptions APITranscription
mistralai/voxtral-mini-3b-2507
PricingPer minute
groq/mixtral-8x7b-32768
Context8K
Input / 1M$0.24023.84
Output / 1M$0.24023.84
moonshotai/kimi-k2
Context131K
Input / 1M$0.57056.62
Cached / 1M$0.0575.66
Output / 1M$2.30228.46
moonshotai/kimi-k2-0905
Context262K
Input / 1M$0.60059.60
Cached / 1M$0.0605.96
Output / 1M$2.50248.33
moonshotai/kimi-k2-thinking
Context262K
Input / 1M$0.60059.60
Cached / 1M$0.0605.96
Output / 1M$2.50248.33
MoonshotAI: Kimi K2.5Chat APIVision
moonshotai/kimi-k2.5
Context262K
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$1.90188.73
MoonshotAI: Kimi K2.6Chat APIVision
moonshotai/kimi-k2.6
Context262K
Input / 1M$0.73072.51
Cached / 1M$0.0737.25
Output / 1M$3.49346.67
moonshotai/kimi-k2.7-code
Context262K
Input / 1M$0.95094.37
Output / 1M$4.00397.33
morph/morph-v3-fast
Context82K
Input / 1M$0.80079.47
Cached / 1M$0.0807.95
Output / 1M$1.20119.20
morph/morph-v3-large
Context262K
Input / 1M$0.90089.40
Cached / 1M$0.0908.94
Output / 1M$1.90188.73
MythoMax 13BChat API
gryphe/mythomax-l2-13b
Context4K
Input / 1M$0.0605.96
Cached / 1M$0.00600.60
Output / 1M$0.0605.96
nex-agi/deepseek-v3.1-nex-n1
Context131K
Input / 1M$0.13513.41
Cached / 1M$0.0131.34
Output / 1M$0.50049.67
Nex AGI: Nex-N2-ProChat APIVision
nex-agi/nex-n2-pro
Context262K
Input / 1M$0.25024.83
Output / 1M$1.0099.33
nousresearch/hermes-3-llama-3.1-405b
Context131K
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$1.0099.33
nousresearch/hermes-3-llama-3.1-70b
Context131K
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$0.30029.80
nousresearch/hermes-4-405b
Context131K
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$3.00298.00
nousresearch/hermes-4-70b
Context131K
Input / 1M$0.13012.91
Cached / 1M$0.0131.29
Output / 1M$0.40039.73
nousresearch/hermes-2-pro-llama-3-8b
Context8K
Input / 1M$0.14013.91
Cached / 1M$0.0141.39
Output / 1M$0.14013.91
nvidia/llama-3.3-nemotron-super-49b-v1.5
Context131K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.40039.73
nvidia/nemotron-3-nano-30b-a3b
Context262K
Input / 1M$0.0504.97
Cached / 1M$0.00500.50
Output / 1M$0.20019.87
nvidia/nemotron-3-super-120b-a12b
Context1.0M
Input / 1M$0.1009.93
Cached / 1M$0.00900.89
Output / 1M$0.50049.67
nvidia/nemotron-3-ultra-550b-a55b
Context1.0M
Input / 1M$0.50049.67
Output / 1M$2.50248.33
nvidia/nemotron-nano-9b-v2
Context131K
Input / 1M$0.0403.97
Cached / 1M$0.00400.40
Output / 1M$0.16015.89
nvidia/nemotron-3-nano-omni-30b-a3b
Context262K
Input / 1M$0.20019.87
Cached / 1M$0.0201.99
Output / 1M$0.80079.47
nvidia/nemotron-nano-12b-v2
Context131K
Input / 1M$0.0605.96
Cached / 1M$0.00600.60
Output / 1M$0.0605.96
o1Chat APIResponses API
o1
Context8K
Input / 1M$15.001489.98
Output / 1M$60.005959.93
o1-miniChat APIResponses API
o1-mini
Context8K
Input / 1M$1.10109.27
Output / 1M$4.40437.06
o1-miniChat APIResponses API
openai/o1-mini
Context8K
Input / 1M$1.10109.27
Output / 1M$4.40437.06
o3Chat APIResponses API
o3
Context8K
Input / 1M$2.00198.66
Output / 1M$8.00794.66
o3-deep-researchChat APIResponses API
o3-deep-research
Context8K
Input / 1M$10.00993.32
Output / 1M$40.003973.28
o3-miniChat APIResponses API
o3-mini
Context8K
Input / 1M$1.10109.27
Output / 1M$4.40437.06
o4-miniChat APIResponses API
o4-mini
Context8K
Input / 1M$1.10109.27
Output / 1M$4.40437.06
o4-mini-deep-researchChat APIResponses API
o4-mini-deep-research
Context8K
Input / 1M$2.00198.66
Output / 1M$8.00794.66
OpenAI: GPT AudioTranscriptions APITranscription
openai/gpt-audio
Context128K
PricingPer minute
OpenAI: GPT Audio MiniTranscriptions APITranscription
openai/gpt-audio-mini
Context128K
PricingPer minute
OpenAI: GPT Chat LatestChat APIResponses APIVision
openai/gpt-chat-latest
Context400K
Input / 1M$5.00496.66
Cached / 1M$0.50049.67
Output / 1M$30.002979.96
OpenAI: GPT-3.5 TurboChat APIResponses API
openai/gpt-3.5-turbo
Context16K
Input / 1M$0.50049.67
Cached / 1M$0.0504.97
Output / 1M$1.50149.00
openai/gpt-3.5-turbo-0613
Context4K
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$2.00198.66
OpenAI: GPT-3.5 Turbo 16kChat APIResponses API
openai/gpt-3.5-turbo-16k
Context16K
Input / 1M$3.00298.00
Cached / 1M$0.30029.80
Output / 1M$4.00397.33
OpenAI: GPT-3.5 Turbo InstructChat APIResponses API
openai/gpt-3.5-turbo-instruct
Context4K
Input / 1M$1.50149.00
Cached / 1M$0.15014.90
Output / 1M$2.00198.66
OpenAI: GPT-4Chat APIResponses API
openai/gpt-4
Context8K
Input / 1M$30.002979.96
Cached / 1M$3.00298.00
Output / 1M$60.005959.93
OpenAI: GPT-4 TurboChat APIResponses APIVision
openai/gpt-4-turbo
Context128K
Input / 1M$10.00993.32
Cached / 1M$1.0099.33
Output / 1M$30.002979.96
openai/gpt-4-1106-preview
Context128K
Input / 1M$10.00993.32
Cached / 1M$1.0099.33
Output / 1M$30.002979.96
OpenAI: GPT-4 Turbo PreviewChat APIResponses API
openai/gpt-4-turbo-preview
Context128K
Input / 1M$10.00993.32
Cached / 1M$1.0099.33
Output / 1M$30.002979.96
OpenAI: GPT-4.1Chat APIResponses APIVision
openai/gpt-4.1
Context1.0M
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$8.00794.66
OpenAI: GPT-4.1 MiniChat APIResponses APIVision
openai/gpt-4.1-mini
Context1.0M
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$1.60158.93
OpenAI: GPT-4.1 NanoChat APIResponses APIVision
openai/gpt-4.1-nano
Context1.0M
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.40039.73
OpenAI: GPT-4oChat APIResponses APIVision
openai/gpt-4o
Context128K
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$10.00993.32
OpenAI: GPT-4o (2024-05-13)Chat APIResponses APIVision
openai/gpt-4o-2024-05-13
Context128K
Input / 1M$5.00496.66
Cached / 1M$0.50049.67
Output / 1M$15.001489.98
OpenAI: GPT-4o (2024-08-06)Chat APIResponses APIVision
openai/gpt-4o-2024-08-06
Context128K
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$10.00993.32
OpenAI: GPT-4o (2024-11-20)Chat APIResponses APIVision
openai/gpt-4o-2024-11-20
Context128K
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$10.00993.32
OpenAI: GPT-4o Search PreviewChat APIResponses API
openai/gpt-4o-search-preview
Context128K
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$10.00993.32
OpenAI: GPT-4o-miniChat APIResponses APIVision
openai/gpt-4o-mini
Context128K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.60059.60
OpenAI: GPT-4o-mini (2024-07-18)Chat APIResponses APIVision
openai/gpt-4o-mini-2024-07-18
Context128K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.60059.60
openai/gpt-4o-mini-search-preview
Context128K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.60059.60
OpenAI: GPT-5Chat APIResponses APIVision
openai/gpt-5
Context400K
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$10.00993.32
OpenAI: GPT-5 ChatChat APIResponses APIVision
openai/gpt-5-chat
Context128K
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$10.00993.32
OpenAI: GPT-5 CodexChat APIResponses APIVision
openai/gpt-5-codex
Context400K
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$10.00993.32
OpenAI: GPT-5 MiniChat APIResponses APIVision
openai/gpt-5-mini
Context400K
Input / 1M$0.25024.83
Cached / 1M$0.0252.48
Output / 1M$2.00198.66
OpenAI: GPT-5 NanoChat APIResponses APIVision
openai/gpt-5-nano
Context400K
Input / 1M$0.0504.97
Cached / 1M$0.00500.50
Output / 1M$0.40039.73
OpenAI: GPT-5 ProChat APIResponses APIVision
openai/gpt-5-pro
Context400K
Input / 1M$15.001489.98
Cached / 1M$1.50149.00
Output / 1M$120.0011919.85
OpenAI: GPT-5.1Chat APIResponses APIVision
openai/gpt-5.1
Context400K
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$10.00993.32
OpenAI: GPT-5.1 ChatChat APIResponses APIVision
openai/gpt-5.1-chat
Context128K
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$10.00993.32
OpenAI: GPT-5.1-CodexChat APIResponses APIVision
openai/gpt-5.1-codex
Context400K
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$10.00993.32
OpenAI: GPT-5.1-Codex-MaxChat APIResponses APIVision
openai/gpt-5.1-codex-max
Context400K
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$10.00993.32
OpenAI: GPT-5.1-Codex-MiniChat APIResponses APIVision
openai/gpt-5.1-codex-mini
Context400K
Input / 1M$0.25024.83
Cached / 1M$0.0252.48
Output / 1M$2.00198.66
OpenAI: GPT-5.2Chat APIResponses APIVision
openai/gpt-5.2
Context400K
Input / 1M$1.75173.83
Cached / 1M$0.17517.38
Output / 1M$14.001390.65
OpenAI: GPT-5.2 ChatChat APIResponses APIVision
openai/gpt-5.2-chat
Context128K
Input / 1M$1.75173.83
Cached / 1M$0.17517.38
Output / 1M$14.001390.65
OpenAI: GPT-5.2 ProChat APIResponses APIVision
openai/gpt-5.2-pro
Context400K
Input / 1M$21.002085.97
Cached / 1M$2.10208.60
Output / 1M$168.0016687.79
OpenAI: GPT-5.2-CodexChat APIResponses APIVision
openai/gpt-5.2-codex
Context400K
Input / 1M$1.75173.83
Cached / 1M$0.17517.38
Output / 1M$14.001390.65
OpenAI: GPT-5.3 ChatChat APIResponses APIVision
openai/gpt-5.3-chat
Context128K
Input / 1M$1.75173.83
Cached / 1M$0.17517.38
Output / 1M$14.001390.65
OpenAI: GPT-5.3-CodexChat APIResponses APIVision
openai/gpt-5.3-codex
Context400K
Input / 1M$1.75173.83
Cached / 1M$0.17517.38
Output / 1M$14.001390.65
OpenAI: GPT-5.4Chat APIResponses APIVision
openai/gpt-5.4
Context1.1M
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$15.001489.98
OpenAI: GPT-5.4 MiniChat APIResponses APIVision
openai/gpt-5.4-mini
Context400K
Input / 1M$0.75074.50
Cached / 1M$0.0757.45
Output / 1M$4.50446.99
OpenAI: GPT-5.4 NanoChat APIResponses APIVision
openai/gpt-5.4-nano
Context400K
Input / 1M$0.20019.87
Cached / 1M$0.0201.99
Output / 1M$1.25124.17
OpenAI: GPT-5.4 ProChat APIResponses APIVision
openai/gpt-5.4-pro
Context1.1M
Input / 1M$30.002979.96
Cached / 1M$3.00298.00
Output / 1M$180.0017879.78
OpenAI: GPT-5.5Chat APIResponses APIVision
openai/gpt-5.5
Context1.1M
Input / 1M$5.00496.66
Cached / 1M$0.50049.67
Output / 1M$30.002979.96
OpenAI: GPT-5.5 ProChat APIResponses APIVision
openai/gpt-5.5-pro
Context1.1M
Input / 1M$30.002979.96
Cached / 1M$3.00298.00
Output / 1M$180.0017879.78
openai/gpt-oss-120b
Context131K
Input / 1M$0.0393.87
Cached / 1M$0.00390.39
Output / 1M$0.18017.88
openai/gpt-oss-20b
Context131K
Input / 1M$0.0302.98
Cached / 1M$0.00300.30
Output / 1M$0.14013.91
OpenAI: gpt-oss-safeguard-20bChat APIResponses API
openai/gpt-oss-safeguard-20b
Context131K
Input / 1M$0.0757.45
Cached / 1M$0.00750.74
Output / 1M$0.30029.80
OpenAI: o1Chat APIResponses APIVision
openai/o1
Context200K
Input / 1M$15.001489.98
Cached / 1M$1.50149.00
Output / 1M$60.005959.93
OpenAI: o1-proChat APIResponses APIVision
openai/o1-pro
Context200K
Input / 1M$150.0014899.82
Cached / 1M$15.001489.98
Output / 1M$600.0059599.26
OpenAI: o3Chat APIResponses APIVision
openai/o3
Context200K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$8.00794.66
OpenAI: o3 Deep ResearchChat APIResponses APIVision
openai/o3-deep-research
Context200K
Input / 1M$10.00993.32
Cached / 1M$1.0099.33
Output / 1M$40.003973.28
OpenAI: o3 MiniChat APIResponses API
openai/o3-mini
Context200K
Input / 1M$1.10109.27
Cached / 1M$0.11010.93
Output / 1M$4.40437.06
OpenAI: o3 Mini HighChat APIResponses API
openai/o3-mini-high
Context200K
Input / 1M$1.10109.27
Cached / 1M$0.11010.93
Output / 1M$4.40437.06
OpenAI: o3 ProChat APIResponses APIVision
openai/o3-pro
Context200K
Input / 1M$20.001986.64
Cached / 1M$2.00198.66
Output / 1M$80.007946.57
OpenAI: o4 MiniChat APIResponses APIVision
openai/o4-mini
Context200K
Input / 1M$1.10109.27
Cached / 1M$0.11010.93
Output / 1M$4.40437.06
OpenAI: o4 Mini Deep ResearchChat APIResponses APIVision
openai/o4-mini-deep-research
Context200K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$8.00794.66
OpenAI: o4 Mini HighChat APIResponses APIVision
openai/o4-mini-high
Context200K
Input / 1M$1.10109.27
Cached / 1M$0.11010.93
Output / 1M$4.40437.06
OpenAI: TTS-1Transcriptions APITranscription
openai/tts-1
PricingPer minute
OpenAI: TTS-1 HDTranscriptions APITranscription
openai/tts-1-hd
PricingPer minute
OpenAI: Whisper-1Transcriptions APITranscription
openai/whisper-1
PricingPer minute
openai/gpt-4-0314Chat APIResponses API
openai/gpt-4-0314
Context8K
Input / 1M$30.002979.96
Cached / 1M$3.00298.00
Output / 1M$60.005959.93
openai/gpt-4o-audio-previewTranscriptions APITranscription
openai/gpt-4o-audio-preview
Context128K
PricingPer minute
perceptron/perceptron-mk1
Context33K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$1.50149.00
Perplexity: SonarChat APIVision
perplexity/sonar
Context127K
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$1.0099.33
perplexity/sonar-deep-research
Context128K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$8.00794.66
Perplexity: Sonar ProChat APIVision
perplexity/sonar-pro
Context200K
Input / 1M$3.00298.00
Cached / 1M$0.30029.80
Output / 1M$15.001489.98
perplexity/sonar-reasoning-pro
Context128K
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$8.00794.66
poolside/laguna-m.1
Context262K
Input / 1M$0.20019.87
Output / 1M$0.40039.73
poolside/laguna-xs.2
Context262K
Input / 1M$0.1009.93
Output / 1M$0.20019.87
prime-intellect/intellect-3
Context131K
Input / 1M$0.20019.87
Cached / 1M$0.0201.99
Output / 1M$1.10109.27
Qwen LatestChat APILatest
qwen/qwen-latest
Context131K
Input / 1M$0.45645.30
Cached / 1M$0.0454.52
Output / 1M$1.82180.78
qwen-qwq-32bChat API
groq/qwen-qwq-32b
Context8K
Input / 1M$0.29028.81
Output / 1M$0.39038.74
qwen/qwen-plus-2025-07-28
Context1.0M
Input / 1M$0.26025.83
Cached / 1M$0.0262.58
Output / 1M$0.78077.48
qwen/qwen-plus-2025-07-28:thinking
Context1.0M
Input / 1M$0.26025.83
Output / 1M$0.78077.48
qwen/qwen-plus
Context1.0M
Input / 1M$0.26025.83
Cached / 1M$0.0262.58
Output / 1M$0.78077.48
qwen/qwen-2.5-7b-instruct
Context131K
Input / 1M$0.0403.97
Cached / 1M$0.00400.40
Output / 1M$0.1009.93
qwen/qwen2.5-vl-72b-instruct
Context131K
Input / 1M$0.25024.83
Cached / 1M$0.0252.48
Output / 1M$0.75074.50
qwen/qwen3-14b
Context132K
Input / 1M$0.12011.92
Cached / 1M$0.0100.99
Output / 1M$0.24023.84
qwen/qwen3-235b-a22b
Context131K
Input / 1M$0.45645.30
Cached / 1M$0.0454.52
Output / 1M$1.82180.78
qwen/qwen3-235b-a22b-2507
Context262K
Input / 1M$0.0908.94
Cached / 1M$0.00710.71
Output / 1M$0.1009.93
qwen/qwen3-235b-a22b-thinking-2507
Context262K
Input / 1M$0.23022.85
Cached / 1M$0.0151.49
Output / 1M$2.30228.46
qwen/qwen3-30b-a3b
Context131K
Input / 1M$0.12011.92
Cached / 1M$0.00900.89
Output / 1M$0.50049.67
qwen/qwen3-30b-a3b-instruct-2507
Context131K
Input / 1M$0.0908.94
Cached / 1M$0.00900.89
Output / 1M$0.30029.80
qwen/qwen3-30b-a3b-thinking-2507
Context131K
Input / 1M$0.0807.95
Cached / 1M$0.00800.79
Output / 1M$0.40039.73
qwen/qwen3-32b
Context131K
Input / 1M$0.0807.95
Cached / 1M$0.00800.79
Output / 1M$0.28027.81
qwen/qwen3-8b
Context131K
Input / 1M$0.0504.97
Cached / 1M$0.00500.50
Output / 1M$0.40039.73
qwen/qwen3-coder-30b-a3b-instruct
Context160K
Input / 1M$0.0706.95
Cached / 1M$0.00700.70
Output / 1M$0.27026.82
qwen/qwen3-coder
Context1.0M
Input / 1M$0.22021.85
Cached / 1M$0.0222.19
Output / 1M$1.80178.80
qwen/qwen3-coder-flash
Context1.0M
Input / 1M$0.19519.37
Cached / 1M$0.0191.94
Output / 1M$0.97596.85
qwen/qwen3-coder-next
Context262K
Input / 1M$0.11010.93
Cached / 1M$0.0111.09
Output / 1M$0.80079.47
qwen/qwen3-coder-plus
Context1.0M
Input / 1M$0.65064.57
Cached / 1M$0.0656.46
Output / 1M$3.25322.83
qwen/qwen3-max
Context262K
Input / 1M$0.78077.48
Cached / 1M$0.0787.75
Output / 1M$3.90387.40
qwen/qwen3-max-thinking
Context262K
Input / 1M$0.78077.48
Cached / 1M$0.0787.75
Output / 1M$3.90387.40
qwen/qwen3-next-80b-a3b-instruct
Context262K
Input / 1M$0.0908.94
Cached / 1M$0.00900.89
Output / 1M$1.10109.27
qwen/qwen3-next-80b-a3b-thinking
Context262K
Input / 1M$0.0989.68
Cached / 1M$0.00970.97
Output / 1M$0.78077.48
qwen/qwen3-vl-235b-a22b-instruct
Context262K
Input / 1M$0.20019.87
Cached / 1M$0.0201.99
Output / 1M$0.88087.41
qwen/qwen3-vl-235b-a22b-thinking
Context131K
Input / 1M$0.26025.83
Cached / 1M$0.0262.58
Output / 1M$2.60258.26
qwen/qwen3-vl-30b-a3b-instruct
Context262K
Input / 1M$0.15014.90
Cached / 1M$0.0131.29
Output / 1M$0.60059.60
qwen/qwen3-vl-30b-a3b-thinking
Context131K
Input / 1M$0.13012.91
Cached / 1M$0.0131.29
Output / 1M$1.56154.96
qwen/qwen3-vl-32b-instruct
Context262K
Input / 1M$0.10510.43
Cached / 1M$0.0101.03
Output / 1M$0.42041.72
qwen/qwen3-vl-8b-instruct
Context256K
Input / 1M$0.0807.95
Cached / 1M$0.00800.79
Output / 1M$0.50049.67
qwen/qwen3-vl-8b-thinking
Context256K
Input / 1M$0.11711.62
Cached / 1M$0.0121.16
Output / 1M$1.36135.59
qwen/qwen3.5-397b-a17b
Context256K
Input / 1M$0.39038.74
Cached / 1M$0.0393.87
Output / 1M$2.34232.44
qwen/qwen3.5-plus-02-15
Context1.0M
Input / 1M$0.26025.83
Cached / 1M$0.0262.58
Output / 1M$1.56154.96
qwen/qwen3.5-plus-20260420
Context1.0M
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$1.80178.80
qwen/qwen3.5-122b-a10b
Context262K
Input / 1M$0.26025.83
Cached / 1M$0.0262.58
Output / 1M$2.08206.61
Qwen: Qwen3.5-27BChat APIVision
qwen/qwen3.5-27b
Context262K
Input / 1M$0.19519.37
Cached / 1M$0.0191.94
Output / 1M$1.56154.96
Qwen: Qwen3.5-35B-A3BChat APIVision
qwen/qwen3.5-35b-a3b
Context262K
Input / 1M$0.13913.81
Cached / 1M$0.0141.38
Output / 1M$1.0099.33
Qwen: Qwen3.5-9BChat APIVision
qwen/qwen3.5-9b
Context262K
Input / 1M$0.0403.97
Cached / 1M$0.00400.40
Output / 1M$0.15014.90
Qwen: Qwen3.5-FlashChat APIVision
qwen/qwen3.5-flash-02-23
Context1.0M
Input / 1M$0.0656.46
Cached / 1M$0.00650.65
Output / 1M$0.26025.83
Qwen: Qwen3.6 27BChat APIVision
qwen/qwen3.6-27b
Context262K
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$3.20317.86
Qwen: Qwen3.6 35B A3BChat APIVision
qwen/qwen3.6-35b-a3b
Context262K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$1.0099.33
Qwen: Qwen3.6 FlashChat APIVision
qwen/qwen3.6-flash
Context1.0M
Input / 1M$0.18818.62
Cached / 1M$0.0191.86
Output / 1M$1.13111.75
qwen/qwen3.6-max-preview
Context262K
Input / 1M$1.04103.31
Cached / 1M$0.10410.33
Output / 1M$6.24619.83
Qwen: Qwen3.6 PlusChat APIVision
qwen/qwen3.6-plus
Context1.0M
Input / 1M$0.32532.28
Cached / 1M$0.0333.23
Output / 1M$1.95193.70
qwen/qwen3.7-max
Context1.0M
Input / 1M$2.50248.33
Cached / 1M$0.25024.83
Output / 1M$7.50744.99
Qwen: Qwen3.7 PlusChat APIVision
qwen/qwen3.7-plus
Context1.0M
Input / 1M$0.40039.73
Output / 1M$1.60158.93
qwen/qwen2.5-72b-instruct
Context33K
Input / 1M$0.36035.76
Cached / 1M$0.0363.58
Output / 1M$0.40039.73
qwen/qwen-2.5-72b-instruct
Context131K
Input / 1M$0.36035.76
Cached / 1M$0.0363.58
Output / 1M$0.40039.73
qwen/qwen-2.5-coder-32b-instruct
Context128K
Input / 1M$0.66065.56
Cached / 1M$0.0666.56
Output / 1M$1.0099.33
Reka EdgeChat APIVision
rekaai/reka-edge
Context16K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.1009.93
Reka Flash 3Chat API
rekaai/reka-flash-3
Context66K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.20019.87
relace/relace-apply-3
Context256K
Input / 1M$0.85084.43
Cached / 1M$0.0858.44
Output / 1M$1.25124.17
relace/relace-search
Context256K
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$3.00298.00
undi95/remm-slerp-l2-13b
Context6K
Input / 1M$0.45044.70
Cached / 1M$0.0454.47
Output / 1M$0.65064.57
Sakana: Fugu UltraChat APIVision
sakana/fugu-ultra
Context1.0M
Input / 1M$5.00496.66
Output / 1M$30.002979.96
sao10k/l3-lunaris-8b
Context8K
Input / 1M$0.0403.97
Cached / 1M$0.00400.40
Output / 1M$0.0504.97
sao10k/l3.1-70b-hanami-x1
Context16K
Input / 1M$3.00298.00
Cached / 1M$0.30029.80
Output / 1M$3.00298.00
sao10k/l3.1-euryale-70b
Context131K
Input / 1M$0.85084.43
Cached / 1M$0.0858.44
Output / 1M$0.85084.43
sao10k/l3.3-euryale-70b
Context131K
Input / 1M$0.65064.57
Cached / 1M$0.0656.46
Output / 1M$0.75074.50
sao10k/l3-euryale-70b
Context8K
Input / 1M$1.48147.01
Cached / 1M$0.14814.70
Output / 1M$1.48147.01
sarvam/bulbul-v2Speech APISpeech
sarvam/bulbul-v2
PricingPer minute
sarvam/bulbul-v3Speech APISpeech
sarvam/bulbul-v3
PricingPer minute
sarvam/saarika-v2Transcriptions APITranscription
sarvam/saarika-v2
PricingPer minute
sarvam/saarika-v2.5Transcriptions APITranscription
sarvam/saarika-v2.5
PricingPer minute
sentence-transformers/all-minilm-l12-v2
Context512
Input / 1M$0.00500.50
sentence-transformers/all-minilm-l6-v2
Context512
Input / 1M$0.00500.50
sentence-transformers/all-mpnet-base-v2
Context512
Input / 1M$0.00500.50
sentence-transformers/paraphrase-minilm-l6-v2
Context512
Input / 1M$0.00500.50
stepfun/step-3.5-flash
Context262K
Input / 1M$0.0908.94
Cached / 1M$0.00900.89
Output / 1M$0.30029.80
stepfun/step-3.7-flash
Context256K
Input / 1M$0.20019.87
Output / 1M$1.15114.23
switchpoint/router
Context131K
Input / 1M$0.85084.43
Cached / 1M$0.0858.44
Output / 1M$3.40337.73
tencent/hunyuan-a13b-instruct
Context131K
Input / 1M$0.14013.91
Cached / 1M$0.0141.39
Output / 1M$0.57056.62
tencent/hy3-preview
Context262K
Input / 1M$0.0666.56
Cached / 1M$0.00660.66
Output / 1M$0.26025.83
text-embedding-004Embeddings APIEmbedding
google/text-embedding-004
Context2K
Input / 1M$0.0252.48
text-embedding-3-largeEmbeddings APIEmbedding
openai/text-embedding-3-large
Context8K
Input / 1M$0.13012.91
text-embedding-3-largeEmbeddings APIEmbedding
text-embedding-3-large
Context8K
Input / 1M$0.13012.91
text-embedding-3-smallEmbeddings APIEmbedding
openai/text-embedding-3-small
Context8K
Input / 1M$0.0201.99
text-embedding-3-smallEmbeddings APIEmbedding
text-embedding-3-small
Context8K
Input / 1M$0.0201.99
text-embedding-ada-002Embeddings APIEmbedding
openai/text-embedding-ada-002
Context8K
Input / 1M$0.1009.93
text-embedding-ada-002Embeddings APIEmbedding
text-embedding-ada-002
Context8K
Input / 1M$0.1009.93
thedrummer/cydonia-24b-v4.1
Context131K
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$0.50049.67
thedrummer/rocinante-12b
Context33K
Input / 1M$0.17016.89
Cached / 1M$0.0171.69
Output / 1M$0.43042.71
thedrummer/skyfall-36b-v2
Context33K
Input / 1M$0.55054.63
Cached / 1M$0.0555.46
Output / 1M$0.80079.47
thedrummer/unslopnemo-12b
Context33K
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$0.40039.73
thenlper/gte-baseEmbeddings APIEmbedding
thenlper/gte-base
Context512
Input / 1M$0.00500.50
thenlper/gte-largeEmbeddings APIEmbedding
thenlper/gte-large
Context512
Input / 1M$0.0100.99
upstage/solar-pro-3
Context128K
Input / 1M$0.15014.90
Cached / 1M$0.0151.49
Output / 1M$0.60059.60
microsoft/wizardlm-2-8x22b
Context66K
Input / 1M$0.62061.59
Cached / 1M$0.0626.16
Output / 1M$0.62061.59
writer/palmyra-x5
Context1.0M
Input / 1M$0.60059.60
Cached / 1M$0.0605.96
Output / 1M$6.00595.99
xAI: Grok 4.20Chat APIVision
x-ai/grok-4.20
Context2.0M
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$2.50248.33
x-ai/grok-4.20-multi-agent
Context2.0M
Input / 1M$2.00198.66
Cached / 1M$0.20019.87
Output / 1M$6.00595.99
xAI: Grok 4.3Chat APIVision
x-ai/grok-4.3
Context1.0M
Input / 1M$1.25124.17
Cached / 1M$0.12512.42
Output / 1M$2.50248.33
xAI: Grok Build 0.1Chat APIVision
x-ai/grok-build-0.1
Context256K
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$2.00198.66
xiaomi/mimo-v2-flash
Context262K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.30029.80
Xiaomi: MiMo-V2.5Transcriptions APIVisionTranscription
xiaomi/mimo-v2.5
Context1.0M
PricingPer minute
xiaomi/mimo-v2.5-pro
Context1.0M
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$3.00298.00
xiaomi/mimo-v2-omniTranscriptions APIVisionTranscription
xiaomi/mimo-v2-omni
Context262K
PricingPer minute
xiaomi/mimo-v2-pro
Context1.0M
Input / 1M$1.0099.33
Cached / 1M$0.1009.93
Output / 1M$3.00298.00
z-ai/glm-4-32b
Context128K
Input / 1M$0.1009.93
Cached / 1M$0.0100.99
Output / 1M$0.1009.93
z-ai/glm-4.5
Context131K
Input / 1M$0.60059.60
Cached / 1M$0.0605.96
Output / 1M$2.20218.53
z-ai/glm-4.5-air
Context131K
Input / 1M$0.13012.91
Cached / 1M$0.0131.29
Output / 1M$0.85084.43
Z.ai: GLM 4.5VChat APIVision
z-ai/glm-4.5v
Context66K
Input / 1M$0.60059.60
Cached / 1M$0.0605.96
Output / 1M$1.80178.80
z-ai/glm-4.6
Context203K
Input / 1M$0.43042.71
Cached / 1M$0.0434.27
Output / 1M$1.74172.84
Z.ai: GLM 4.6VChat APIVision
z-ai/glm-4.6v
Context131K
Input / 1M$0.30029.80
Cached / 1M$0.0302.98
Output / 1M$0.90089.40
z-ai/glm-4.7
Context203K
Input / 1M$0.40039.73
Cached / 1M$0.0403.97
Output / 1M$1.75173.83
z-ai/glm-4.7-flash
Context203K
Input / 1M$0.0605.96
Cached / 1M$0.00600.60
Output / 1M$0.40039.73
Z.ai: GLM 5Chat API
z-ai/glm-5
Context203K
Input / 1M$0.60059.60
Cached / 1M$0.0605.96
Output / 1M$1.92190.72
z-ai/glm-5-turbo
Context262K
Input / 1M$1.20119.20
Cached / 1M$0.12011.92
Output / 1M$4.00397.33
z-ai/glm-5.1
Context203K
Input / 1M$0.98097.35
Cached / 1M$0.0989.73
Output / 1M$3.08305.94
z-ai/glm-5.2
Context1.0M
Input / 1M$1.40139.06
Output / 1M$4.40437.06
Z.ai: GLM 5V TurboChat APIVision
z-ai/glm-5v-turbo
Context203K
Input / 1M$1.20119.20
Cached / 1M$0.12011.92
Output / 1M$4.00397.33