Google Models with INR Pricing

gemini-2.0-flashGoogle

google/gemini-2.0-flash

Chat API

Context

8K

Input

Input from ₹9.92/1M

Output

Output ₹39.66/1M

View details

google/gemini-2.0-flash-001Google

google/gemini-2.0-flash-001

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹0.99/1M

View details

gemini-2.0-flash-liteGoogle

google/gemini-2.0-flash-lite

Chat API

Context

8K

Input

Input from ₹7.44/1M

Output

Output ₹29.75/1M

View details

google/gemini-2.0-flash-lite-001Google

google/gemini-2.0-flash-lite-001

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹0.74/1M

View details

Google: Gemini 2.5 FlashGoogle

google/gemini-2.5-flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹2.97/1M

View details

Google: Nano Banana (Gemini 2.5 Flash Image)Google

google/gemini-2.5-flash-image

Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state of the art image generation model with contextual understanding. It is capable of image generation,...

Chat APIVisionImage Output via Chat API

Context

33K

Input

Per image pricing

Cached input

Cached ₹2.97/1M

View details

Google: Gemini 2.5 Flash LiteGoogle

google/gemini-2.5-flash-lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹0.99/1M

View details

gemini-2.5-flash-lite-previewGoogle

google/gemini-2.5-flash-lite-preview

Chat API

Context

8K

Input

Input from ₹9.92/1M

Output

Output ₹39.66/1M

View details

Google: Gemini 2.5 Flash Lite Preview 09-2025Google

google/gemini-2.5-flash-lite-preview-09-2025

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹0.99/1M

View details

gemini-2.5-flash-native-audio-preview-12-2025Google

google/gemini-2.5-flash-native-audio-preview-12-2025

Chat API

Context

8K

Input

Input from ₹49.58/1M

Output

Output ₹198.32/1M

View details

gemini-2.5-flash-previewGoogle

google/gemini-2.5-flash-preview

Chat API

Context

8K

Input

Input from ₹29.75/1M

Output

Output ₹247.90/1M

View details

Google: Gemini 2.5 ProGoogle

google/gemini-2.5-pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹12.39/1M

View details

Google: Gemini 2.5 Pro Preview 06-05Google

google/gemini-2.5-pro-preview

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹12.39/1M

View details

Google: Gemini 2.5 Pro Preview 05-06Google

google/gemini-2.5-pro-preview-05-06

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹12.39/1M

View details

Google: Gemini 3 Flash PreviewGoogle

google/gemini-3-flash-preview

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹4.96/1M

View details

Google: Nano Banana Pro (Gemini 3 Pro Image)Google

google/gemini-3-pro-image

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...

Chat APIVisionImage Output via Chat API

Context

66K

Input

Per image pricing

View details

Google: Nano Banana Pro (Gemini 3 Pro Image Preview)Google

google/gemini-3-pro-image-preview

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...

Chat APIVisionImage Output via Chat API

Context

66K

Input

Per image pricing

Cached input

Cached ₹19.83/1M

View details

Google: Nano Banana 2 (Gemini 3.1 Flash Image)Google

google/gemini-3.1-flash-image

Gemini 3.1 Flash Image, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines advanced...

Chat APIVisionImage Output via Chat API

Context

131K

Input

Per image pricing

View details

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)Google

google/gemini-3.1-flash-image-preview

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines...

Chat APIVisionImage Output via Chat API

Context

131K

Input

Per image pricing

Cached input

Cached ₹4.96/1M

View details

Google: Gemini 3.1 Flash LiteGoogle

google/gemini-3.1-flash-lite

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹2.48/1M

View details

Google: Gemini 3.1 Flash Lite PreviewGoogle

google/gemini-3.1-flash-lite-preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹2.48/1M

View details

gemini-3.1-flash-live-previewGoogle

google/gemini-3.1-flash-live-preview

Chat API

Context

8K

Input

Input from ₹74.37/1M

Output

Output ₹446.22/1M

View details

Google: Gemini 3.1 Pro PreviewGoogle

google/gemini-3.1-pro-preview

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹19.83/1M

View details

Google: Gemini 3.1 Pro Preview Custom ToolsGoogle

google/gemini-3.1-pro-preview-customtools

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹19.83/1M

View details

Google: Gemini 3.5 FlashGoogle

google/gemini-3.5-flash

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

Cached input

Cached ₹14.87/1M

View details

gemini-embedding-001Google

google/gemini-embedding-001

Embeddings APIEmbedding

Context

2K

Input

Input from ₹14.87/1M

View details

gemini-embedding-2-previewGoogle

google/gemini-embedding-2-preview

Embeddings APIEmbedding

Context

8K

Input

Input from ₹19.83/1M

View details

Google: Gemma 2 27BGoogle

google/gemma-2-27b-it

Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...

Chat API

Context

8K

Input

Input from ₹64.45/1M

Cached input

Cached ₹6.45/1M

Output

Output ₹64.45/1M

View details

Google: Gemma 3 12BGoogle

google/gemma-3-12b-it

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Chat APIVision

Context

131K

Input

Input from ₹4.96/1M

Cached input

Cached ₹0.40/1M

Output

Output ₹14.87/1M

View details

Google: Gemma 3 27BGoogle

google/gemma-3-27b-it

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Chat APIVision

Context

131K

Input

Input from ₹7.93/1M

Cached input

Cached ₹0.79/1M

Output

Output ₹15.87/1M

View details

Google: Gemma 3 4BGoogle

google/gemma-3-4b-it

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Chat APIVision

Context

131K

Input

Input from ₹4.96/1M

Cached input

Cached ₹0.40/1M

Output

Output ₹9.92/1M

View details

Google: Gemma 3n 4BGoogle

google/gemma-3n-e4b-it

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...

Chat API

Context

33K

Input

Input from ₹5.95/1M

Cached input

Cached ₹0.59/1M

Output

Output ₹11.90/1M

View details

Google: Gemma 4 26B A4B Google

google/gemma-4-26b-a4b-it

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...

Chat APIVision

Context

262K

Input

Input from ₹6.94/1M

Cached input

Cached ₹0.59/1M

Output

Output ₹33.71/1M

View details

Google: Gemma 4 31BGoogle

google/gemma-4-31b-it

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

Chat APIVision

Context

262K

Input

Input from ₹12.89/1M

Cached input

Cached ₹1.19/1M

Output

Output ₹37.68/1M

View details

google/gemma-4-31b-it-turboGoogle

google/gemma-4-31b-it-turbo

Chat API

Context

262K

Input

Input from ₹11.90/1M

Cached input

Cached ₹1.19/1M

Output

Output ₹36.69/1M

View details

Google: Lyria 3 Clip PreviewGoogle

google/lyria-3-clip-preview

30 second duration clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

View details

Google: Lyria 3 Pro PreviewGoogle

google/lyria-3-pro-preview

Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...

Transcriptions APIVisionTranscription

Context

1.0M

Input

Per minute pricing

View details

text-embedding-004Google

google/text-embedding-004

Embeddings APIEmbedding

Context

2K

Input