A Gemini 2.5 Flash model optimized for cost efficiency and low latency.
The company that provides the model
The maximum number of tokens you can send in a single prompt (the context window)
The maximum number of tokens a model can generate in one request
The cost of prompt tokens sent to the model
The cost of output tokens generated by the model
The date after which the model has no built-in knowledge of events (knowledge cutoff)
When the model was launched
Capability for the model to use external tools
Ability to process and analyze visual inputs, like images
Support for multiple languages
Whether the model supports fine-tuning on custom datasets
Gemini 2.5 Flash Lite is Google’s cost- and latency-optimized version of the hybrid reasoning Gemini 2.5 Flash model, letting you balance speed, quality, and expense.
It’s free to use while in the experimental stage. On the paid tier, pricing is $0.10 per million input tokens and $0.40 per million output tokens.
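As a rough illustration of those paid-tier rates, the Python sketch below estimates the cost of one request; the token counts in the example are hypothetical.

```python
# Rough cost estimate at the paid-tier rates quoted above
# ($0.10 per 1M input tokens, $0.40 per 1M output tokens).
INPUT_PRICE_PER_M = 0.10
OUTPUT_PRICE_PER_M = 0.40

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single request."""
    return (input_tokens / 1_000_000) * INPUT_PRICE_PER_M + \
           (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M

# Hypothetical example: a 200k-token prompt with an 8k-token response.
print(f"${request_cost(200_000, 8_000):.4f}")  # -> $0.0232
```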
It supports a context window of up to 1,048,576 tokens (roughly 1M), making it well suited to very large or complex inputs.
It can generate up to 65,536 tokens in a single response.
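As a minimal sketch of putting those limits to work, the example below assumes the google-genai Python SDK and the model ID gemini-2.5-flash-lite; check the Gemini API docs for the current identifier.

```python
# Minimal sketch using the google-genai Python SDK (assumed); the model ID
# "gemini-2.5-flash-lite" is an assumption -- confirm it in the Gemini API docs.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

response = client.models.generate_content(
    model="gemini-2.5-flash-lite",
    contents="Summarize this 500-page report: ...",  # large prompts fit in the ~1M-token window
    config=types.GenerateContentConfig(
        max_output_tokens=65536,  # cap the response at the model's stated output limit
    ),
)
print(response.text)
```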
Gemini 2.5 Flash Lite launched on June 17, 2025.
Its knowledge cut-off date is January 1, 2025.
Yes. It can process and analyze visual inputs like images.
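A minimal sketch of passing an image alongside text, again assuming the google-genai Python SDK and the same model ID; the local file name is hypothetical.

```python
# Sketch of sending an image plus a text question, assuming the google-genai SDK;
# the model ID and the local file "chart.png" are illustrative assumptions.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")

with open("chart.png", "rb") as f:  # hypothetical local image
    image_bytes = f.read()

response = client.models.generate_content(
    model="gemini-2.5-flash-lite",
    contents=[
        types.Part.from_bytes(data=image_bytes, mime_type="image/png"),
        "What trend does this chart show?",
    ],
)
print(response.text)
```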
Yes. It supports function-calling to integrate with external tools.
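A minimal sketch of function calling using the google-genai SDK's automatic tool-calling support; get_weather is a hypothetical stub, and the model ID is assumed as above.

```python
# Sketch of function calling via the google-genai SDK (assumed); get_weather
# is a stubbed local function used only for illustration.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")

def get_weather(city: str) -> str:
    """Return the current weather for a city (stubbed for illustration)."""
    return f"It is sunny and 22 C in {city}."

response = client.models.generate_content(
    model="gemini-2.5-flash-lite",
    contents="What's the weather in Paris right now?",
    config=types.GenerateContentConfig(tools=[get_weather]),
)
print(response.text)  # the SDK calls get_weather and feeds the result back to the model
```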
Yes. It handles multiple languages for both input and output.
No. Fine-tuning is not supported for the Flash Lite variant.
See the Gemini API docs for details.