Skip to content
Models/google/Gemini 3.5 Flash

Gemini 3.5 Flash

gemini-3.5-flashText·

Google Gemini 3.5 Flash GA, multimodal, configurable thinking_level

Context Window
1.0M
Input Price /M in
Free/M in
Output Price /M out
Free/M out
Cached Input Price /M
Free/M
Max Completion
66K
Input Modalities
text, image, audio, video
Output Modalities
text
reasoningFunction callingChatVisionJSONStreamingNewFreefast

Description

Google Gemini 3.5 Flash GA, multimodal, configurable thinking_level

Available Providers

AllToken can route requests to the providers below based on route priority and policy.

ProviderContextInputOutputCached / MLatencyThroughput

Best For

Google Gemini 3.5 Flash GA, multimodal, configurable thinking_level

How To Use This Model

Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.

curl https://api.alltoken.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3.5-flash",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'
Supported Parameters
temperaturetop_pmax_tokenstoolsresponse_formatreasoning_effort
API Key Setup
Smart Routing

Let the platform choose the best provider path automatically.

Default Model

If a request does not specify a model, default the key to gemini-3.5-flash.

Forced Model

Always override incoming requests and lock the key to gemini-3.5-flash.