
Gemini 3.5 Flash
gemini-3.5-flashText·—Google Gemini 3.5 Flash GA, multimodal, configurable thinking_level
Context Window
1.0M
Input Price /M in
Free/M in
Output Price /M out
Free/M out
Cached Input Price /M
Free/M
Max Completion
66K
Input Modalities
text, image, audio, video
Output Modalities
text
reasoningFunction callingChatVisionJSONStreamingNewFreefast
Description
Google Gemini 3.5 Flash GA, multimodal, configurable thinking_level
Available Providers
AllToken can route requests to the providers below based on route priority and policy.
ProviderContextInputOutputCached / MLatencyThroughput
Best For
Google Gemini 3.5 Flash GA, multimodal, configurable thinking_level
How To Use This Model
Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.
curl https://api.alltoken.ai/v1/chat/completions \
-H "Authorization: Bearer sk-your-key" \
-H "Content-Type: application/json" \
-d '{
"model": "gemini-3.5-flash",
"messages": [
{"role": "user", "content": "Hello!"}
]
}'Supported Parameters
temperaturetop_pmax_tokenstoolsresponse_formatreasoning_effortAPI Key Setup
Smart Routing
Let the platform choose the best provider path automatically.
Default Model
If a request does not specify a model, default the key to gemini-3.5-flash.
Forced Model
Always override incoming requests and lock the key to gemini-3.5-flash.