Models/deepseek/DeepSeek V4 Flash

DeepSeek V4 Flash

Name: DeepSeek V4 Flash API via AllToken
Brand: AllToken

DeepSeek·deepseek

deepseek-v4-flashText·—

DeepSeek V4 Flash cost-efficient, 284B total / 13B active params, 1M context, fast response

Usage ← Models

Overview Providers Pricing Usage Related Models

Context Window

1.0M

Input Price /M in

$0.15$0.14/M in

Output Price /M out

$0.31$0.28/M out

Cached Input Price /M

$0.0027/M

Max Completion

384K

Input Modalities

text

Output Modalities

text

reasoningFunction callingChatJSONStreamingrecommendedNew-10%GeneralProgrammingScience

Description

DeepSeek V4 Flash cost-efficient, 284B total / 13B active params, 1M context, fast response

Available Providers

AllToken can route requests to the providers below based on route priority and policy.

ProviderContextInputOutputCached / MLatencyThroughput

Best For

DeepSeek V4 Flash cost-efficient, 284B total / 13B active params, 1M context, fast response

How To Use This Model

Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.

curl https://api.alltoken.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-v4-flash",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

Supported Parameters

temperaturetop_pmax_tokenstoolsresponse_formatreasoning_effort

API Key Setup

Smart Routing

Let the platform choose the best provider path automatically.

Default Model

If a request does not specify a model, default the key to deepseek-v4-flash.

Forced Model

Always override incoming requests and lock the key to deepseek-v4-flash.

Usage