Skip to content
Models/openai/GPT-4o Mini

GPT-4o Mini

OpenAI·openai·gpt-4o-mini·

OpenAI compact model, fast and affordable, ideal for lightweight tasks

Context Window
128K
Input price / 1M tokens
$0.151M tokens
Output price / 1M tokens
$0.601M tokens
Cached input / 1M tokens
$0.071M tokens
Max Completion
16K
Input Modalities
text, image
Output Modalities
text
Function callingChatVisionJSONStreaming

Description

OpenAI compact model, fast and affordable, ideal for lightweight tasks

Available Providers

AllToken can route requests to the providers below based on route priority and policy.

ProviderContext LengthInput PriceOutput PriceCached / MLatency p50Throughput

Best For

OpenAI compact model, fast and affordable, ideal for lightweight tasks

How To Use This Model

Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.

curl https://api.alltoken.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'
Supported Parameters
temperaturetop_pmax_tokenstoolsresponse_format
API Key Setup
Smart Routing

Let the platform choose the best provider path automatically.

Default Model

If a request does not specify a model, default the key to gpt-4o-mini.

Forced Model

Always override incoming requests and lock the key to gpt-4o-mini.