Models/llama/Llama 3.1 70B
A

llama

Llama 3.1 70B

llama-3.1-70b128K context$0.88 / $0.88 per Mtool calling

Context Window

128K

Input Price / M

$0.88

Output Price / M

$0.88

Max Completion

Input Modalities

text

Output Modalities

text

Description

Available Providers

AllToken can route requests to the providers below based on route priority and policy.

Provider
Context Length
Input Price
Output Price
Cache Read
Priority

Amazon Bedrock

llama-3.1-70b
128K
$0.88 / M
$0.88 / M
1

Status Snapshot

Quick route status for this model. Open the full Status page to compare it against the wider market and current incident patterns.

Open Status

Effective Pricing

Input

$0.88

per 1M input tokens

Output

$0.88

per 1M output tokens

Cached Read

Free

if prompt caching applies

Best For

ChatToolsStreaming

How To Use This Model

Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.

Exact request field

"model": "llama-3.1-70b"
curl https://api.alltoken.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-3.1-70b",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

Supported Parameters

temperaturetop_pmax_tokenstools

API Key Setup

Smart Routing

Let the platform choose the best provider path automatically.

Default Model

If a request does not specify a model, default the key to llama-3.1-70b.

Forced Model

Always override incoming requests and lock the key to llama-3.1-70b.