No subscriptions. No seats. No minimums. Get an API key, add credits, and pay per million tokens — starting at $0.40 in / $0.80 out per 1M for Zoraxe. Up to 25× cheaper than frontier US vendors.
Add credits to your account, make API calls, pay only for what you use. Priced per million tokens — separately for input and output. No subscriptions, no seats, no idle charges.
Sign up, add prepaid credits, and receive your private API key. Point your SDK at api.zoraxe.ai/v1 and you're live.
Choose from 10 models — Zoraxe, GLM, DeepSeek, Qwen, Kimi. Switch per request. Input and output billed separately at the per-model rates in the table below.
Usage is deducted from your credit balance in real time. Set per-key spend caps to stay in control. Top up anytime. Volume discounts kick in automatically at scale.
Typical spend: $5–$50/month for light prototyping, $100–$500/month for active production apps. Need unlimited usage on your own infrastructure? See enterprise pricing ↓
All models run on the same sovereign Zoraxe infrastructure — your data never leaves. Pricing is per million tokens, charged separately for input (your prompt) and output (the response). No base fee on top.
| Model | Alias | Input | Output |
|---|---|---|---|
| — Zoraxe native models | |||
|
Zoraxe RECOMMENDED
Our general-purpose workhorse. Fast, versatile, private.
|
zoras |
$0.40/1M | $0.80/1M |
|
Zoraxe Coder
Tuned for code completion, agent mode, and repo-aware tasks.
|
zoras-coder |
$0.40/1M | $0.80/1M |
| — GLM family | |||
|
GLM 5.1
Long context (1M tokens), strong reasoning.
|
glm-5.1 |
$1.68/1M | $5.28/1M |
|
GLM 5
Flagship GLM generation; prior version of 5.1.
|
glm-5 |
$1.68/1M | $5.28/1M |
|
GLM 4.7
Mid-tier GLM; excellent quality-to-price.
|
glm-4.7 |
$0.72/1M | $2.64/1M |
| — DeepSeek family | |||
|
DeepSeek V3.1
Strong generalist with competitive throughput.
|
deepseek-v3.1 |
$0.67/1M | $2.02/1M |
|
DeepSeek V3.2
Latest DeepSeek; sharper reasoning, same pricing.
|
deepseek-v3.2 |
$0.67/1M | $2.02/1M |
| — Qwen & Kimi | |||
|
Qwen3 Plus
High-quality multilingual with cheap input.
|
qwen3-plus |
$0.60/1M | $3.60/1M |
|
Kimi K2.5
Moonshot custom variant; solid generalist.
|
kimi-k2.5-custom |
$0.72/1M | $3.60/1M |
|
Kimi K2.6
Latest Kimi; stronger reasoning and tool-use.
|
kimi-k2.6-custom |
$1.14/1M | $4.80/1M |
All prices in USD per 1,000,000 tokens. Input tokens are counted from your prompt (system + user messages + function schemas). Output tokens are counted from the model's response. Embeddings and fine-tuning priced separately — contact us.
If you need Zoraxe deployed in your own VPC or on-prem — with the API, Chat, and Code running on your infrastructure and your hardware — pricing moves to a flat annual platform fee plus an implementation engagement. No per-token markup. You own the GPU capacity.
Get an API key, add credits, and point your SDK at our endpoint. You'll be making calls in minutes — no subscriptions, no surprises on the bill.