Pricing

DOS AI uses a simple pay-as-you-go pricing model. You only pay for the tokens you use, with no minimum commitments, no monthly fees, and no hidden charges.

Free Tier

Every new account receives $5.00 in free credits to get started. This is enough for substantial experimentation and prototyping before you need to add funds.

Model
Approximate Free Usage

Qwen3.5-35B-A3B

~33 million tokens

Llama 3.3 70B

~25 million tokens

DeepSeek V3

~20 million tokens

Llama 3.1 8B

~100 million tokens

Free credits do not expire. No credit card is required to start.

Per-Token Pricing

Pricing is calculated per 1 million tokens (both input and output).

Model
Input Price (per 1M tokens)
Output Price (per 1M tokens)

Qwen3.5-35B-A3B (default)

$0.15

$0.15

Llama 4 Maverick 17B-128E

$0.17

$0.66

Llama 4 Scout 17B-16E

$0.11

$0.38

DeepSeek V3

$0.25

$0.25

Llama 3.3 70B

$0.20

$0.20

Llama 3.1 8B

$0.05

$0.05

Prices are DB-driven and may be updated. Check the dashboard or GET /v1/catalog for the latest pricing.

What is a Token?

A token is roughly 3-4 characters of English text, or about 0.75 words. For example:

  • "Hello, world!" = approximately 4 tokens

  • A typical 500-word blog post = approximately 650-700 tokens

  • A full 128K context window = approximately 96,000 words

How Billing Works

  1. Add credits to your account via the dashboard.

  2. Make API calls as normal. Each request deducts tokens used from your balance.

  3. Monitor usage in real time through the dashboard billing page.

Token usage is calculated after each request completes. Both input tokens (your prompt) and output tokens (the model's response) are counted and billed at the rates above.

Usage Tracking

Every API response includes a usage object showing exactly how many tokens were consumed:

You can also view historical usage and spending breakdowns on the dashboard.

Enterprise & Volume Discounts

For organizations with high-volume needs, we offer custom pricing:

  • Volume discounts for sustained usage above $100/month

  • Dedicated capacity with guaranteed throughput

  • Custom rate limits tailored to your workload

  • Priority support with SLA guarantees

Contact us at [email protected] to discuss enterprise pricing.

Comparison with Other Providers

DOS AI pricing is designed to be significantly more affordable than major cloud LLM providers, while offering comparable model quality. Our infrastructure runs on dedicated GPUs, allowing us to pass the savings directly to you.

FAQ

Do free credits expire?

No. Your free credits remain in your account until used.

Is there a minimum top-up amount?

The minimum credit purchase is $5.00.

What happens when my balance reaches zero?

API requests will return a 402 Payment Required error. Add credits to resume usage immediately. No data is lost.

Can I set spending limits?

Yes. You can configure monthly spending alerts and hard limits in the dashboard settings.

Are there any hidden fees?

No. You pay only for the tokens you consume. There are no platform fees, no per-request fees, and no bandwidth charges.

Last updated