> For the complete documentation index, see [llms.txt](https://docs.dos.ai/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.dos.ai/models/pricing.md).

# Pricing

DOS AI uses a simple **pay-as-you-go** pricing model. You only pay for the tokens you use, with no minimum commitments, no monthly fees, and no hidden charges.

## Free Tier

Every new account receives **$5.00 in free credits** to get started. This is enough for substantial experimentation and prototyping before you need to add funds.

| Model           | Approximate Free Usage |
| --------------- | ---------------------- |
| Qwen3.5-35B-A3B | \~33 million tokens    |
| Llama 3.3 70B   | \~25 million tokens    |
| DeepSeek V3     | \~20 million tokens    |
| Llama 3.1 8B    | \~100 million tokens   |

> Free credits do not expire. No credit card is required to start.

## Per-Token Pricing

Pricing is calculated per **1 million tokens** (both input and output).

| Model                         | Input Price (per 1M tokens) | Output Price (per 1M tokens) |
| ----------------------------- | --------------------------- | ---------------------------- |
| **Qwen3.5-35B-A3B** (default) | $0.15                       | $0.15                        |
| **Llama 4 Maverick 17B-128E** | $0.17                       | $0.66                        |
| **Llama 4 Scout 17B-16E**     | $0.11                       | $0.38                        |
| **DeepSeek V3**               | $0.25                       | $0.25                        |
| **Llama 3.3 70B**             | $0.20                       | $0.20                        |
| **Llama 3.1 8B**              | $0.05                       | $0.05                        |

> Prices are DB-driven and may be updated. Check the [dashboard](https://app.dos.ai/models) or `GET /v1/catalog` for the latest pricing.

### What is a Token?

A token is roughly 3-4 characters of English text, or about 0.75 words. For example:

* "Hello, world!" = approximately 4 tokens
* A typical 500-word blog post = approximately 650-700 tokens
* A full 128K context window = approximately 96,000 words

## How Billing Works

1. **Add credits** to your account via the [dashboard](https://app.dos.ai).
2. **Make API calls** as normal. Each request deducts tokens used from your balance.
3. **Monitor usage** in real time through the dashboard billing page.

Token usage is calculated after each request completes. Both input tokens (your prompt) and output tokens (the model's response) are counted and billed at the rates above.

### Usage Tracking

Every API response includes a `usage` object showing exactly how many tokens were consumed:

```json
{
  "usage": {
    "prompt_tokens": 125,
    "completion_tokens": 320,
    "total_tokens": 445
  }
}
```

You can also view historical usage and spending breakdowns on the [dashboard](https://app.dos.ai).

## Enterprise & Volume Discounts

For organizations with high-volume needs, we offer custom pricing:

* **Volume discounts** for sustained usage above $100/month
* **Dedicated capacity** with guaranteed throughput
* **Custom rate limits** tailored to your workload
* **Priority support** with SLA guarantees

Contact us at **<support@dos.ai>** to discuss enterprise pricing.

## Comparison with Other Providers

DOS AI pricing is designed to be significantly more affordable than major cloud LLM providers, while offering comparable model quality. Our infrastructure runs on dedicated GPUs, allowing us to pass the savings directly to you.

## FAQ

### Do free credits expire?

No. Your free credits remain in your account until used.

### Is there a minimum top-up amount?

The minimum credit purchase is $5.00.

### What happens when my balance reaches zero?

API requests will return a `402 Payment Required` error. Add credits to resume usage immediately. No data is lost.

### Can I set spending limits?

Yes. You can configure monthly spending alerts and hard limits in the dashboard settings.

### Are there any hidden fees?

No. You pay only for the tokens you consume. There are no platform fees, no per-request fees, and no bandwidth charges.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.dos.ai/models/pricing.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.