# Pricing

DOS AI uses a simple **pay-as-you-go** pricing model. You only pay for the tokens you use, with no minimum commitments, no monthly fees, and no hidden charges.

## Free Tier

Every new account receives **$5.00 in free credits** to get started. This is enough for substantial experimentation and prototyping before you need to add funds.

| Model           | Approximate Free Usage |
| --------------- | ---------------------- |
| Qwen3.5-35B-A3B | \~33 million tokens    |
| Llama 3.3 70B   | \~25 million tokens    |
| DeepSeek V3     | \~20 million tokens    |
| Llama 3.1 8B    | \~100 million tokens   |

> Free credits do not expire. No credit card is required to start.

## Per-Token Pricing

Pricing is calculated per **1 million tokens** (both input and output).

| Model               | Input Price (per 1M tokens) | Output Price (per 1M tokens) |
| ------------------- | --------------------------- | ---------------------------- |
| **Qwen3.5-35B-A3B** | $0.15                       | $0.15                        |
| **Llama 3.3 70B**   | $0.20                       | $0.20                        |
| **DeepSeek V3**     | $0.25                       | $0.25                        |
| **Llama 3.1 8B**    | $0.05                       | $0.05                        |

### What is a Token?

A token is roughly 3-4 characters of English text, or about 0.75 words. For example:

* "Hello, world!" = approximately 4 tokens
* A typical 500-word blog post = approximately 650-700 tokens
* A full 128K context window = approximately 96,000 words

## How Billing Works

1. **Add credits** to your account via the [dashboard](https://app.dos.ai).
2. **Make API calls** as normal. Each request deducts tokens used from your balance.
3. **Monitor usage** in real time through the dashboard billing page.

Token usage is calculated after each request completes. Both input tokens (your prompt) and output tokens (the model's response) are counted and billed at the rates above.

### Usage Tracking

Every API response includes a `usage` object showing exactly how many tokens were consumed:

```json
{
  "usage": {
    "prompt_tokens": 125,
    "completion_tokens": 320,
    "total_tokens": 445
  }
}
```

You can also view historical usage and spending breakdowns on the [dashboard](https://app.dos.ai).

## Enterprise & Volume Discounts

For organizations with high-volume needs, we offer custom pricing:

* **Volume discounts** for sustained usage above $100/month
* **Dedicated capacity** with guaranteed throughput
* **Custom rate limits** tailored to your workload
* **Priority support** with SLA guarantees

Contact us at **<support@dos.ai>** to discuss enterprise pricing.

## Comparison with Other Providers

DOS AI pricing is designed to be significantly more affordable than major cloud LLM providers, while offering comparable model quality. Our infrastructure runs on dedicated GPUs, allowing us to pass the savings directly to you.

## FAQ

### Do free credits expire?

No. Your free credits remain in your account until used.

### Is there a minimum top-up amount?

The minimum credit purchase is $5.00.

### What happens when my balance reaches zero?

API requests will return a `402 Payment Required` error. Add credits to resume usage immediately. No data is lost.

### Can I set spending limits?

Yes. You can configure monthly spending alerts and hard limits in the dashboard settings.

### Are there any hidden fees?

No. You pay only for the tokens you consume. There are no platform fees, no per-request fees, and no bandwidth charges.
