Usage Quotas
Every AI Platform tenant has a quota that limits how much AI can be used per day and per month. Quotas are enforced at the AI Gateway level before requests reach any provider — not after the fact.
Quota Limits by Tier
AI Tokens (Chat and Embed)
| Tier | Tokens/Month |
|---|---|
| Free | 100,000 |
| Starter | 1,000,000 |
| Professional | 10,000,000 |
| Enterprise | Unlimited |
Voice and Phone
| Tier | Voice Minutes/Month | Phone Minutes/Month |
|---|---|---|
| Free | 10 | 0 |
| Starter | 100 | 50 |
| Professional | 1,000 | 500 |
| Enterprise | Unlimited | Unlimited |
Other Primitives
| Tier | Web Pages/Month | Emails/Month | Custom GPTs | API Keys | Team Members | Storage |
|---|---|---|---|---|---|---|
| Free | 50 | 100 | 2 | 1 | 3 | 100 MB |
| Starter | 500 | 5,000 | 10 | 5 | 10 | 1 GB |
| Professional | 5,000 | 50,000 | 50 | 20 | 50 | 10 GB |
| Enterprise | Unlimited | Unlimited | Unlimited | Unlimited | Unlimited | Unlimited |
Soft Limits vs. Hard Limits
The Gateway enforces two token limits per day:
Soft limit — When daily token usage exceeds 80% of your daily allocation, the Gateway starts including a warning header in responses: X-Quota-Warning: approaching-daily-limit. Requests continue normally. This warning appears in the platform dashboard as a yellow banner.
Hard limit — When daily token usage reaches 100% of your daily allocation, requests that would exceed it are blocked with HTTP 429. The error message includes when the quota resets.
Daily quotas reset at midnight UTC.
Monthly Budget Cap
In addition to token counts, each tenant has a monthly spend cap in USD. The AI Gateway tracks the rolling cost of all AI primitives (tokens, voice, phone, email, crawl, scans) and blocks new requests if the month-to-date cost would exceed the cap.
The default monthly budget cap is:
- Free: $5
- Starter: $50
- Professional: $500
- Enterprise: Custom (contact sales)
Enterprise tenants can set a custom cap. If you need a higher cap without upgrading tiers, contact support.
Feature Allowlists
By default, all features for your tier are enabled. On Enterprise, platform administrators can restrict which AI features specific sub-tenants or API keys can access. For example:
- Allow only
ticket-suggestandkb-generatefor a support technician API key - Block
proposal-generatefor sub-tenants in a restricted department - Allow
governance-scanonly for security-role users
Checking Your Quota
In the platform:
- Navigate to Usage Analytics
- The Quota tab shows current month tokens used vs. limit, daily usage chart, and days remaining
- The progress bar turns yellow at 80% and red at 95%
Via API:
GET /api/usage/quota
Authorization: Bearer oneai_your_key_here
Response:
{
"tier": "professional",
"tokens_used_today": 42150,
"tokens_daily_limit": 333333,
"tokens_used_month": 3100000,
"tokens_monthly_limit": 10000000,
"monthly_spend_usd": 12.40,
"monthly_budget_usd": 500,
"quota_reset_utc": "2026-04-01T00:00:00Z"
}
What Happens When You Hit the Limit
When a request is blocked by quota:
| Source | What the user sees |
|---|---|
| Playground chat | "Your daily token quota has been reached. Resets at midnight UTC." |
| Jarvis (in another product) | "Jarvis is unavailable — quota exceeded. Try again tomorrow." |
| Custom GPT | "This agent is temporarily unavailable — quota limit reached." |
| API call | HTTP 429 with { "error": "quota_exceeded", "resets_at": "..." } |
Blocked requests do not count against your quota — the check happens before the provider call.
Quota Reset Schedule
- Daily tokens: Reset at midnight UTC every day
- Monthly tokens and budget: Reset on the 1st of each calendar month at midnight UTC
- Rate limit (requests per minute): Resets on a rolling 60-second window — 60 RPM for all tiers
Upgrading Your Tier
To increase quota limits:
- Navigate to Settings → Billing
- Click Upgrade plan
- Select your new tier and complete checkout
- Quota limits are updated immediately after payment
Token usage for the current month carries over — upgrading does not reset your usage, only your limit.
Studio Quota (Add-On)
Studio usage (deployed app API calls, client interactions) is billed separately from AI Platform core quota. Studio credits ($50/month included with the $199 platform fee) cover AI usage within deployed apps. See the Studio App Builder documentation for details (coming soon).