Usage Quotas

Every AI Platform tenant has a quota that limits AI usage per day and per month. The AI Gateway enforces quotas before a request reaches any provider, not after the fact.

Quota Limits by Tier

AI Tokens (Chat and Embed)

Tier	Tokens/Month
Free	100,000
Starter	1,000,000
Professional	10,000,000
Enterprise	Unlimited

Voice and Phone

Tier	Voice Minutes/Month	Phone Minutes/Month
Free	10	0
Starter	100	50
Professional	1,000	500
Enterprise	Unlimited	Unlimited

Other Primitives

Tier	Web Pages/Month	Emails/Month	Custom GPTs	API Keys	Team Members	Storage
Free	50	100	2	1	3	100 MB
Starter	500	5,000	10	5	10	1 GB
Professional	5,000	50,000	50	20	50	10 GB
Enterprise	Unlimited	Unlimited	Unlimited	Unlimited	Unlimited	Unlimited

Soft Limits vs. Hard Limits

The Gateway enforces two token limits per day:

Soft limit — When daily token usage exceeds 80% of your daily allocation, the Gateway starts including a warning header in responses: X-Quota-Warning: approaching-daily-limit. Requests continue normally. This warning appears in the platform dashboard as a yellow banner.

Hard limit — When daily token usage reaches 100% of your daily allocation, requests that would exceed it are blocked with HTTP 429. The error message includes when the quota resets.

Daily quotas reset at midnight UTC.
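
The two thresholds can be pictured with a minimal sketch like the one below. It is not the Gateway's actual implementation: the function name `check_daily_quota` and the returned dictionary shape are hypothetical, and it assumes the soft-limit warning is evaluated against usage before the current request is counted.

```python
SOFT_LIMIT_PERCENT = 80  # warning threshold described above


def check_daily_quota(used_today: int, daily_limit: int, request_tokens: int) -> dict:
    """Hypothetical pre-flight check mirroring the soft and hard limit rules."""
    # Hard limit: block any request that would push usage past 100%.
    if used_today + request_tokens > daily_limit:
        return {"status": 429, "headers": {}}

    headers = {}
    # Soft limit: usage already above 80% of the daily allocation.
    # Integer arithmetic avoids floating-point rounding at the boundary.
    if used_today * 100 > SOFT_LIMIT_PERCENT * daily_limit:
        headers["X-Quota-Warning"] = "approaching-daily-limit"
    return {"status": 200, "headers": headers}
```

A client can watch for the `X-Quota-Warning` header on successful responses and slow down before the hard limit is reached.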

Monthly Budget Cap

In addition to token counts, each tenant has a monthly spend cap in USD. The AI Gateway tracks the running month-to-date cost of all AI primitives (tokens, voice, phone, email, crawl, scans) and blocks any new request that would push that cost over the cap.

The default monthly budget caps are:

Free: $5
Starter: $50
Professional: $500
Enterprise: Custom (contact sales)

Enterprise tenants can set a custom cap. If you need a higher cap without upgrading tiers, contact support.
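
As a rough illustration of the budget check, the sketch below compares month-to-date spend plus an estimated request cost against the cap. The function name `is_within_budget`, its parameters, and the idea of estimating the request cost up front are assumptions made for the example; only the default cap values come from the list above.

```python
from decimal import Decimal
from typing import Optional

# Default caps from the list above. Enterprise has no default (custom cap).
DEFAULT_BUDGET_CAP_USD = {
    "free": Decimal("5"),
    "starter": Decimal("50"),
    "professional": Decimal("500"),
}


def is_within_budget(
    tier: str,
    month_to_date_usd: Decimal,
    estimated_cost_usd: Decimal,
    custom_cap_usd: Optional[Decimal] = None,
) -> bool:
    """Hypothetical check: would this request keep spend at or under the cap?"""
    cap = custom_cap_usd if custom_cap_usd is not None else DEFAULT_BUDGET_CAP_USD.get(tier)
    if cap is None:
        raise ValueError(f"no budget cap configured for tier {tier!r}")
    # Decimal keeps currency arithmetic exact.
    return month_to_date_usd + estimated_cost_usd <= cap
```

For example, a Professional tenant at $12.40 month-to-date stays within budget for a $0.05 request, while a Free tenant at $4.99 would be blocked on a $0.02 request.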

Feature Allowlists

By default, all features for your tier are enabled. On Enterprise, platform administrators can restrict which AI features specific sub-tenants or API keys can access. For example:

Allow only ticket-suggest and kb-generate for a support technician API key
Block proposal-generate for sub-tenants in a restricted department
Allow governance-scan only for security-role users

ℹ️ Feature allowlists are an Enterprise-only capability. On Starter and Professional, all features in your tier are always available.
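
One way to think about an allowlist is as a per-key or per-sub-tenant policy evaluated before a feature call. The sketch below is illustrative only: the `FeaturePolicy` class and its fields are hypothetical and do not describe the platform's configuration format. The feature names are the ones used in the examples above.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional


@dataclass(frozen=True)
class FeaturePolicy:
    """Hypothetical policy attached to an API key or sub-tenant."""

    # None means "every feature in the tier is allowed" (the default behavior).
    allow: Optional[FrozenSet[str]] = None
    # Features that are always blocked, even if the allow set includes them.
    block: FrozenSet[str] = frozenset()

    def permits(self, feature: str) -> bool:
        if feature in self.block:
            return False
        return self.allow is None or feature in self.allow


# A support technician key limited to two features.
support_key = FeaturePolicy(allow=frozenset({"ticket-suggest", "kb-generate"}))
# A restricted department that may use everything except proposal generation.
restricted_dept = FeaturePolicy(block=frozenset({"proposal-generate"}))
```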

Checking Your Quota

In the platform:

Navigate to Usage Analytics
The Quota tab shows current month tokens used vs. limit, daily usage chart, and days remaining
The progress bar turns yellow at 80% and red at 95%

Via API:

GET /api/usage/quota
Authorization: Bearer oneai_your_key_here

Response:

{
  "tier": "professional",
  "tokens_used_today": 42150,
  "tokens_daily_limit": 333333,
  "tokens_used_month": 3100000,
  "tokens_monthly_limit": 10000000,
  "monthly_spend_usd": 12.40,
  "monthly_budget_usd": 500,
  "quota_reset_utc": "2026-04-01T00:00:00Z"
}
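
A small client for this endpoint might look like the following sketch. The `base_url` argument is an assumption, since the host name is not given here, and `summarize_quota` is a hypothetical helper that derives percentages from the fields shown in the response. The sketch does not cover how an Unlimited (Enterprise) limit is represented in the response.

```python
import json
import urllib.request


def fetch_quota(base_url: str, api_key: str) -> dict:
    """Call GET /api/usage/quota and return the decoded JSON body."""
    request = urllib.request.Request(
        f"{base_url}/api/usage/quota",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def summarize_quota(quota: dict) -> dict:
    """Derive usage percentages and remaining budget from the response fields."""
    return {
        "daily_percent": round(
            100 * quota["tokens_used_today"] / quota["tokens_daily_limit"], 1
        ),
        "monthly_percent": round(
            100 * quota["tokens_used_month"] / quota["tokens_monthly_limit"], 1
        ),
        "budget_remaining_usd": round(
            quota["monthly_budget_usd"] - quota["monthly_spend_usd"], 2
        ),
    }
```

Applied to the sample response above, `summarize_quota` reports roughly 12.6% of the daily allocation and 31% of the monthly allocation used.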

What Happens When You Hit the Limit

When a request is blocked by quota:

Source	What the user sees
Playground chat	"Your daily token quota has been reached. Resets at midnight UTC."
Jarvis (in another product)	"Jarvis is unavailable — quota exceeded. Try again tomorrow."
Custom GPT	"This agent is temporarily unavailable — quota limit reached."
API call	HTTP 429 with `{ "error": "quota_exceeded", "resets_at": "..." }`

Blocked requests do not count against your quota — the check happens before the provider call.
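
For API callers, a reasonable reaction to the 429 body is to wait until `resets_at` rather than retry immediately. The sketch below assumes `resets_at` is an ISO 8601 UTC timestamp in the same style as `quota_reset_utc`; that format is an assumption, and `seconds_until_reset` is a hypothetical helper name.

```python
import json
from datetime import datetime, timezone


def seconds_until_reset(body: str, now: datetime) -> float:
    """Return how long to wait before retrying a quota-blocked request."""
    payload = json.loads(body)
    if payload.get("error") != "quota_exceeded":
        raise ValueError("response is not a quota_exceeded error")
    # datetime.fromisoformat() accepts a trailing "Z" only from Python 3.11,
    # so normalize it to an explicit UTC offset first.
    resets_at = datetime.fromisoformat(payload["resets_at"].replace("Z", "+00:00"))
    # Never return a negative wait if the reset time has already passed.
    return max(0.0, (resets_at - now).total_seconds())
```

Pass a timezone-aware `now`, for example `datetime.now(timezone.utc)`, so the subtraction is well defined.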

Quota Reset Schedule

Daily tokens: Reset at midnight UTC every day
Monthly tokens and budget: Reset on the 1st of each calendar month at midnight UTC
Rate limit (requests per minute): 60 RPM for all tiers, counted over a rolling 60-second window
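
The schedule above can be sketched in code as follows. The helper names are hypothetical, and the rolling-window limiter is one common way to implement a 60-second window (a timestamp queue), not necessarily how the Gateway does it. The reset helpers expect timezone-aware datetimes.

```python
from collections import deque
from datetime import datetime, timedelta, timezone


def next_daily_reset(now: datetime) -> datetime:
    """Next midnight UTC after `now`."""
    today = now.astimezone(timezone.utc).replace(hour=0, minute=0, second=0, microsecond=0)
    return today + timedelta(days=1)


def next_monthly_reset(now: datetime) -> datetime:
    """Midnight UTC on the 1st of the next calendar month."""
    now = now.astimezone(timezone.utc)
    year, month = (now.year + 1, 1) if now.month == 12 else (now.year, now.month + 1)
    return datetime(year, month, 1, tzinfo=timezone.utc)


class RollingWindowLimiter:
    """Allow at most `max_requests` in any `window_seconds` span."""

    def __init__(self, max_requests: int = 60, window_seconds: float = 60.0) -> None:
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._timestamps: deque = deque()

    def allow(self, now: float) -> bool:
        # Drop timestamps that have aged out of the window.
        while self._timestamps and now - self._timestamps[0] >= self.window_seconds:
            self._timestamps.popleft()
        if len(self._timestamps) >= self.max_requests:
            return False
        self._timestamps.append(now)
        return True
```

Unlike the daily and monthly counters, the rolling window has no fixed reset instant: capacity frees up continuously as older requests leave the window.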

Upgrading Your Tier

To increase quota limits:

Navigate to Settings → Billing
Click Upgrade plan
Select your new tier and complete checkout
Quota limits are updated immediately after payment

Token usage for the current month carries over: upgrading raises your limit but does not reset your usage.
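
The carry-over rule is simple arithmetic, shown here as a short sketch. The function name `remaining_after_upgrade` is hypothetical; the limits are the monthly token limits from the tier table, with Enterprise omitted because it is unlimited.

```python
# Monthly token limits from the tier table (Enterprise is unlimited).
MONTHLY_TOKEN_LIMIT = {
    "free": 100_000,
    "starter": 1_000_000,
    "professional": 10_000_000,
}


def remaining_after_upgrade(tokens_used_month: int, new_tier: str) -> int:
    """Tokens left this month once the new tier's limit applies."""
    # Usage is not reset, so it is subtracted from the new, higher limit.
    return max(0, MONTHLY_TOKEN_LIMIT[new_tier] - tokens_used_month)
```

A Starter tenant that has used 900,000 tokens and upgrades to Professional has 9,100,000 tokens left for the rest of the month, not the full 10,000,000.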

Studio Quota (Add-On)

Studio usage (deployed app API calls, client interactions) is billed separately from AI Platform core quota. Studio credits ($50/month included with the $199 platform fee) cover AI usage within deployed apps. See the Studio App Builder documentation for details (coming soon).
