
Usage Quotas

Every AI Platform tenant has a quota that limits how much of each AI primitive it can use per day and per month. The AI Gateway enforces quotas before a request reaches any provider, not after the fact.

Quota Limits by Tier

AI Tokens (Chat and Embed)

| Tier | Tokens/Month |
| --- | --- |
| Free | 100,000 |
| Starter | 1,000,000 |
| Professional | 10,000,000 |
| Enterprise | Unlimited |

Voice and Phone

| Tier | Voice Minutes/Month | Phone Minutes/Month |
| --- | --- | --- |
| Free | 10 | 0 |
| Starter | 100 | 50 |
| Professional | 1,000 | 500 |
| Enterprise | Unlimited | Unlimited |

Other Primitives

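| Tier | Web Pages/Month | Emails/Month | Custom GPTs | API Keys | Team Members | Storage |
| --- | --- | --- | --- | --- | --- | --- |
| Free | 50 | 100 | 2 | 1 | 3 | 100 MB |
| Starter | 500 | 5,000 | 10 | 5 | 10 | 1 GB |
| Professional | 5,000 | 50,000 | 50 | 20 | 50 | 10 GB |
| Enterprise | Unlimited | Unlimited | Unlimited | Unlimited | Unlimited | Unlimited |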

Soft Limits vs. Hard Limits

The Gateway applies two thresholds to your daily token allocation:

Soft limit — When daily token usage exceeds 80% of your daily allocation, the Gateway starts including a warning header in responses: X-Quota-Warning: approaching-daily-limit. Requests continue normally. This warning appears in the platform dashboard as a yellow banner.

Hard limit — When daily token usage reaches 100% of your daily allocation, requests that would exceed it are blocked with HTTP 429. The error message includes when the quota resets.

Daily quotas reset at midnight UTC.
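The two thresholds can be sketched as a small decision function. This is an illustration only, not the Gateway's actual code: the function names are hypothetical, and it assumes the soft-limit check looks at usage before the current request is counted, which the text above does not specify.

```python
from datetime import datetime, timedelta, timezone

SOFT_LIMIT_RATIO = 0.80  # warning threshold described above


def classify_request(used_today: int, request_tokens: int, daily_limit: int) -> str:
    """Return 'block', 'warn', or 'ok' for a request of `request_tokens` tokens."""
    # Hard limit: a request that would push usage past the allocation is blocked (HTTP 429).
    if used_today + request_tokens > daily_limit:
        return "block"
    # Soft limit: usage already above 80% adds the X-Quota-Warning header.
    # Assumption: usage is measured before this request is counted.
    if used_today > SOFT_LIMIT_RATIO * daily_limit:
        return "warn"
    return "ok"


def next_daily_reset(now: datetime) -> datetime:
    """Next midnight UTC strictly after `now` (which must be timezone-aware)."""
    now_utc = now.astimezone(timezone.utc)
    midnight = now_utc.replace(hour=0, minute=0, second=0, microsecond=0)
    return midnight + timedelta(days=1)
```

A client can use the same logic in reverse: if a response carries `X-Quota-Warning: approaching-daily-limit`, it may choose to defer non-urgent work until the next reset.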

Monthly Budget Cap

In addition to token counts, each tenant has a monthly spend cap in USD. The AI Gateway tracks the running month-to-date cost of all AI primitives (tokens, voice, phone, email, crawl, scans) and blocks a new request if it would push that cost over the cap.

The default monthly budget cap is:

  • Free: $5
  • Starter: $50
  • Professional: $500
  • Enterprise: Custom (contact sales)

Enterprise tenants can set a custom cap. If you need a higher cap without upgrading tiers, contact support.
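As a rough sketch of the budget check, the following compares month-to-date spend plus the estimated cost of a new request against the cap. The function name, the per-request cost estimate, and the use of `Decimal` are assumptions made for illustration; only the default cap values come from the list above.

```python
from decimal import Decimal
from typing import Optional

# Default caps from the list above. Enterprise has no default and needs a custom cap.
DEFAULT_BUDGET_CAP_USD = {
    "free": Decimal("5"),
    "starter": Decimal("50"),
    "professional": Decimal("500"),
}


def within_budget(
    tier: str,
    month_to_date_usd: Decimal,
    estimated_cost_usd: Decimal,
    custom_cap_usd: Optional[Decimal] = None,
) -> bool:
    """True if the request fits under the monthly cap (hypothetical helper)."""
    cap = custom_cap_usd if custom_cap_usd is not None else DEFAULT_BUDGET_CAP_USD.get(tier)
    if cap is None:
        raise ValueError(f"no budget cap configured for tier {tier!r}")
    # A request is blocked only if it would exceed the cap; landing exactly on it is allowed.
    return month_to_date_usd + estimated_cost_usd <= cap
```

For example, a Starter tenant at $49.90 month-to-date can still make a request estimated at $0.10, but not one estimated at $0.15.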

Feature Allowlists

By default, all features for your tier are enabled. On Enterprise, platform administrators can restrict which AI features specific sub-tenants or API keys can access. For example:

  • Allow only ticket-suggest and kb-generate for a support technician API key
  • Block proposal-generate for sub-tenants in a restricted department
  • Allow governance-scan only for security-role users

ℹ️ Feature allowlists are an Enterprise-only capability. On Starter and Professional, all features in your tier are always available.
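One way to picture the allow and block examples above is a per-key or per-sub-tenant policy object. The `FeaturePolicy` class below is hypothetical and does not reflect the platform's real configuration format; only the feature names are taken from the examples.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FeaturePolicy:
    """Hypothetical policy attached to an API key or sub-tenant."""
    allow: Optional[frozenset] = None  # None means every feature in the tier is allowed
    block: frozenset = frozenset()     # features that are always denied

    def permits(self, feature: str) -> bool:
        if feature in self.block:
            return False
        return self.allow is None or feature in self.allow


# Support technician key: only two features allowed.
support_key = FeaturePolicy(allow=frozenset({"ticket-suggest", "kb-generate"}))

# Restricted department: everything except proposal generation.
restricted_dept = FeaturePolicy(block=frozenset({"proposal-generate"}))
```

Here `support_key.permits("governance-scan")` is false, while `restricted_dept.permits("kb-generate")` is true.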

Checking Your Quota

In the platform:

  1. Navigate to Usage Analytics
  2. Open the Quota tab, which shows tokens used this month against your limit, a daily usage chart, and the days remaining
  3. The progress bar turns yellow at 80% and red at 95%

Via API:

GET /api/usage/quota
Authorization: Bearer oneai_your_key_here

Response:

{
  "tier": "professional",
  "tokens_used_today": 42150,
  "tokens_daily_limit": 333333,
  "tokens_used_month": 3100000,
  "tokens_monthly_limit": 10000000,
  "monthly_spend_usd": 12.40,
  "monthly_budget_usd": 500,
  "quota_reset_utc": "2026-04-01T00:00:00Z"
}
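A client might build the request and summarize the response along these lines. This is a sketch under stated assumptions: the base URL is a placeholder (the real host is not given here), the helper names are hypothetical, and the field names are taken from the sample response above.

```python
import json
import urllib.request


def build_quota_request(base_url: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the GET request. `base_url` is a placeholder."""
    url = base_url.rstrip("/") + "/api/usage/quota"
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {api_key}"})


def summarize_quota(payload: dict) -> dict:
    """Derive percentages and remaining amounts from the quota response."""
    return {
        "daily_pct": round(100 * payload["tokens_used_today"] / payload["tokens_daily_limit"], 1),
        "monthly_pct": round(100 * payload["tokens_used_month"] / payload["tokens_monthly_limit"], 1),
        "tokens_left_today": payload["tokens_daily_limit"] - payload["tokens_used_today"],
        "budget_left_usd": round(payload["monthly_budget_usd"] - payload["monthly_spend_usd"], 2),
    }


# The sample response shown above.
SAMPLE = json.loads("""
{
  "tier": "professional",
  "tokens_used_today": 42150,
  "tokens_daily_limit": 333333,
  "tokens_used_month": 3100000,
  "tokens_monthly_limit": 10000000,
  "monthly_spend_usd": 12.40,
  "monthly_budget_usd": 500,
  "quota_reset_utc": "2026-04-01T00:00:00Z"
}
""")

summary = summarize_quota(SAMPLE)
```

To send the request, pass the result of `build_quota_request` to `urllib.request.urlopen` and decode the body with `json.load`.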

What Happens When You Hit the Limit

When a request is blocked by quota:

| Source | What the user sees |
| --- | --- |
| Playground chat | "Your daily token quota has been reached. Resets at midnight UTC." |
| Jarvis (in another product) | "Jarvis is unavailable — quota exceeded. Try again tomorrow." |
| Custom GPT | "This agent is temporarily unavailable — quota limit reached." |
| API call | HTTP 429 with { "error": "quota_exceeded", "resets_at": "..." } |

Blocked requests do not count against your quota — the check happens before the provider call.
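For API callers, a blocked request can be turned into a wait time. The sketch below assumes that `resets_at` is an ISO 8601 UTC timestamp in the same style as `quota_reset_utc`; the format is not spelled out above, so treat that as an assumption. The function name is hypothetical.

```python
import json
from datetime import datetime, timezone
from typing import Optional


def seconds_until_reset(status: int, body: str, now: datetime) -> Optional[float]:
    """Return how long to wait if the call was blocked by quota, else None."""
    if status != 429:
        return None
    data = json.loads(body)
    if data.get("error") != "quota_exceeded":
        return None  # some other 429; not handled in this sketch
    # Assumption: resets_at looks like "2026-04-01T00:00:00Z".
    # The "Z" suffix is rewritten so older Python versions can parse it.
    resets_at = datetime.fromisoformat(data["resets_at"].replace("Z", "+00:00"))
    return max(0.0, (resets_at - now).total_seconds())
```

Because blocked requests do not count against quota, retrying after the returned delay is safe from a usage standpoint.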

Quota Reset Schedule

  • Daily tokens: Reset at midnight UTC every day
  • Monthly tokens and budget: Reset on the 1st of each calendar month at midnight UTC
  • Rate limit: 60 requests per minute (RPM) for all tiers, counted over a rolling 60-second window
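A rolling 60-second window differs from a fixed per-minute counter: each request stops counting exactly 60 seconds after it was made. The client-side throttle below is a minimal sketch of that idea, not the Gateway's implementation; the class name and defaults are illustrative.

```python
import time
from collections import deque
from typing import Optional


class RollingWindowLimiter:
    """Allow at most `max_requests` within any `window_seconds` span."""

    def __init__(self, max_requests: int = 60, window_seconds: float = 60.0) -> None:
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._stamps = deque()  # send times of requests still inside the window

    def try_acquire(self, now: Optional[float] = None) -> bool:
        """Record a request and return True, or return False if the window is full."""
        now = time.monotonic() if now is None else now
        # Drop timestamps that have aged out of the window.
        while self._stamps and now - self._stamps[0] >= self.window_seconds:
            self._stamps.popleft()
        if len(self._stamps) >= self.max_requests:
            return False
        self._stamps.append(now)
        return True
```

Calling `try_acquire()` before each API call keeps a client under the limit without waiting for a rejection from the server.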

Upgrading Your Tier

To increase quota limits:

  1. Navigate to Settings → Billing
  2. Click Upgrade plan
  3. Select your new tier and complete checkout
  4. Quota limits are updated immediately after payment

Token usage for the current month carries over — upgrading does not reset your usage, only your limit.
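The carry-over rule is simple arithmetic: the tokens left after an upgrade are the new tier's monthly limit minus what has already been used this month. A short sketch, with a hypothetical helper name and the limits taken from the tier table:

```python
from typing import Optional

# Monthly token limits from the tier table. Enterprise is unlimited.
TOKENS_PER_MONTH = {
    "free": 100_000,
    "starter": 1_000_000,
    "professional": 10_000_000,
    "enterprise": None,
}


def remaining_after_upgrade(used_this_month: int, new_tier: str) -> Optional[int]:
    """Tokens left this month on the new tier; None means unlimited."""
    limit = TOKENS_PER_MONTH[new_tier]
    if limit is None:
        return None
    # Usage carries over, so it is subtracted from the new limit.
    return max(0, limit - used_this_month)
```

A Starter tenant that has used 900,000 tokens and upgrades to Professional has 9,100,000 tokens left for the month, not 10,000,000.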

Studio Quota (Add-On)

Studio usage (deployed app API calls, client interactions) is billed separately from AI Platform core quota. Studio credits ($50/month included with the $199 platform fee) cover AI usage within deployed apps. See the Studio App Builder documentation for details (coming soon).