# Anthropic Rate Limits

## Overview

The Claude API uses token-bucket rate limiting measured in three dimensions: requests per minute (RPM), input tokens per minute (ITPM), and output tokens per minute (OTPM). Limits increase automatically as you move through usage tiers.

## Rate Limit Dimensions

| Dimension | Header | Description |
|-----------|--------|-------------|
| RPM | | Requests per minute |
| ITPM | | Input tokens per minute |
| OTPM | | Output tokens per minute |

Limits are per-organization and per-model-class. Cached input tokens do NOT count toward ITPM limits.

## Usage Tiers (Auto-Upgrade)…
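
To make the token-bucket model and the three dimensions (RPM, ITPM, OTPM) more concrete, here is a minimal client-side sketch in Python. It is an illustration only, not Anthropic's server-side implementation. The class names `TokenBucket` and `RateLimiter`, the `try_acquire` method, and the limit values are all hypothetical. It also assumes, as a simplification, that output usage is charged up front from the request's maximum output tokens, and that each bucket refills continuously at one sixtieth of its per-minute limit per second. Cached input tokens are subtracted before charging ITPM, as described above.

```python
class TokenBucket:
    """Hypothetical model of one rate-limit dimension (RPM, ITPM, or OTPM)."""

    def __init__(self, per_minute: float, now: float = 0.0):
        self.capacity = float(per_minute)  # bucket size equals the per-minute limit
        self.tokens = float(per_minute)    # start with a full bucket
        self.last = now                    # time of the last refill, in seconds

    def refill(self, now: float) -> None:
        # Continuous refill: capacity / 60 tokens per second, capped at capacity.
        elapsed = max(0.0, now - self.last)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.capacity / 60.0)
        self.last = now


class RateLimiter:
    """Hypothetical limiter that tracks all three dimensions together."""

    def __init__(self, rpm: float, itpm: float, otpm: float, now: float = 0.0):
        self.buckets = {
            "rpm": TokenBucket(rpm, now),
            "itpm": TokenBucket(itpm, now),
            "otpm": TokenBucket(otpm, now),
        }

    def try_acquire(self, input_tokens: int, max_output_tokens: int,
                    cached_input_tokens: int = 0, now: float = 0.0) -> bool:
        costs = {
            "rpm": 1,
            # Cached input tokens do not count toward ITPM.
            "itpm": max(0, input_tokens - cached_input_tokens),
            # Assumption: charge the worst-case output size up front.
            "otpm": max_output_tokens,
        }
        # Check every dimension first so a rejected request spends nothing.
        for name, cost in costs.items():
            bucket = self.buckets[name]
            bucket.refill(now)
            if bucket.tokens < cost:
                return False
        for name, cost in costs.items():
            self.buckets[name].tokens -= cost
        return True


if __name__ == "__main__":
    limiter = RateLimiter(rpm=2, itpm=1000, otpm=500)  # made-up limits
    print(limiter.try_acquire(400, 100, now=0.0))   # first request fits
    print(limiter.try_acquire(400, 100, now=0.0))   # second request fits
    print(limiter.try_acquire(100, 100, now=0.0))   # RPM bucket is empty
    print(limiter.try_acquire(100, 100, now=30.0))  # half a minute refills one request
```

A request is admitted only if all three buckets can cover it, which is why a burst can be rejected on RPM even when plenty of token budget remains. A real client should treat the limits reported by the API itself as authoritative rather than rely on a local model like this one.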