API Limits: Hidden Rules Governing Your Cloud Operations
Cloud providers use throttling mechanisms to ensure fair resource distribution, prevent overuse, and maintain service stability in multi-tenant distributed environments. These limits can apply to factors like requests per second, data transfer volume, network usage, vCPU quotas, and more. Throttling can occur at various levels, including service, region, account or subscription.
When throttling kicks in, users may face delays or temporary restrictions until their resource usage falls within acceptable limits.
Here are some examples.
Service
| Level
| Limits
|
AWS S3
| S3 Prefix
| 3500 PUT per second
3500 DELETE per second
5500 GET per second
|
AWS EBS
| Snapshot
|
|
AWS EBS
| Account
|
|
AWS EBS
| Account
|
|
Azure Standard storage
| Storage Account
|
|
Azure vCPUs
| Subscription
|
|
This is not an exhaustive list. More details for Azure services can be found here. For AWS, documentation of each service mentions quotas/API limits, examples including DDB and S3.
Some limits are hard limits, while few quotas can be increased by submitting a service request to the cloud provider.