Skip to content

Limits

Workers AI is now Generally Available. We’ve updated our rate limits to reflect this.

Note that model inferences in local mode using Wrangler will also count towards these limits. Beta models may have lower rate limits while we work on performance and scale.

Rate limits are default per task type, with some per-model limits defined as follows:

Rate limits by task type

Automatic Speech Recognition

  • 720 requests per minute

Image Classification

  • 3000 requests per minute

Image-to-Text

  • 720 requests per minute

Object Detection

  • 3000 requests per minute

Summarization

  • 1500 requests per minute

Text Classification

  • 2000 requests per minute

Text Embeddings

Text Generation

Text-to-Image

Translation

  • 720 requests per minute