Rate Limit Overview
To ensure platform stability and fair usage, the Memories.ai API enforces rate limits based on the type of API endpoint you are calling.
Rate limits are applied per account. All API keys under the same account share the same rate limit quota.
Rate Limit Tiers
Standard APIs
1 QPS (queries per second). Applies to most API endpoints, including:
- Video / Audio / Image Upload
- Transcription
- Embeddings Generation
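One way to stay under a 1 QPS limit is to space calls on the client side. The following is a minimal sketch, not part of any Memories.ai SDK: the `Throttle` class and its parameters are hypothetical names chosen for illustration. Because all API keys under an account share one quota, a throttle like this only helps if every caller for the account goes through the same instance; it does not coordinate across processes or machines.

```python
import threading
import time


class Throttle:
    """Spaces calls at least 1/max_per_second seconds apart (hypothetical helper)."""

    def __init__(self, max_per_second=1.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_per_second
        self._clock = clock
        self._sleep = sleep
        self._lock = threading.Lock()
        self._next_allowed = None  # earliest time the next call may proceed

    def wait(self):
        """Reserve the next slot, sleep until it arrives, and return seconds waited."""
        with self._lock:
            now = self._clock()
            if self._next_allowed is None or now >= self._next_allowed:
                delay = 0.0
                self._next_allowed = now + self.min_interval
            else:
                delay = self._next_allowed - now
                self._next_allowed += self.min_interval
        if delay > 0:
            self._sleep(delay)
        return delay
```

Calling `throttle.wait()` immediately before each request to a Standard API endpoint keeps this process at or below the configured rate.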
Scraping & Task APIs
Varies by endpoint and channel. Applies to scraping and long-running task endpoints, including:
- YouTube / TikTok / Instagram / Twitter Scraping
- Async task-based processing
Understanding Models
Model provider rate limit. Rate limits follow the underlying model provider’s own limits:
- Video Understanding Models (VLM)
- Image Understanding Models (ILM)
Stream Processing
Concurrent stream limit. Limited by the maximum number of concurrent streams per account (video + audio combined).
Detailed Rate Limits by Endpoint
Video Processing — 1 QPS
| Endpoint | Rate Limit |
|---|---|
| Upload File | 1 QPS |
| Get Upload Signed URL | 1 QPS |
| Upload File Using Signed URL | 1 QPS |
| Edit Video | 1 QPS |
| Scene Detection | 1 QPS |
| Split Video | 1 QPS |
| Extract Video Frames | 1 QPS |
| Get Asset Metadata | 1 QPS |
| Download Asset | 1 QPS |
| Delete Asset | 1 QPS |
Transcription — 1 QPS
| Endpoint | Rate Limit |
|---|---|
| Sync Generate Audio Transcription | 1 QPS |
| Sync Generate Speaker | 1 QPS |
| Async Generate Video Description | 1 QPS |
| Async Generate Audio Transcription | 1 QPS |
| Async Generate Speaker | 1 QPS |
| Async Generate Summary | 1 QPS |
| Speaker Recognition | 1 QPS |
Embeddings — 1 QPS
| Endpoint | Rate Limit |
|---|---|
| Generate Video Embedding | 1 QPS |
| Generate Image Embedding | 1 QPS |
| Generate Text Embedding | 1 QPS |
Stream Processing — Concurrent Stream Limit
Stream processing endpoints are limited by the maximum number of concurrent streams per account (video + audio combined), rather than by QPS.
| Endpoint | Rate Limit |
|---|---|
| Start Video Stream Moderation | Max N concurrent streams |
| Stop Video Stream Moderation | No Limit |
| Start Audio Stream Transcription | Max N concurrent streams |
| Stop Audio Stream Transcription | No Limit |
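Since video and audio streams count against one combined limit, a client can keep a single local count of the streams it has open and decline to start more. The sketch below assumes the account's limit N is known to the caller; the `StreamSlots` class and the `max_streams` value are hypothetical and not part of the API. The server remains the source of truth, so a start call can still be rejected even when a local slot is free.

```python
import threading


class StreamSlots:
    """Tracks the streams this client believes are open (hypothetical helper)."""

    def __init__(self, max_streams):
        self._max = max_streams
        self._open = set()
        self._lock = threading.Lock()

    def try_start(self, stream_id):
        """Reserve a slot; return False if the local limit is already reached."""
        with self._lock:
            if stream_id in self._open:
                return True  # already counted
            if len(self._open) >= self._max:
                return False
            self._open.add(stream_id)
            return True

    def stop(self, stream_id):
        """Release the slot held by stream_id, if any."""
        with self._lock:
            self._open.discard(stream_id)

    def open_count(self):
        with self._lock:
            return len(self._open)
```

A caller would check `try_start(...)` before a Start Video Stream Moderation or Start Audio Stream Transcription request, and call `stop(...)` after the matching Stop request.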
When the server capacity is reached, the API returns status code 16 (Capacity Reached). Please retry later or contact sales for a higher concurrent stream limit.
Social Media Scraping
Rate limits for scraping endpoints vary by endpoint type and the channel parameter used. Limits are expressed in QPS, QPM, or QPH (queries per second, minute, or hour).
Metadata & Transcript Endpoints
These endpoints accept a channel parameter (rapid / memories.ai / apify). Rate limits are enforced per channel.
| Endpoint | Channel | Rate Limit |
|---|---|---|
| YouTube Video Metadata | rapid | 12 QPH |
| YouTube Video Metadata | memories.ai | 10 QPS |
| YouTube Video Metadata | apify | 10 QPS |
| TikTok Video Metadata | rapid / memories.ai | 600 QPM |
| TikTok Video Metadata | apify | 10 QPS |
| Instagram Video Metadata | rapid / memories.ai | 25 QPH |
| Instagram Video Metadata | apify | 10 QPS |
| Twitter Video Metadata | rapid / memories.ai | 20 QPH |
| Twitter Video Metadata | apify | 10 QPS |
| YouTube Video Transcript | rapid / memories.ai | 150 QPM |
| YouTube Video Transcript | apify | 10 QPS |
| TikTok Video Transcript | rapid / memories.ai | 600 QPM |
| TikTok Video Transcript | apify | 10 QPS |
| Instagram Video Transcript | rapid / memories.ai | 150 QPM |
| Instagram Video Transcript | apify | 10 QPS |
| Twitter Video Transcript | rapid / memories.ai | 20 QPH |
| Twitter Video Transcript | apify | 10 QPS |
Detail & Comment Endpoints
These endpoints do not use a channel parameter.
| Endpoint | Rate Limit |
|---|---|
| TikTok Video Detail | 600 QPM |
| TikTok Video Comment | 600 QPM |
| TikTok Video Comment Reply | 600 QPM |
| YouTube Video Detail | 10 QPS |
| YouTube Video Comment | 10 QPS |
| YouTube Video Comment Reply | 10 QPS |
Video Understanding Models — Model Provider Rate Limit
Memories.ai does not impose its own QPS limit on these endpoints. The effective rate limit is determined by the underlying model provider (e.g., Google Gemini, Amazon Nova, Alibaba Qwen). If you exceed the provider’s throughput limit, the API will return an error. Usage is also subject to your account’s token quota and billing limits.
| Endpoint | Rate Limit |
|---|---|
| Gemini Video | Subject to Gemini rate limit |
| Nova Video | Subject to Nova rate limit |
| Qwen Video | Subject to Qwen rate limit |
Image Understanding Models — Model Provider Rate Limit
| Endpoint | Rate Limit |
|---|---|
| Gemini Image | Subject to Gemini rate limit |
| GPT Image | Subject to GPT rate limit |
| Nova Image | Subject to Nova rate limit |
| Qwen Image | Subject to Qwen rate limit |
What Happens When You Exceed the Limit?
If you exceed the rate limit, the API will return a 429 Too Many Requests response.
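A common way to handle a 429 is to retry with exponential backoff and jitter. The sketch below is one possible approach, not an official client: `call_with_backoff` is a hypothetical helper, `send` stands for any caller-supplied function that performs the request and returns a `(status_code, body)` pair, and the delay values are arbitrary defaults rather than values recommended by this page.

```python
import random
import time


def call_with_backoff(send, max_attempts=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep, rng=random.random):
    """Call send() until it returns a non-429 status or attempts run out.

    `send` is a caller-supplied function returning (status_code, body).
    Returns the last (status_code, body) pair seen.
    """
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt == max_attempts - 1:
            break  # out of attempts; hand the 429 back to the caller
        # Exponential backoff with full jitter: wait a random time in [0, cap].
        cap = min(max_delay, base_delay * (2 ** attempt))
        sleep(cap * rng())
    return status, body
```

The jitter spreads out retries from multiple workers, which matters here because all API keys under an account draw from the same quota.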
Need Higher Rate Limits?
If your use case requires higher throughput, we offer customized rate limit plans for enterprise customers.
Contact Sales
Get in touch with our sales team to discuss a custom rate limit plan tailored to your needs.
Enterprise plans can include increased QPS/QPM limits, dedicated infrastructure, and priority support.
