Contact sales
Grow your business with Jina AI.
Rate Limit
Product | API Endpoint | Description | Allowed Request | Without API Key (RPM) | With API Key (RPM) | With Premium API Key (RPM) | Average Latency (s) | Token Usage Counting | |
---|---|---|---|---|---|---|---|---|---|
Reader API | https://r.jina.ai | Convert URL to LLM-friendly text | GET/POST | 20 | 200 | 1000 | 1.6 | Count the number of tokens in the output response. | |
Reader API | https://s.jina.ai | Search the web and convert results to LLM-friendly text | GET/POST | 5 | 40 | 100 | 7.7 | Count the number of tokens in the output response. | |
Segmenter API | https://segment.jina.ai | Tokenize and segment long text | GET/POST | 20 | 200 | 1000 | 0.3 | Token is not counted as usage. | |
Embedding API | https://api.jina.ai/v1/embeddings | Convert text/images to fixed-length vectors | POST | block | 60keyboard_double_arrow_up | 300keyboard_double_arrow_up | bolt depends on the input size | Count the number of tokens in the input request. | |
Reranker API | https://api.jina.ai/v1/rerank | Tokenize and segment long text | POST | block | 60keyboard_double_arrow_up | 300keyboard_double_arrow_up | bolt depends on the input size | Count the number of tokens in the input request. |
CC BY-NC License Self-Check
Are you using our official API or official images on Azure or AWS?
done
Yes
Are you using a paid API key or free trial key?
Are you using our official model images on AWS and Azure?
close
No
Reader-related common questions
What are the costs associated with using the Reader API?
How does the Reader API function?
Is the Reader API open source?
What is the typical latency for the Reader API?
Why should I use the Reader API instead of scraping the page myself?
Does the Reader API support multiple languages?
What should I do if a website blocks the Reader API?
Can the Reader API extract content from PDF files?
Can the Reader API process media content from web pages?
Is it possible to use the Reader API on local HTML files?
Does Reader API cache the content?
Can I use the Reader API to access content behind a login?
Can I use the Reader API to access PDF on arXiv?
How does image caption work in Reader?
What is the scalability of the Reader? Can I use it in production?
What is the rate limit of the Reader API?
Reranker-related common questions
How much does the Reranker API cost?
What is the difference between the two rerankers?
Is Jina Reranker open source?
Does the reranker support multiple languages?
What is the maximum length for queries and documents?
What is the maximum number of documents I can rerank per query?
What is the batch size and how many query-document tuples can I send in one request?
What latency can I expect when reranking 100 documents?
Can I deploy Jina Reranker on AWS?
Do you offer a fine-tuned reranker on domain-specific data?
Embeddings-related common questions
How were the jina-embeddings-v2 models trained?
What is jina-clip-v1, can I use it for search text and image?
Which languages do your models support?
What is the maximum length for a single sentence input?
What is the maximum number of sentences I can include in a single request?
How do I send images to the jina-clip-v1 model?
How do Jina Embeddings models compare to OpenAI's text-embedding-ada-002 model?
How seamless is the transition from OpenAI's text-embedding-ada-002 to your solution?
How tokens are calculated when using jina-clip-v1?
Do you provide models for embedding images or audio?
Can Jina Embedding models be fine-tuned with private or company data?
Can your endpoints be hosted privately on AWS, Azure, or GCP?
Segmenter-related common questions
How much does the Segmenter API cost?
If I don't provide an API key, what is the rate limit?
If I provide an API key, what is the rate limit?
Will you charge the tokens from my API key?
Does the Segmenter API support multiple languages?
What is the difference between GET and POST requests?
What is the maximum length I can tokenize per request?
How does the chunking feature work? Is it semantic chunking?
How do you handle special tokens such as 'endoftext' in the Segmenter API?
Does chunking support other languages than English?
Auto Fine-Tuning-related common questions
How much does the Fine-tuning API cost?
What do I need to input? Do I need to provide training data?
How long does it take to fine-tune a model?
Where are the fine-tuned models stored?
If I provide a reference URL, how does the system use it?
Can I fine-tune a model for a specific language?
Can I fine-tune non-Jina embeddings, e.g., bge-M3?
How do you ensure the quality of the fine-tuned models?
How do you generate synthetic data?
Can I keep my fine-tuned models and synthetic data private?
How can I use the fine-tuned model?
I never received the email with the evaluation results. What should I do?
API-related common questions
code
Can I use the same API key for embedding, reranking, reader, fine-tuning APIs?
code
Can I monitor the token usage of my API key?
code
What should I do if I forget my API key?
code
Do API keys expire?
code
Why is the first request for some models slow?
code
Is user input data used for training your models?
Billing-related common questions
attach_money
Is billing based on the number of sentences or requests?
attach_money
Is there a free trial available for new users?
attach_money
Are tokens charged for failed requests?
attach_money
What payment methods are accepted?
attach_money
Is invoicing available for token purchases?