Embeddings
Top-performing multimodal multilingual long-context embeddings for search, RAG, agents applications.
Choosing the Right Embeddings
Our embedding models are designed to cover diverse search and GenAI applications.
Embedding API
Try our world-class embedding models to improve your search and RAG systems. Start with a free trial!
Returning data type
Example inputs
upload
Request
curl https://api.jina.ai/v1/embeddings \
-H "Content-Type: application/json" \
-H "Authorization: Bearer " \
-d '{
"model": "undefined",
"normalized": true,
"embedding_type": "float",
"input": [
"A blue cat",
"A red dog",
"btw to represent image u can either use URL or encode image into base64 like below.",
"https://i.pinimg.com/600x315/21/48/7e/21487e8e0970dd366dafaed6ab25d8d8.jpg",
"R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7"
]}'
API Pricing
Our API pricing is structured around the number of tokens sent in the requests. For Reader API, it is the number of tokens in the responses. This pricing model is applicable to all products in Jina AI's search foundation: Embedding, Reranking, Reader, Auto Fine-Tuning APIs. With the same API key, you have access to all API services.
Auto-recharge when tokens are low
speed
Understand the rate limit
Top up this API key with more tokens
Please input the right API key to top up
API Integrations
Our Embedding API is natively integrated with various renowned databases, vector stores, RAG, and LLMOps frameworks. To begin, just copy and paste your API key into any of the listed integrations for a quick and seamless start.
Vector Store
LLMOps
RAG
Observability
MongoDB
DataStax
Qdrant
Pinecone
Chroma
Weaviate
Milvus
Epsilla
MyScale
LlamaIndex
Haystack
Langchain
Dify
SuperDuperDB
DashVector
Portkey
Baseten
TiDB
LanceDB
Carbon
On-premises deployment
Deploy Jina Embeddings models in AWS Sagemaker and Microsoft Azure, and soon in Google Cloud Services, or contact our sales team to get customized Kubernetes deployments for your Virtual Private Cloud and on-premises servers.
Our Publications
Understand how our frontier search models were trained from scratch, check out our latest publications. Meet our team at EMNLP, SIGIR, ICLR, NeurIPS, and ICML!
Learning about Embeddings
Where to start with embeddings? We've got you covered. Learn about embeddings from the ground up with our comprehensive guide.
The Evolution of Embeddings Poster
Discover the ideal poster for your space, featuring captivating infographics or breathtaking visuals tracing the evolution of text embedding models since 1950.
Rate Limit
Product | API Endpoint | Description | Allowed Request | Without API Key (RPM) | With API Key (RPM) | With Premium API Key (RPM) | Average Latency (s) | Token Usage Counting | |
---|---|---|---|---|---|---|---|---|---|
Embedding API | https://api.jina.ai/v1/embeddings | Convert text/images to fixed-length vectors | POST | block | 60keyboard_double_arrow_up | 300keyboard_double_arrow_up | bolt depends on the input size | Count the number of tokens in the input request. | |
Reader API | https://r.jina.ai | Convert URL to LLM-friendly text | GET/POST | 20 | 200 | 1000 | 1.6 | Count the number of tokens in the output response. | |
Reader API | https://s.jina.ai | Search the web and convert results to LLM-friendly text | GET/POST | 5 | 40 | 100 | 7.7 | Count the number of tokens in the output response. | |
Segmenter API | https://segment.jina.ai | Tokenize and segment long text | GET/POST | 20 | 200 | 1000 | 0.3 | Token is not counted as usage. | |
Reranker API | https://api.jina.ai/v1/rerank | Tokenize and segment long text | POST | block | 60keyboard_double_arrow_up | 300keyboard_double_arrow_up | bolt depends on the input size | Count the number of tokens in the input request. |
Embeddings-related common questions
How were the jina-embeddings-v2 models trained?
What is jina-clip-v1, can I use it for search text and image?
Which languages do your models support?
What is the maximum length for a single sentence input?
What is the maximum number of sentences I can include in a single request?
How do I send images to the jina-clip-v1 model?
How do Jina Embeddings models compare to OpenAI's text-embedding-ada-002 model?
How seamless is the transition from OpenAI's text-embedding-ada-002 to your solution?
How tokens are calculated when using jina-clip-v1?
Do you provide models for embedding images or audio?
Can Jina Embedding models be fine-tuned with private or company data?
Can your endpoints be hosted privately on AWS, Azure, or GCP?
API-related common questions
code
Can I use the same API key for embedding, reranking, reader, fine-tuning APIs?
code
Can I monitor the token usage of my API key?
code
What should I do if I forget my API key?
code
Do API keys expire?
code
Why is the first request for some models slow?
code
Is user input data used for training your models?
Billing-related common questions
attach_money
Is billing based on the number of sentences or requests?
attach_money
Is there a free trial available for new users?
attach_money
Are tokens charged for failed requests?
attach_money
What payment methods are accepted?
attach_money
Is invoicing available for token purchases?