notifications
News
Products
For Enterprises
Embeddings
Our world-class embeddings for search, RAG, agent systems.
Reranker
Maximize the search relevancy and RAG accuracy at ease.
Reader
Read URLs or search the web, get better grounding for LLMs.
Auto Fine-Tuning
Get fine-tuned embeddings for any domain you want.

For Power Users
PromptPerfect
Premier tool for prompt engineering
More power user tools
SceneXplain
Leading AI solution for image captions and video summaries
BestBanner
Blog to banner, without the prompts!
JinaChat
More modality, longer memory, less cost
Rationale
Ultimate AI decision-making tools

For Developers
DocArray
The data structure for multimodal data
Jina
Build multimodal AI applications on the cloud
More developer tools
CLIP-as-service
Embed images and sentences into fixed-length vectors with CLIP
Finetuner
Fine-tune embeddings on domain specific data for better search quality
JCloud
Deploy a local project as a cloud service. Radically easy, no nasty surprises.
LangChain-Serve
Langchain apps on production with Jina & FastAPI
VectorDB
A Python vector database you just need - no more, no less
DALL-E Flow
A human-in-the-Loop workflow for creating HD images from text
DiscoArt
Create compelling Disco Diffusion artworks in one line of code
ThinkGPT
Agent techniques to augment your LLM and push it beyond its limits
DevGPT
Your virtual development team
RunGPT
An open-source cloud-native of large multimodal models serving framework
Jerboa
An experimental finetuner for open-source LLMs


Company
About us
Contact sales
Intern program
Join us
open_in_new
Download logo
open_in_new


Embeddings

Multimodal, bilingual long-context embeddings for your search and RAG.

Choosing the Right Embeddings

Our embedding models are designed to cover diverse search and GenAI applications.
Multimodal Embeddings
view_in_ar
jina-clip-v1
General-Purpose Embeddings
view_in_ar
jina-embeddings-v2-base
Bilingual Embeddings
view_in_ar
jina-embeddings-v2-base-de
view_in_ar
jina-embeddings-v2-base-zh
view_in_ar
jina-embeddings-v2-base-es
Code Embeddings
view_in_ar
jina-embeddings-v2-base-code

Embedding API

Try our world-class embedding models to improve your search and RAG systems. Start with a free trial!
key
API Key & Billing
code
Usage
verified_user
Integrate
Visualize
search
Search
join_full
Pairwise
schedule_send
Batch Job

Auto preview
Read the docshelp_outlinefaq
API Status

Returning data type
Besides the float, you can ask it to return as binary for faster vector retrieval, or as base64 encoding for faster transmission.

Example inputs
Change them and see how the response changes!

upload
Request
curl https://api.jina.ai/v1/embeddings \
	 -H "Content-Type: application/json" \
	 -H "Authorization: Bearer " \
	 -d '{
	"model": "undefined",
	"embedding_type": "float",
	"input": [
		"A blue cat", 
		"A red dog", 
		"btw to represent image u can either use URL or encode image into base64 like below.", 
		"https://i.pinimg.com/600x315/21/48/7e/21487e8e0970dd366dafaed6ab25d8d8.jpg", 
		"R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7"
    ]}'


API Pricing

Our API pricing is structured around the quantity of tokens sent in the requests. For Reader API, it is the amount of tokens in the responses. This pricing model is applicable to all products in Jina AI's search foundation: Embedding, Reranking, Reader, Auto Fine-Tuning APIs. With the same API key, you have access to all API services.
Auto-recharge when tokens are low
Recommended for uninterrupted service in production. When your token balance is below the threshold you set, we will automatically recharge your credit card for the same amount as your last top-up. If you purchased multiple packs in the last top-up, we will recharge only one pack.
check
Top up this API key with more tokens
Depending on your location, you may be charged in USD, EUR, or other currencies. Taxes may apply.
Please input the right API key to top up
API Integrations
Our Embedding API is natively integrated with various renowned databases, vector stores, RAG, and LLMOps frameworks. To begin, just copy and paste your API key into any of the listed integrations for a quick and seamless start.
Vector Store
LLMOps
RAG
Observability
open_in_new
MongoDB
open_in_new
DataStax
open_in_new
Qdrant
open_in_new
Pinecone
open_in_new
Chroma
open_in_new
Weaviate
open_in_new
Milvus
open_in_new
Epsilla
open_in_new
MyScale
open_in_new
LlamaIndex
open_in_new
Haystack
open_in_new
Langchain
open_in_new
Dify
open_in_new
SuperDuperDB
open_in_new
DashVector
open_in_new
Portkey
open_in_new
Baseten
open_in_new
TiDB

On-premises deployment

Deploy Jina Embeddings models in AWS Sagemaker and Microsoft Azure, and soon in Google Cloud Services, or contact our sales team to get customized Kubernetes deployments for your Virtual Private Cloud and on-premises servers.
handshakeContact

Our Publications

Understand how our search foundation were trained from scratch, check out our latest publications. Meet our team at EMNLP, SIGIR, ICLR, NeurIPS, and ICML!
arXiv
June 21, 2024
Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models
ICML 2024
May 30, 2024
Jina CLIP: Your CLIP Model Is Also Your Text Retriever
arXiv
February 26, 2024
Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings
arXiv
October 30, 2023
Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents
EMNLP 2023
July 20, 2023
Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models

Learning about Embeddings

Where to start with embeddings? We've got you covered. Learn about embeddings from the ground up with our comprehensive guide.
Abstract 3D render of a neon blue and green grid pattern on a black background, creating a sense of depth.
June 05, 2024 • 8 minutes read
Jina CLIP v1: A Truly Multimodal Embeddings Model for Text and Image
Jina AI's new multimodal embedding model not only outperforms OpenAI CLIP in text-image retrieval, it's a solid image embedding model and state-of-the-art text embedding model at the same time. You don't need different models for different modalities any more.
Sofia Vasileva
Scott Martens
Susana Guzmán
May 15, 2024 • 11 minutes read
Binary Embeddings: All the AI, 3.125% of the Fat
32-bits is a lot of precision for something as robust and inexact as an AI model. So we got rid of 31 of them! Binary embeddings are smaller, faster and highly performant.
Sofia Vasileva
Scott Martens
Futuristic black background with a purple 3D grid, featuring the "Embeddings" and "Reranker" logos with a stylized "A".
April 29, 2024 • 7 minutes read
Jina Embeddings and Reranker on Azure: Scalable Business-Ready AI Solutions
Jina Embeddings and Rerankers are now available on Azure Marketplace. Enterprises that prioritize privacy and security can now easily integrate Jina AI's state-of-the-art models right in their existing Azure ecosystem.
Susana Guzmán
Abstract image with colorful wavy background featuring AWS, Embeddings, and Reranker logos.
March 25, 2024 • 21 minutes read
Next-Level Cloud AI: Jina Embeddings and Rerankers on Amazon SageMaker
Learn to use Jina Embeddings and Reranking models in a full-stack AI application on AWS, using only components available in Amazon SageMaker and the AWS Marketplace.
Scott Martens
Saahil Ognawala

Comparison of Reranker, Vector Search, and BM25

The table below provides a comprehensive comparison of the Reranker, Vector/Embeddings Search, and BM25, highlighting their strengths and weaknesses across various categories.
RerankerVector SearchBM25
Best ForEnhanced search precision and relevanceInitial, rapid filteringGeneral text retrieval across wide-ranging queries
GranularityDetailed: Sub-document and query segmentBroad: Entire documentsIntermediate: Various text segments
Query Time ComplexityHighMediumLow
Indexing Time ComplexityNot requiredHighLow, utilizes pre-built index
Training Time ComplexityHighHighNot required
Search QualitySuperior for nuanced queriesBalanced between efficiency and accuracyConsistent and reliable for a broad set of queries
StrengthsHighly accurate with deep contextual understandingQuick and efficient, with moderate accuracyHighly scalable, with established efficacy
Try reranker API for freeTry embedding API for free

The Evolution of Embeddings Poster

Discover the ideal poster for your space, featuring captivating infographics or breathtaking visuals tracing the evolution of text embedding models since 1950.
Learn how we made it
shopping_cartBuy a hard copy

FAQ

At any time, press
/
to open search bar
Embeddings-related common questions
How were the jina-embeddings-v2 models trained?
For detailed information on our training processes, data sources, and evaluations, please refer to our technical report available on arXiv.
launcharXiv
What is jina-clip-v1, can I use it for search text and image?
Jina CLIP jina-clip-v1 is the latest multimodal embedding model that supports text-text, text-image, image-image, and image-text retrieval tasks. Unlike OpenAI CLIP model that falls short on text-text search, Jina CLIP is trained to be your text retriever. You can read more about it from our tech report.
launcharXiv
Which languages do your models support?
Our models support English, German, Spanish, Chinese, various programming languages and images. For more details, please refer to our publication on bilingual models.
launcharXiv
What is the maximum length for a single sentence input?
Our models allow for an input length of up to 8192 tokens, which is significantly higher than most other models. A token can range from a single character, like 'a', to an entire word, such as 'apple'. The total number of characters that can be input depends on the length and complexity of the words used. This extended input capability enables our jina-embeddings-v2 models to perform more comprehensive text analysis and achieve higher accuracy in context understanding, especially for extensive textual data.
What is the maximum number of sentences I can include in a single request?
A single API call can process up to 2048 sentences or texts, facilitating extensive text analysis in one request.
How do I send images to the jina-clip-v1 model?
You can use either url or bytes in the input field of the API request. For url, provide the URL of the image you want to process. For bytes, encode the image in base64 format and include it in the request. The model will return the embeddings of the image in the response.
How do Jina Embeddings models compare to OpenAI's text-embedding-ada-002 model?
According to the MTEB Leaderboard, our Base model competes closely with OpenAI’s text-embedding-ada-002, exhibiting comparable performance on average. Furthermore, our Base model excels in several tasks, including classification, pair-classification, re-ranking, and summarization, outperforming OpenAI’s model.
How seamless is the transition from OpenAI's text-embedding-ada-002 to your solution?
The transition is streamlined, as our API endpoint, https://api.jina.ai/v1/embeddings, matches the input and output JSON schemas of OpenAI’s text-embeddings-ada-002 model. This compatibility ensures users can easily replace the OpenAI model with ours when using OpenAI’s endpoint.
How tokens are calculated when using jina-clip-v1?
The tokens are calculated based on the text length and image size. For text in the request, tokens are counted in the standard way. For image in the request, the following steps are conducted: 1. Tile Size: Each image is divided into tiles of size 224x224 pixels. 2. Coverage: The number of tiles required to completely cover the input image is calculated. Even if the image dimensions are not perfectly divisible by 224, we will count partial tiles as full tiles. 3. Total Tiles: The total number of tiles covering the image determines the cost. For instance, if an image is 500x500 pixels, it would be covered by 3x3 tiles, resulting in 9 tiles. 4. Cost Calculation: Each tile contributes to the final cost of processing the image. The cost per tile is 1000 tokens. Example: For an image with dimensions 500x500 pixels: • The image is divided into 224x224 pixel tiles. • The total number of tiles required is 3 (horizontal) x 3 (vertical) = 9 tiles. • The cost will be 9*1000 = 9000 tokens
Do you provide models for embedding images or audio?
Yes, jina-clip-v1 can embed both images and texts. Embedding models on more modalities will be announced soon!
Can Jina Embedding models be fine-tuned with private or company data?
For inquiries about fine-tuning our models with specific data, please contact us to discuss your requirements. We are open to exploring how our models can be adapted to meet your needs.
Contact
Can your endpoints be hosted privately on AWS, Azure, or GCP?
Yes, our services are available on the AWS marketplace, and we are in the process of expanding to Azure and GCP marketplaces. If you have particular requirements, please contact us at sales AT jina.ai.
launchAWS SageMaker
API-related common questions
code
Can I use the same API key for embedding, reranking, reader, fine-tuning APIs?
Yes, the same API key is valid for all search foundation products from Jina AI. This includes the embedding, reranking, reader and fine-tuning APIs, with tokens shared between the all services.
code
Can I monitor the token usage of my API key?
Yes, token usage can be monitored in the 'Buy tokens' tab by entering your API key, allowing you to view the usage history and remaining tokens.
code
What should I do if I forget my API key?
If you have misplaced a topped-up key and wish to retrieve it, please contact support AT jina.ai with your registered email for assistance.
Contact
code
Do API keys expire?
No, our API keys do not have an expiration date. However, if you suspect your key has been compromised and wish to retire it or transfer its tokens to a new key, please contact our support team for assistance.
Contact
code
Why is the first request for some models slow?
This is because our serverless architecture offloads certain models during periods of low usage. The initial request activates or 'warms up' the model, which may take a few seconds. After this initial activation, subsequent requests process much more quickly.
code
Is user input data used for training your models?
We adhere to a strict privacy policy and do not use user input data for training our models.
Billing-related common questions
attach_money
Is billing based on the number of sentences or requests?
Our pricing model is based on the total number of tokens processed, allowing users the flexibility to allocate these tokens across any number of sentences, offering a cost-effective solution for diverse text analysis requirements.
attach_money
Is there a free trial available for new users?
We offer a welcoming free trial to new users, which includes one million tokens for use with any of our models, facilitated by an auto-generated API key. Once the free token limit is reached, users can easily purchase additional tokens for their API keys via the 'Buy tokens' tab.
attach_money
Are tokens charged for failed requests?
No, tokens are not deducted for failed requests.
attach_money
What payment methods are accepted?
Payments are processed through Stripe, supporting a variety of payment methods including credit cards, Google Pay, and PayPal for your convenience.
attach_money
Is invoicing available for token purchases?
Yes, an invoice will be issued to the email address associated with your Stripe account upon the purchase of tokens.
Offices
location_on
Berlin, Germany (HQ)
Prinzessinnenstraße 19-20, 10969 Berlin, Germany
Geschäftsanschrift: Leipziger str. 96, 10117 Berlin, Germany
location_on
Beijing, China
Level 5, Building 6, No.48 Haidian West St. Beijing Haidian, China
location_on
Shenzhen, China
402, Floor 4, Fu'an Technology Building, Shenzhen Nanshan, China
Enterprise
Embeddings
Reranker
Reader
Auto Fine-Tuning
API Status
Power Users
PromptPerfect
SceneXplain
BestBanner
JinaChat
Rationale
Company
About us
Contact sales
Newsroom
Intern program
Join us
open_in_new
Download logo
open_in_new
Terms
Terms & Conditions
Privacy
Manage Cookies
email
Jina AI GmbH © 2020-2024.