Products

Convert any URL to Markdown for better grounding LLMs.

World-class multimodal multilingual embeddings.

World-class reranker for maximizing search relevancy.

Search, read and reason until best answer found.

More

Zero-shot and few-shot classification for image and text.

Cut long text into chunks and do tokenization.

Auto codegen for your copilot IDE or LLM

Company

Terms & Conditions

Newsroom

Accelerate search AI, one word at a time.

Featured

Word "Embeddings" followed by a numeric or symbol representation, displayed in multiple colors on a technology-themed, colorf

June 25, 2025 • 12 minutes read

Jina Embeddings v4: Universal Embeddings for Multimodal Multilingual Retrieval

Jina Embeddings v4 is a 3.8 billion parameter universal embedding model for multimodal and multilingual retrieval that supports both single-vector and multi-vector embedding outputs.

April 08, 2025 • 21 minutes read

jina-reranker-m0: Multilingual Multimodal Document Reranker

Introducing jina-reranker-m0, our new multilingual multimodal reranker for retrieving visual documents, with SOTA performance on multilingual long documents and code searching tasks.

Modern dot matrix text display on a dark blue background, conveying a digital feel.

February 25, 2025 • 19 minutes read

A Practical Guide to Implementing DeepSearch/DeepResearch

QPS out, depth in. DeepSearch is the new norm. Find answers through read-search-reason loops. Learn what it is and how to build it.

Abstract interlocking circles pattern in black on orange, with text 'THINK:SEARCH:THINK' below.

Latest

Network illustration of interconnected hexagons, some solid and some hollow blue, connected by red lines indicating paths or

July 14, 2025 • 11 minutes read

Submodular Optimization for Text Selection, Passage Reranking & Context Engineering

Black and white typographic design of "1993" with a 3D effect, minimalistic black border, and a sense of depth on a white bac

July 04, 2025 • 13 minutes read

Submodular Optimization for Diverse Query Generation in DeepResearch

Retro-style digital screen displaying four pixelated images: a cat, a woman, an abstract figure, and a man's portrait, with l

June 30, 2025 • 8 minutes read

Quantization-Aware Training of jina-embeddings-v4

Academic Publications

jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval

ReaderLM-v2: Small Language Model for HTML to Markdown and JSON

December 17, 2024

AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark

December 12, 2024

jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images

September 18, 2024

jina-embeddings-v3: Multilingual Embeddings With Task LoRA

September 07, 2024

Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models

August 30, 2024

Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction Retriever

Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models

Jina CLIP: Your CLIP Model Is Also Your Text Retriever

February 26, 2024

Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings

October 30, 2023

Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents

Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models

12 publications in total.

Featured

All

Press release

Tech blog

Opinion

Event

July 14, 2025 • 11 minutes read

Submodular Optimization for Text Selection, Passage Reranking & Context Engineering

While others rely on prompt tuning and hope for the best, you should learn submodular optimization that provides a principled framework with theoretical guarantees for better context engineering.

Network illustration of interconnected hexagons, some solid and some hollow blue, connected by red lines indicating paths or

July 04, 2025 • 13 minutes read

Submodular Optimization for Diverse Query Generation in DeepResearch

Many know the importance of query diversity in DeepResearch, but few know how to solve it rigorously via submodular optimization.

Black and white typographic design of "1993" with a 3D effect, minimalistic black border, and a sense of depth on a white bac

June 30, 2025 • 8 minutes read

Quantization-Aware Training of jina-embeddings-v4

Quantization gives smaller embeddings. We show you fine-tuned quantization gives you even lossless embeddings.

Retro-style digital screen displaying four pixelated images: a cat, a woman, an abstract figure, and a man's portrait, with l

June 25, 2025 • 12 minutes read

Jina Embeddings v4: Universal Embeddings for Multimodal Multilingual Retrieval

Jina Embeddings v4 is a 3.8 billion parameter universal embedding model for multimodal and multilingual retrieval that supports both single-vector and multi-vector embedding outputs.

Word "Embeddings" followed by a numeric or symbol representation, displayed in multiple colors on a technology-themed, colorf

May 28, 2025 • 4 minutes read

Correlations: Vibe-Testing Embeddings in GUI

As serious as we are about MTEB, we also love vibe-testing. Correlations is a simple GUI we use for validating citations in DeepSearch, debugging late chunking, and vibe-testing embeddings. Now it's open-source.

Technical screen showing green and yellow visual data, including charts in the lower half and a heat-map-like visualization a

May 25, 2025 • 21 minutes read

What We Learned at ICLR2025

We collect some most interesting papers in ICLR 2025, featuring TIPS, FlexPrefill, Zero-Shot Rerankers, SVD-LLM, Hymba etc.

Three people smiling on a stage at a conference with an ICLR banner visible, suggesting a warm and lively event atmosphere.

May 25, 2025 • 8 minutes read

Fair Scoring for Multimodal Documents with jina-reranker-m0

Text similarity: 0.7. Image similarity: 0.5. Which document is more relevant? You literally cannot tell—and that's the core problem breaking multimodal search. We solve it with unified reranking.

Stacked glowing green ovals on a background transitioning from black to green, with the top oval having an unusual, split sha

May 07, 2025 • 9 minutes read

Model Soup’s Recipe for Embeddings

Boost robustness and performance with model soups: averaging weights. No extra cost, better results.

Still life drawing of a purple bowl filled with apples and oranges on a white table. The scene features rich colors against a

April 16, 2025 • 10 minutes read

On the Size Bias of Text Embeddings and Its Impact in Search

Size bias refers to how the length of text inputs affects similarity, regardless of semantic relevance. It explains why search systems sometimes return long, barely-relevant documents instead of shorter, more precise matches to your query.

Black background with a simple white ruler marked in centimeters, emphasizing a minimalist design.

April 08, 2025 • 21 minutes read

jina-reranker-m0: Multilingual Multimodal Document Reranker

Introducing jina-reranker-m0, our new multilingual multimodal reranker for retrieving visual documents, with SOTA performance on multilingual long documents and code searching tasks.

Modern dot matrix text display on a dark blue background, conveying a digital feel.

Search by title

Filter by product

Filter by author

Offices

Sunnyvale, CA

710 Lakeway Dr, Ste 200, Sunnyvale, CA 94085, USA

Berlin, Germany (HQ)

Prinzessinnenstraße 19-20, 10969 Berlin, Germany

Beijing, China

Level 5, Building 6, No.48 Haidian West St. Beijing, China

Shenzhen, China

402 Floor 4, Fu'an Technology Building, Shenzhen, China

Search Foundation

API Documentation

Get Jina API key

Company

Terms

Terms & Conditions

Jina AI © 2020-2025.