Segmenter

Free API for tokenizing text and segmenting long text into chunks.

Segmenter API

Our Segmenter API helps keep LLM input within context limits. It lets developers count tokens and extract relevant text segments, which supports efficient data processing and cost management.

GET endpoint: https://api.jina.ai/v1/segment?content=

The sample input used in the request below comes to 73 tokens and 125 characters.
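The `?content=` form of the endpoint suggests that short inputs can be passed as a URL query parameter. The following is a minimal sketch of building such a URL with Python's standard library. The helper name `build_get_url` is hypothetical, and percent-encoding the whole text is an assumption about what the service expects rather than documented behavior.

```python
from urllib.parse import quote

SEGMENT_URL = "https://api.jina.ai/v1/segment"  # endpoint shown in this document


def build_get_url(content: str) -> str:
    """Return a GET URL with `content` percent-encoded as a query parameter."""
    # safe="" makes quote() encode every reserved character, including "/" and ",".
    # Non-ASCII text is encoded as UTF-8 bytes.
    return f"{SEGMENT_URL}?content={quote(content, safe='')}"


url = build_get_url("Hello, world!\n你好")
print(url)
```

For long documents a POST request with a JSON body is usually the safer choice, since URLs have practical length limits.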

Return the tokens

Return the tokens and their corresponding IDs in the response.

Return the chunks

Chunk the input into semantically meaningful segments, using common structural cues to handle a wide variety of text types and edge cases.

Return the first N tokens

Return the first N tokens of the given content. Boundary exclusive. Cannot be used with 'tail'.

Return the last N tokens

Return the last N tokens of the given content. Boundary exclusive. Cannot be used with 'head'.
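The options above can be combined into a single JSON request body. The sketch below assembles one and enforces the stated rule that 'head' and 'tail' are mutually exclusive. Only `content`, `head`, and `tail` appear in this document; the field names `return_tokens` and `return_chunks`, and the helper `build_segment_payload`, are assumptions made for illustration and should be checked against the API reference.

```python
import json
from typing import Any, Dict, Optional


def build_segment_payload(
    content: str,
    return_tokens: bool = False,
    return_chunks: bool = False,
    head: Optional[int] = None,
    tail: Optional[int] = None,
) -> Dict[str, Any]:
    """Assemble a JSON body for the Segmenter endpoint (field names partly assumed)."""
    if head is not None and tail is not None:
        # The option descriptions state that 'head' and 'tail' cannot be combined.
        raise ValueError("'head' and 'tail' cannot be used together")
    for name, value in (("head", head), ("tail", tail)):
        if value is not None and value <= 0:
            raise ValueError(f"'{name}' must be a positive integer")

    payload: Dict[str, Any] = {"content": content}
    if return_tokens:
        payload["return_tokens"] = True  # assumed field name
    if return_chunks:
        payload["return_chunks"] = True  # assumed field name
    if head is not None:
        payload["head"] = head
    if tail is not None:
        payload["tail"] = tail
    return payload


payload = build_segment_payload("Chunk me, please.", return_tokens=True, head=5)
print(json.dumps(payload))
```

Validating on the client avoids spending a rate-limited request on a combination the service is documented to reject.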

Tokenizer

Choose the tokenizer to use, for example cl100k_base.

Request

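# POST the text to the Segmenter endpoint; the JSON body is read from stdin.
# The quoted heredoc delimiter keeps the shell from expanding anything in the body.
curl -X POST 'https://api.jina.ai/v1/segment' \
  -H "Content-Type: application/json" \
  -d @- <<'EOF'
  {
    "content": "Jina AI: Your Search Foundation, Supercharged! 🚀\nIhrer Suchgrundlage, aufgeladen! 🚀\n您的搜索底座，从此不同！🚀\n検索ベース,もう二度と同じことはありません！🚀"
  }
EOF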
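For readers who prefer Python, the same POST request can be sketched with the standard library alone. The function names are hypothetical. Sending the key as a Bearer token in the `Authorization` header is an assumption here; this document only says that the key goes in the request header. The response is decoded as generic JSON because its fields are not described in this document.

```python
import json
import urllib.request
from typing import Optional

SEGMENT_URL = "https://api.jina.ai/v1/segment"  # endpoint from the curl example


def make_segment_request(content: str, api_key: Optional[str] = None) -> urllib.request.Request:
    """Build (but do not send) a POST request equivalent to the curl command."""
    headers = {"Content-Type": "application/json"}
    if api_key:
        # Assumption: the key is passed as a Bearer token.
        headers["Authorization"] = f"Bearer {api_key}"
    body = json.dumps({"content": content}).encode("utf-8")
    return urllib.request.Request(SEGMENT_URL, data=body, headers=headers, method="POST")


def segment(content: str, api_key: Optional[str] = None, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON response."""
    request = make_segment_request(content, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the request does not touch the network; call segment(...) to send it.
request = make_segment_request("Jina AI: Your Search Foundation, Supercharged! 🚀")
print(request.get_method(), request.full_url)
```

Separating request construction from sending makes the payload easy to inspect or unit-test before any rate-limited call is made.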

What is a Segmenter?

A segmenter is a crucial component that converts text into tokens or chunks, which are the basic units of data that an embedding/reranker model or LLM processes. Tokens can represent whole words, parts of words, or even individual characters.

Chunking long documents, lightning fast!

You can also use the Segmenter API to cut long documents into smaller chunks, making them easier to process with embedding models or rerankers. We rely on common structural cues and a set of rules and heuristics that perform well across diverse types of content, e.g. Markdown, HTML, LaTeX and CJK languages.

Maximum length of each chunk: 1000

Maximum number of characters in each chunk. In practice, a chunk can be shorter than this value when the text offers a good boundary before the limit.
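To illustrate why a chunk can end before the maximum length, here is a toy greedy chunker that runs locally. It is not the service's algorithm, which is described only as a set of rules and heuristics. The function `chunk_text`, its `max_chunk_length` parameter, and the boundary list are all hypothetical choices made for this sketch.

```python
from typing import List

# Boundary candidates in order of preference (an illustrative choice).
BOUNDARIES = ["\n\n", "\n", ". ", " "]


def chunk_text(text: str, max_chunk_length: int = 1000) -> List[str]:
    """Greedily cut `text` into chunks of at most `max_chunk_length` characters."""
    if max_chunk_length <= 0:
        raise ValueError("max_chunk_length must be positive")
    chunks: List[str] = []
    start = 0
    while start < len(text):
        end = min(start + max_chunk_length, len(text))
        if end < len(text):
            window = text[start:end]
            for separator in BOUNDARIES:
                cut = window.rfind(separator)
                if cut > 0:
                    # Break just after the boundary, so the chunk may be
                    # shorter than the limit.
                    end = start + cut + len(separator)
                    break
        chunks.append(text[start:end])
        start = end
    return chunks


sample = "First paragraph.\n\nSecond paragraph is a bit longer. It has two sentences."
for chunk in chunk_text(sample, max_chunk_length=40):
    print(len(chunk), repr(chunk))
```

When no boundary is found inside the window, the sketch falls back to a hard cut at the limit, so every chunk respects the maximum and the chunks always concatenate back to the original text.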

Segmenter API is free!

By providing your API key, you can access a higher rate limit, and your key won't be charged.

Rate Limit

Rate limits are tracked in two ways: RPM (requests per minute) and TPM (tokens per minute). Limits are enforced per IP address or per API key, and are triggered by whichever threshold, RPM or TPM, is reached first. When you provide an API key in the request header, we track rate limits by key rather than by IP address.
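A client can avoid hitting these limits by tracking its own usage. The sketch below keeps a sliding 60-second window of requests and tokens, and refuses a request when either budget would be exceeded. The class `RateLimiter` is hypothetical, and the sliding window is an assumption: how the service actually measures a minute is not stated here.

```python
import time
from collections import deque
from typing import Callable, Deque, Tuple


class RateLimiter:
    """Client-side sliding-window check for RPM and TPM budgets."""

    def __init__(self, rpm: int, tpm: int, clock: Callable[[], float] = time.monotonic):
        self.rpm = rpm
        self.tpm = tpm
        self.clock = clock
        self.events: Deque[Tuple[float, int]] = deque()  # (timestamp, tokens)

    def _prune(self, now: float) -> None:
        # Drop events that have left the 60-second window.
        while self.events and now - self.events[0][0] >= 60.0:
            self.events.popleft()

    def try_acquire(self, tokens: int = 0) -> bool:
        """Record the request and return True if it fits both budgets."""
        now = self.clock()
        self._prune(now)
        used_tokens = sum(count for _, count in self.events)
        if len(self.events) + 1 > self.rpm or used_tokens + tokens > self.tpm:
            return False  # whichever threshold is hit first blocks the request
        self.events.append((now, tokens))
        return True


# Demo with a fake clock so the behavior is deterministic.
fake_now = [0.0]
limiter = RateLimiter(rpm=2, tpm=100, clock=lambda: fake_now[0])
results = [limiter.try_acquire(40), limiter.try_acquire(40), limiter.try_acquire(10)]
fake_now[0] = 61.0  # the earlier requests have aged out of the window
results.append(limiter.try_acquire(10))
print(results)
```

Injecting the clock keeps the limiter testable. In real use the default `time.monotonic` applies, and a caller would sleep and retry when `try_acquire` returns False.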

Product	API Endpoint	Description	w/o API Key	w/ API Key	w/ Premium API Key	Average Latency	Token Usage Counting	Allowed Request
Reader API	`https://r.jina.ai`	Convert URL to LLM-friendly text	20 RPM	500 RPM	5,000 RPM	7.9s	Count the number of tokens in the output response.	GET/POST
Reader API	`https://s.jina.ai`	Search the web and convert results to LLM-friendly text		100 RPM	1,000 RPM	2.5s	Every request costs a fixed number of tokens, starting from 10,000 tokens.	GET/POST
Embedding API	`https://api.jina.ai/v1/embeddings`	Convert text/images to fixed-length vectors		500 RPM & 1,000,000 TPM	2,000 RPM & 5,000,000 TPM	depends on the input size	Count the number of tokens in the input request.	POST
Reranker API	`https://api.jina.ai/v1/rerank`	Rank documents by query		500 RPM & 1,000,000 TPM	2,000 RPM & 5,000,000 TPM	depends on the input size	Count the number of tokens in the input request.	POST
Classifier API	`https://api.jina.ai/v1/train`	Train a classifier using labeled examples		20 RPM & 200,000 TPM	60 RPM & 1,000,000 TPM	depends on the input size	Tokens counted as: input_tokens × num_iters	POST
Classifier API (Few-shot)	`https://api.jina.ai/v1/classify`	Classify inputs using a trained few-shot classifier		20 RPM & 200,000 TPM	60 RPM & 1,000,000 TPM	depends on the input size	Tokens counted as: input_tokens	POST
Classifier API (Zero-shot)	`https://api.jina.ai/v1/classify`	Classify inputs using zero-shot classification		200 RPM & 500,000 TPM	1,000 RPM & 3,000,000 TPM	depends on the input size	Tokens counted as: input_tokens + label_tokens	POST
Segmenter API	`https://api.jina.ai/v1/segment`	Tokenize and segment long text	20 RPM	200 RPM	1,000 RPM	0.3s	Tokens are not counted as usage.	GET/POST
DeepSearch	`https://deepsearch.jina.ai/v1/chat/completions`	Reason, search and iterate to find the best answer		50 RPM	500 RPM	56.7s	Count the total number of tokens in the whole process.	POST

FAQ

How much does the Segmenter API cost?

If I don't provide an API key, what is the rate limit?

If I provide an API key, what is the rate limit?

Will you charge the tokens from my API key?

Does the Segmenter API support multiple languages?

What is the difference between GET and POST requests?

What is the maximum length I can tokenize per request?

How does the chunking feature work? Is it semantic chunking?

How do you handle special tokens such as 'endoftext' in the Segmenter API?

Does chunking support languages other than English?

How do I get my API key?

What's the rate limit?

Can I use the same API key for reader, embedding, reranking, classifying and fine-tuning APIs?

Can I monitor the token usage of my API key?

What should I do if I forget my API key?

Do API keys expire?

Can I transfer tokens between API keys?

Can I revoke my API key?

Why is the first request for some models slow?

Is user input data used for training your models?

Is billing based on the number of sentences or requests?

Is there a free trial available for new users?

Are tokens charged for failed requests?

What payment methods are accepted?

Is invoicing available for token purchases?