Your Search Foundation Supercharged.
Our Customers
For Better Search
Our frontier models form the search foundation for high-quality enterprise search and RAG systems.
Start instantly—no credit card or registration needed!
verified_user We are SOC 2 Type 1 & 2 compliant with the American Institute of Certified Public Accountants (AICPA). open_in_new
chevron_leftchevron_right
globe_book
Use
r.jina.ai to read a URL and fetch its contenttravel_explore
Use
s.jina.ai to search the web and get SERPAdd
mcp.jina.ai as your MCP server to access our API in LLMsContent Format
You can control the level of detail in the response to prevent over-filtering. The default pipeline is optimized for most websites and LLM input.
Default
arrow_drop_down
JSON Response
The response will be in JSON format, containing the URL, title, content, and timestamp (if available). In Search mode, it returns a list of five entries, each following the described JSON structure.
Timeout (seconds)
Maximum time to wait for page load. Increase for slow pages, decrease for simple static pages.
Token Budget
Limits the maximum number of tokens used for this request. Exceeding this limit will cause the request to fail.
Use ReaderLM-v2
Experimental
Uses ReaderLM-v2 for HTML to Markdown conversion, to deliver high-quality results for websites with complex structures and contents. Costs 3x tokens!
Extract Only (CSS Selector)
Only extract content matching these CSS selectors. Example: article, .main-content, #post-body
Wait For (CSS Selector)
Wait until these elements appear before extracting content. Useful for dynamically loaded content.
Exclude (CSS Selector)
Remove these elements before extraction. Example: nav, footer, .sidebar, #ads
Remove All Images
Strip all images from the output. Reduces token usage when images are not needed.
OpenAI Citation Format
Format links for OpenAI's web browsing tool. Uses special citation markers compatible with GPT models.
Links Summary Section
A "Buttons & Links" section will be created at the end. This helps the downstream LLMs or web agents navigating the page or take further actions.
None
arrow_drop_down
Images Summary Section
An "Images" section will be created at the end. This gives the downstream LLMs an overview of all visuals on the page, which may improve reasoning.
None
arrow_drop_down
Browser Viewport Size
POST
Set browser window dimensions. Affects responsive layouts and content visibility.
Forward Cookie
Our API server can forward your custom cookie settings when accessing the URL, which is useful for pages requiring extra authentication. Note that requests with cookies will not be cached.
Image Caption
Captions all images at the specified URL, adding 'Image [idx]: [caption]' as an alt tag for those without one. This allows downstream LLMs to interact with the images in activities such as reasoning and summarizing.
Use a Proxy Server
Our API server can utilize your proxy to access URLs, which is helpful for pages accessible only through specific proxies.
Use a Country-Specific Proxy Server
Set country code for location-based proxy server. Use 'auto' for optimal selection or 'none' to disable.
Bypass Cached Content
Our API caches URL contents for a certain amount of time. Set it to true to ignore the cached result and fetch the content from the URL directly.
Cache Tolerance (seconds)
Accept cached content if younger than N seconds. Set to 0 for fresh content (same as Bypass Cache), or higher values to allow faster responses from cache.
Page Ready Timing
When to consider a page fully loaded. Later timings wait longer but capture more dynamic content.
Default
arrow_drop_down
Custom User-Agent
Override the browser User-Agent string. Useful for accessing sites that require specific browsers or block crawlers.
Custom Referer
Set the HTTP Referer header. Some sites check this to verify traffic comes from expected sources.
Preserve Base64 Images
Keep inline base64-encoded images in markdown output instead of converting them to external URLs.
Do Not Cache or Track
Prevent this request from being cached or logged on our servers. Use for sensitive URLs.
Github Flavored Markdown
Opt in/out features from GFM (Github Flavored Markdown).
Enabled
arrow_drop_down
Stream Mode
Stream mode is beneficial for large target pages, allowing more time for the page to fully render. If standard mode results in incomplete content, consider using Stream mode.
Customize Browser Locale
Control the browser locale to render the page. Lots of websites serve different content based on the locale.
Respect robots.txt
Check robots.txt rules before fetching. Specify which bot name to use for the check.
Include iframe Content
Extract content from embedded iframes. Enable for pages with content loaded in iframes.
Include Shadow DOM
Extract content from Shadow DOM components. Enable for pages using web components.
Use Final URL as Base
Resolve relative URLs using the final destination URL after redirects, instead of the original URL.
Local PDF/HTML file
POST
Use Reader on your local PDF and HTML file by uploading them. Only support pdf and html files. For HTML, please also specify a reference URL for better parsing related CSS/JS scripts.
upload
Run JavaScript Before Extraction
POST
Execute custom JS to modify the page before content extraction. Can be inline code or a URL to a script file.
Heading Style
Sets markdown heading format (passed to Turndown).
Hash Style
arrow_drop_down
Horizontal Rule Style
Defines markdown horizontal rule format (passed to Turndown).
Bullet Point Style
Sets bullet list marker character (passed to Turndown).
*
arrow_drop_down
Emphasis Style
Defines markdown emphasis delimiter (passed to Turndown).
_
arrow_drop_down
Strong Emphasis Style
Sets markdown strong emphasis delimiter (passed to Turndown).
**
arrow_drop_down
Link Style
Determines markdown link format (passed to Turndown).
Inline
arrow_drop_down
EU Compliance
Experimental
All infrastructure and data processing operations reside entirely within EU jurisdiction.
upload
Request
GET
Bash
Language
arrow_drop_down
curl "https://r.jina.ai/https://www.example.com"
key
API key
visibility_off
Available tokens
0
Our Publications
Understand how our frontier search models were trained from scratch, check out our latest publications. Meet our team at EMNLP, SIGIR, ICLR, NeurIPS, and ICML!
December 29, 2025
December 04, 2025
AAAI 2026
October 01, 2025
NeurIPS 2025
August 31, 2025
EMNLP 2025
June 24, 2025
ICLR 2025
March 04, 2025
ACL 2025
December 17, 2024
ICLR 2025
December 12, 2024
ECIR 2025
September 18, 2024
SIGIR 2025
September 07, 2024
EMNLP 2024
August 30, 2024
WWW 2025
June 21, 2024
ICML 2024
May 30, 2024
February 26, 2024
October 30, 2023
EMNLP 2023
July 20, 2023
16 publications in total.





































