Jina Embeddings
Qdrant is compatible with Jina AI embeddings. You can obtain a free trial API key from Jina AI Embeddings to generate embeddings.
Qdrant users can receive a 10% discount on Jina AI APIs by using the code QDRANT.
Technical Summary
Model | Dimension | Language | MRL (Matryoshka) | Context Length
---|---|---|---|---
jina-embeddings-v3 | 1024 | Multilingual (89 languages) | Yes | 8192
jina-embeddings-v2-base-en | 768 | English | No | 8192
jina-embeddings-v2-base-de | 768 | German & English | No | 8192
jina-embeddings-v2-base-es | 768 | Spanish & English | No | 8192
jina-embeddings-v2-base-zh | 768 | Chinese & English | No | 8192
Jina AI recommends using jina-embeddings-v3, as it is their latest and most performant embedding model.
On top of the backbone, jina-embeddings-v3 has been trained with five task-specific adapters for different embedding uses. Include the task parameter in your request to optimize embeddings for your downstream application (a minimal request sketch follows the list below):
- retrieval.query: Used to encode user queries or questions in retrieval tasks.
- retrieval.passage: Used to encode large documents in retrieval tasks at indexing time.
- classification: Used to encode text for text classification tasks.
- text-matching: Used to encode text for similarity matching, such as measuring similarity between two sentences.
- separation: Used for clustering or reranking tasks.
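To illustrate how the task parameter changes the request, here is a minimal sketch; the embed helper and the example sentences are illustrative, while the endpoint and payload fields are the same as in the full example at the end of this page:

import requests

JINA_API_KEY = "jina_xxxxxxxxxxx"  # your Jina AI API key

def embed(texts, task):
    # Minimal illustrative helper: same endpoint and payload fields as the full example below.
    response = requests.post(
        "https://api.jina.ai/v1/embeddings",
        headers={"Authorization": f"Bearer {JINA_API_KEY}"},
        json={"model": "jina-embeddings-v3", "input": texts, "task": task},
    )
    return [d["embedding"] for d in response.json()["data"]]

# Documents are embedded with retrieval.passage at indexing time ...
doc_vectors = embed(["Berlin is the capital of Germany."], task="retrieval.passage")
# ... and user questions with retrieval.query at search time.
query_vector = embed(["What is the capital of Germany?"], task="retrieval.query")[0]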
jina-embeddings-v3 supports Matryoshka Representation Learning, allowing users to control the embedding dimension with minimal performance loss. Include the dimensions parameter in your request to select the desired dimension. By default, dimensions is set to 1024; a value between 256 and 1024 is recommended.
You can reference the table below for hints on dimension vs. performance:
Dimension | 32 | 64 | 128 | 256 | 512 | 768 | 1024
---|---|---|---|---|---|---|---
Average Retrieval Performance (nDCG@10) | 52.54 | 58.54 | 61.64 | 62.72 | 63.16 | 63.3 | 63.35
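As an illustration, here is a hedged sketch of requesting truncated 256-dimensional embeddings and creating a Qdrant collection whose vector size matches; the collection name and input text are placeholders:

import requests
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, VectorParams

JINA_API_KEY = "jina_xxxxxxxxxxx"

response = requests.post(
    "https://api.jina.ai/v1/embeddings",
    headers={"Authorization": f"Bearer {JINA_API_KEY}"},
    json={
        "model": "jina-embeddings-v3",
        "input": ["Your text string goes here"],
        "task": "retrieval.passage",
        "dimensions": 256,  # request truncated Matryoshka embeddings
    },
)
embedding = response.json()["data"][0]["embedding"]
assert len(embedding) == 256

# The collection's vector size must match the requested dimension.
client = QdrantClient(":memory:")
client.create_collection(
    collection_name="my-256d-collection",
    vectors_config=VectorParams(size=256, distance=Distance.DOT),
)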
jina-embeddings-v3 supports Late Chunking, a technique that leverages the model’s long-context capabilities to generate contextual chunk embeddings. Include late_chunking=True in your request to enable contextual chunked representation. When set to true, the Jina AI API concatenates all sentences in the input field and feeds them to the model as a single string. Internally, the model embeds this long concatenated string and then performs late chunking, returning a list of embeddings that matches the size of the input list.
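As a minimal sketch (the chunk texts are made-up examples), the input list holds chunks of a single document and the response returns one contextual embedding per chunk:

import requests

JINA_API_KEY = "jina_xxxxxxxxxxx"

# Chunks of one document; with late_chunking=True they are concatenated and
# embedded together, so each chunk embedding retains the surrounding context.
chunks = [
    "Berlin is the capital of Germany.",
    "The city has a population of about 3.7 million.",
    "It is known for its museums and vibrant cultural scene.",
]
response = requests.post(
    "https://api.jina.ai/v1/embeddings",
    headers={"Authorization": f"Bearer {JINA_API_KEY}"},
    json={
        "model": "jina-embeddings-v3",
        "input": chunks,
        "task": "retrieval.passage",
        "late_chunking": True,
    },
)
embeddings = [d["embedding"] for d in response.json()["data"]]
assert len(embeddings) == len(chunks)  # one contextual embedding per input chunk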
Example
The code below demonstrates how to use jina-embeddings-v3 together with Qdrant:
import requests
import qdrant_client
from qdrant_client.models import Distance, VectorParams, Batch
# Provide Jina API key and choose one of the available models.
JINA_API_KEY = "jina_xxxxxxxxxxx"
MODEL = "jina-embeddings-v3"
DIMENSIONS = 1024 # Or choose your desired output vector dimensionality.
TASK = "retrieval.passage"  # Use "retrieval.passage" for indexing, or "retrieval.query" for querying.
# Get embeddings from the API
url = "https://api.jina.ai/v1/embeddings"
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {JINA_API_KEY}",
}
data = {
    "input": ["Your text string goes here", "You can send multiple texts"],
    "model": MODEL,
    "dimensions": DIMENSIONS,
    "task": TASK,
    "late_chunking": True,
}
response = requests.post(url, headers=headers, json=data)
embeddings = [d["embedding"] for d in response.json()["data"]]
# Index the embeddings into Qdrant
client = qdrant_client.QdrantClient(":memory:")
client.create_collection(
    collection_name="MyCollection",
    vectors_config=VectorParams(size=DIMENSIONS, distance=Distance.DOT),
)
client.upsert(
    collection_name="MyCollection",
    points=Batch(
        ids=list(range(len(embeddings))),
        vectors=embeddings,
    ),
)
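To round out the example, here is a hedged sketch of the query side, continuing from the snippet above (it reuses url, headers, MODEL, DIMENSIONS, and client, and assumes the query_points method available in recent qdrant-client versions): embed the question with the retrieval.query task, then search the collection.

# Embed a user query with the retrieval.query task.
query_data = {
    "input": ["What do my texts talk about?"],
    "model": MODEL,
    "dimensions": DIMENSIONS,
    "task": "retrieval.query",
}
query_vector = requests.post(url, headers=headers, json=query_data).json()["data"][0]["embedding"]

# Search the collection with the query embedding.
results = client.query_points(
    collection_name="MyCollection",
    query=query_vector,
    limit=3,
)
print(results.points)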