Available since v0.8.4
Snapshots are performed on a per collection basis and consist in a tar
archive file containing the necessary data to restore the collection at the time of the snapshot.
This feature can be used to archive data or easily replicate an existing deployment.
The target directory used to store generated snapshots is controlled through the configuration or using the ENV variable: QDRANT__STORAGE__SNAPSHOT_PATH=./snapshots
.
storage:
# Where to store snapshots
snapshots_path: ./snapshots
It defaults to ./snapshots
if no value is provided.
Create snapshot
To create a new snapshot for an existing collection:
POST /collections/{collection_name}/snapshots
from qdrant_client import QdrantClient
client = QdrantClient(host="localhost", port=6333)
client.create_snapshot(
collection_name="{collection_name}"
)
This is a synchronous operation for which a tar
archive file will be generated into the snapshot_path
.
List snapshot
List of snapshots for a collection:
GET /collections/{collection_name}/snapshots
from qdrant_client import QdrantClient
client = QdrantClient(host="localhost", port=6333)
client.list_snapshots(
collection_name="{collection_name}"
)
Retrieve snapshot
To download a specified snapshot from a collection as a file:
GET /collections/{collection_name}/snapshots/{snapshot_name}
Only available through the REST API for the time being.
Restore snapshot
There is a difference in recovering snapshots in single-deployment node and distributed deployment mode.
Recover in single deployment mode
Single deployment is simpler, you can recover any collection on the start-up and it will be immediately available in the service. Restoring snapshots is done through the Qdrant CLI at startup time.
The main entry point is the --snapshot
argument which accepts a list of pairs <snapshot_file_path>:<target_collection_name>
For example:
./qdrant --snapshot /snapshots/test-collection-archive.snapshot:test-collection --snapshot /snapshots/test-collection-archive.snapshot:test-copy-collection
The target collection must be absent otherwise the program will exit with an error.
If you wish instead to overwrite an existing collection, use the --force_snapshot
flag with caution.
Recover in cluster deployment
Available since v0.11.3
Recovering in cluster mode is more sophisticated, as Qdrant should maintain consistency across peers even during the recovery process. As the information about created collections is stored in the consensus, even a newly attached cluster node will automatically create collections. Recovering non-existing collections with snapshots won’t make this collection known to the consensus.
To recover snpshot in this case one can use snapshot recovery API:
PUT /collections/<collection_name>/snapshots/recover
{
"location": "http://qdrant-node-1:6333/collections/collection_name/snapshots/snapshot-2022-10-10.shapshot"
}
from qdrant_client import QdrantClient
client = QdrantClient(host="qdrant-node-2", port=6333)
client.recover_snapshot("collection_name", "http://qdrant-node-1:6333/collections/collection_name/snapshots/snapshot-2022-10-10.shapshot")
Qdrant will extract shard data from the snapshot and properly register shards in the cluster. If there are other active replicas of the recovered shards in the cluster, Qdrant will replicate them to the newly recovered node to maintain data consistency.
Snapshots for the whole storage
Available since v0.8.5
Sometimes it might be handy to create snapshot not just for a single collection, but for the whole storage, including collection aliases.
Qdrant provides a dedicated API for that as well. It is similar to collection-level snapshots, but does not require collecton_name
:
Create full storage snapshot
POST /snapshots
from qdrant_client import QdrantClient
client = QdrantClient(host="localhost", port=6333)
client.create_full_snapshot()
List full storage snapshots
GET /snapshots
from qdrant_client import QdrantClient
client = QdrantClient(host="localhost", port=6333)
client.list_full_snapshots()
Download full storage snapshot
GET /snapshots/{snapshot_name}
Restore full storage snapshot
Restoring snapshots is done through the Qdrant CLI at startup time.
For example:
./qdrant --storage-snapshot /snapshots/full-snapshot-2022-07-18-11-20-51.snapshot