Qdrant Summer of Code 24

Qdrant Summer of Code 24

Google Summer of Code (#GSoC) is celebrating its 20th anniversary this year with the 2024 program. Over the past 20 years, 19K new contributors were introduced to #opensource through the program under the guidance of thousands of mentors from over 800 open-source organizations in various fields. Qdrant participated successfully in the program last year. Both projects, the UI Dashboard with unstructured data visualization and the advanced Geo Filtering, were completed in time and are now a part of the engine. One of the two young contributors joined the team and continues working on the project.

We are thrilled to announce that Qdrant was ๐๐Ž๐“ ๐š๐œ๐œ๐ž๐ฉ๐ญ๐ž๐ into the GSoc 2024 program for unknown reasons, but instead, we are introducing our own ๐๐๐ซ๐š๐ง๐ญ ๐’๐ฎ๐ฆ๐ฆ๐ž๐ซ ๐จ๐Ÿ ๐‚๐จ๐๐ž program with a stipend for contributors! To not reinvent the wheel, we follow all the timelines and rules of the official Google program.

Our project ideas.

We have prepared some excellent project ideas. Take a look and choose if you want to contribute in Rust or a Python-based project.

โžก WASM-based dimension reduction viz ๐Ÿ“Š

Implement a dimension reduction algorithm in Rust, compile to WASM and integrate the WASM code with Qdrant Web UI.

โžก Efficient BM25 and Okapi BM25, which uses the BERT Tokenizer ๐Ÿฅ‡

BM25 and Okapi BM25 are popular ranking algorithms. Qdrant’s FastEmbed supports dense embedding models. We need a fast, efficient, and massively parallel Rust implementation with Python bindings for these.

โžก ONNX Cross Encoders in Python โš”๏ธ

Export a cross-encoder ranking models to operate on ONNX runtime and integrate this model with the Qdrant’s FastEmbed to support efficient re-ranking

โžก Ranking Fusion Algorithms implementation in Rust ๐Ÿงช

Develop Rust implementations of various ranking fusion algorithms including but not limited to Reciprocal Rank Fusion (RRF). For a complete list, see: https://github.com/AmenRa/ranx and create Python bindings for the implemented Rust modules.

โžก Setup Jepsen to test Qdrantโ€™s distributed guarantees ๐Ÿ’ฃ

Design and write Jepsen tests based on implementations for other Databases and create a report or blog with the findings.

See all details on our Notion page: https://www.notion.so/qdrant/GSoC-2024-ideas-1dfcc01070094d87bce104623c4c1110

Contributor application period begins on March 18th. We will accept applications via email. Let’s contribute and celebrate together!

In open-source, we trust! ๐Ÿฆ€๐Ÿค˜๐Ÿš€