Check-in begins at 8:30 AM.
The first 200 to arrive get an exclusive t-shirt.

June 11th, 2026, at The Midway, San Francisco

Welcome

QDRANT

We Do it the Hard Way

Qdrant started for search, and remains for search. From its roots as an open source side project to millions of downloads and thousands of community members worldwide, Qdrant has always sought to build the highest quality search engine, even when that meant doing things the hard way. Hear about our origins, the decisions that shaped the product, and our future trajectory from Qdrant Co-Founder and CEO, Andre Zayarni. Welcome to Vector Space Day.

QDRANT

When Search Gets Serious

From a strong Rust core to composable retrieval primitives, Qdrant was built from the start for speed, accuracy, and flexibility. Today it supports thousands of projects and a growing number of enterprise businesses around the world, all from the same core engine. But the roadmap is far from finished. Where is Qdrant headed next, and what does it mean for the teams building on it? Come to this session to find out what's coming and why it matters.

AGENTS & MEMORY

Continual Learning Starts with Memory

Continual learning has long been treated as a training problem: new data, new gradients, new weights. But for production agents, the first unlock isn't retraining, it's memory. This talk reframes continual learning as a memory, retrieval, and state-management problem, showing how agents capture interactions, structure durable context, and improve decisions over time. Taranjeet shares patterns from building Mem0, including trade-offs of a memory layer on vector databases and what breaks at scale.

SEARCH & RETRIEVAL

Using GraphRAG to Improve Enterprise Governance

Enterprise AI agents are only as trustworthy as the rules they operate within. This talk presents a practical blueprint for combining Qdrant's vector search with a Neo4j graph governance layer — building agents that retrieve fast and stay policy-compliant. Through a live demo, we show how the same query returns different results for different users based on governance, not just relevance. Attendees leave with a concrete architecture for enterprise AI agents that are smart, fast, and safe.

Coffee Break

15' 10:40 AM

AGENTS & MEMORY

Literal Skill Issue: How Skills.md Might Be Holding Your Agents Back

SKILLS.md files have saved us from the massive headache of MCP servers, but having a human manually write and update static markdown files just doesn't scale. This session will break down the very real limitations of hardcoding what your agents can do, from brittle maintenance loops to capability ceilings that cap what agents can learn on their own. Come find out why SKILLS.md is just our awkward transitional phase, and how we're going to replace it with dynamic, autonomous tooling that evolves as your agents do.

AGENTS & MEMORY

Free Your Agent's Mind...with Context Graphs

AI systems need more than intelligence; they need context that persists. Without it, even strong models misinterpret information, lose decision rationale, or repeat mistakes. Context Graphs address this: a living graph capturing not just what was retrieved, but how context led to actions through tool calls, constraints, and outcomes, stitched across entities and time. This talk shows how context graphs complement retrieval with multi-hop structured assembly and built-in provenance for enterprise-ready AI.

SEARCH & RETRIEVAL

Building the Infra Behind 20 Billion+ Vectors

This talk traces HubSpot's journey from a Helm-based Qdrant deployment, where cluster provisioning and scaling were manual, error-prone, multi-step processes, to a fully automated Kubernetes Operator built on HubSpot's internal kube-operators framework. Learn how the team designed the operator to handle rolling upgrades, automated scaling, and self-healing across a fleet managing 20 billion+ vectors, and the lessons learned running Qdrant at this scale in production.

SEARCH & RETRIEVAL

Scaling to Billions: Lessons from Slack's Semantic Search Indexing

Slack's semantic search indexes trillions of messages into vectors, kept searchable within seconds. This open discussion skips the "perfect world" diagrams and covers what it actually takes to run a vector pipeline at this scale: a Lambda architecture with a "snowball" caching system to avoid recomputing billions of embeddings weekly, greedy batching for a 3x inference speedup, and a candid look at why complex quantization methods failed in production.

Housekeeping

5' 12:15 PM

Lunch Break

60' 12:20 PM

QDRANT

Building the DNA of Search

Qdrant was engineered from the ground up for performance, scale, and flexibility — and Oncotelic Therapeutics put it to the test. Indexing 28M PubMed abstracts to power AI-driven drug development, Oncotelic compressed concept-to-clinic to ~2 years — a fraction of the typical biotech timeline. In this conversation, Qdrant Engineering and Oncotelic walk through what matters most for search at scale: hybrid retrieval, MeSH-enriched metadata filtering, and the operational realities of running a vector database in production.

AGENTS & MEMORY

Building Distributed, Enterprise-ready Agentic AI

A high-level look at building intelligent, enterprise-grade AI agents using modern tools and infrastructure. This session explores how scalable systems can support context-aware reasoning, long-term memory, and real-time decision-making at production scale. We frame the key architectural patterns behind reliable AI agents in enterprise environments, from retrieval and orchestration to evaluation and observability. Learn how Vultr's global cloud infrastructure powers distributed AI workloads for companies shipping agents today.

AGENTS & MEMORY

The Document Harness: What Your AI Misses in the 90%

An estimated 90% of enterprise data is unstructured, living in PDFs, PowerPoints, Word, and Excel files that power a majority of knowledge work. There's a huge opportunity to build autonomous agents that can understand, reason over, and edit massive quantities of documents. But real-world documents are too complex for even frontier models to understand. This session walks through core challenges and advances in document OCR and agent harnesses enabling modern document workflow automation.

SEARCH & RETRIEVAL

Stop Vibe Shipping: Evaluate Your Retrieval

"Looks good to me" is not an evaluation strategy. Yet most teams ship retrieval systems that way: tweak the chunking, run a few demo queries, call it done. This talk replaces vibes with measurement. We'll cover retrieval metrics that actually matter, how to build golden datasets that survive contact with reality, where LLM-as-judge helps and where it lies, and how to wire continuous evals into CI so regressions show up before customer complaints.

SEARCH & RETRIEVAL

Beyond the Single API Call: Agentic Video Intelligence

Video is a highly information-dense modality, and processing it at scale requires more than standard embed-store-retrieve pipelines. This talk explores how Twelve Labs' multimodal foundation models enable rich semantic understanding of video, from domain-specific search to structured metadata extraction. We'll walk through a real-world anomaly detection app built on Twelve Labs and Qdrant, and introduce Jockey, an agentic framework for multi-step video workflows.

Coffee Break

15' 3:00 PM

AGENTS & MEMORY

The Long and the Short of AI Memory

What is AI Memory today? Where does it live: markdown files, vectors, graphs, or somewhere else? And where will it be tomorrow? The answer matters because memory is quickly becoming the differentiator between agents that forget everything between sessions and agents that actually compound knowledge over time. In this talk, Dave will share examples of how short-term and long-term memory are used in production today, using OpenClaw as a hands-on example to illustrate the patterns and trade-offs.

EDGE & ROBOTICS

The World is Becoming More Searchable

The surface areas of search are only increasing as more and more data is captured from the physical and digital world. Embeddable systems unlock the ability to run local search across millions of devices, enabling AI at the edge without relying on round trips to the cloud. Come to this session to hear about Qdrant Edge: what it is, how it's built, the use cases it opens up, and why we think it fundamentally changes the game for teams building on-device AI experiences.

EDGE & ROBOTICS

When Latency Is the Product: Practical Patterns for On-Device GenAI

Agents are moving from the cloud to the edge, onto phones, PCs, and edge devices that perceive and act in real time. I'll share practical patterns for building agentic AI on-device: how to run models and tap local context for retrieval and memory. These workloads run on the NPU, a chip built for AI inference, which is why sustained, private, low-latency AI workloads win on the edge. And Qualcomm AI Hub is how you ship it: optimize any model and run it across devices.

EDGE & ROBOTICS

Tell the Robot What You Want

Learn how to build robots that respond to natural language commands using an open-source agentic AI framework. Sensors and actuators become agent tools, translating intents into actions. A lightweight hybrid architecture handles low-latency control locally on edge devices while delegating complex reasoning to the cloud. Explore hybrid edge-cloud patterns for low-latency control and intelligent planning, demonstrated live with a working robot.

Closing Remarks & Hackathon Winners

25' 4:35 PM

Happy Hour 🎉

5:00 PM
