GPU Support? #60
Does Qdrant support running on GPUs with CUDA?
Replies: 2 comments 4 replies
Hi, there is no such feature in the current version. We are considering GPU support, but it is not a priority for us. In our experience, a much larger gain can be achieved by distributing the service across regular CPU machines, which would be cheaper and more efficient.
GPU support for vector search is a meaningful optimization at scale: the gap between CPU and GPU HNSW search grows as collection size and dimensionality increase.
Current state of GPU vector search (as of 2024-2025):
When GPU actually helps: For RAG workloads (many queries from many users), the throughput advantage of GPU search matters. For single-user interactive search, CPU HNSW's lower per-query latency often wins.
The practical alternative: If you need more throughput than CPU HNSW provides, horizontal scaling (more Qdrant nodes) is often more operationally straightforward than a GPU setup. GPU vector search requires careful batching to amortize the host-to-device transfer overhead.
Is your use case about throughput (many concurrent queries) or about query latency?
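To make the batching trade-off concrete, here is a toy cost model, not a benchmark. It assumes each GPU batch pays a fixed overhead (transfer plus kernel launch) and a small per-query compute cost. The function name `batch_stats` and all of the numbers are hypothetical and chosen only for illustration; they are not measurements of Qdrant or of any GPU library.

```python
def batch_stats(batch_size, fixed_overhead_ms, per_query_ms):
    """Return (batch latency in ms, throughput in queries/s) for one batch.

    Toy model: every batch pays a fixed overhead once, plus a cost
    that grows linearly with the number of queries in the batch.
    """
    latency_ms = fixed_overhead_ms + batch_size * per_query_ms
    throughput_qps = batch_size / latency_ms * 1000.0
    return latency_ms, throughput_qps


# Hypothetical costs, for illustration only (not measured values).
FIXED_OVERHEAD_MS = 1.0   # assumed transfer + launch cost per batch
PER_QUERY_MS = 0.01       # assumed compute cost per query in a batch

for batch_size in (1, 16, 256):
    latency_ms, qps = batch_stats(batch_size, FIXED_OVERHEAD_MS, PER_QUERY_MS)
    print(f"batch={batch_size:4d}  latency={latency_ms:.2f} ms  throughput={qps:,.0f} qps")
```

Under these assumptions, larger batches raise throughput because the fixed overhead is shared across more queries, but every query in the batch waits for the whole batch to finish. That is why the throughput-versus-latency question decides whether a GPU is worth it.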