Vector database

Short description: Type of specialized database system

A vector database management system (VDBMS), also called a vector database or vector store, is a database that stores vectors (fixed-length lists of numbers) along with other data items. Vector databases typically implement one or more approximate nearest neighbor (ANN) algorithms,[1][2] so that the database can be searched with a query vector to retrieve the closest matching records.
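The query pattern described above can be illustrated with a minimal sketch. It stores fixed-length vectors next to an arbitrary payload and answers a query by exact (brute-force) Euclidean search. The class `TinyVectorStore` and its methods are hypothetical names chosen for this illustration, not the API of any product; a real vector database would typically replace the linear scan with an ANN index, trading some accuracy for speed.

```python
import heapq
import math


class TinyVectorStore:
    """Hypothetical in-memory store: fixed-length vectors plus a payload."""

    def __init__(self, dim):
        self.dim = dim
        self.records = []  # list of (vector, payload) pairs

    def add(self, vector, payload):
        if len(vector) != self.dim:
            raise ValueError(f"expected {self.dim} dimensions, got {len(vector)}")
        self.records.append((tuple(vector), payload))

    def search(self, query, k=1):
        """Return up to k (distance, payload) pairs, nearest first.

        This is an exact linear scan, used here as a baseline; it is
        not an approximate nearest neighbor algorithm.
        """
        if len(query) != self.dim:
            raise ValueError(f"expected {self.dim} dimensions, got {len(query)}")
        scored = ((math.dist(query, vec), payload) for vec, payload in self.records)
        return heapq.nsmallest(k, scored, key=lambda pair: pair[0])


# Invented sample data for illustration only.
store = TinyVectorStore(dim=3)
store.add([1.0, 0.0, 0.0], "record A")
store.add([0.0, 1.0, 0.0], "record B")
store.add([0.9, 0.1, 0.0], "record C")

results = store.search([1.0, 0.0, 0.0], k=2)
print([payload for _, payload in results])
```

The scan costs time proportional to the number of stored records, which is why large collections usually rely on approximate indexes instead.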

Vectors are mathematical representations of data in a high-dimensional space. In this space, each dimension corresponds to a feature of the data, and tens of thousands of dimensions might be used to represent sophisticated data. A vector's position in this space represents its characteristics. Words, phrases, and entire documents, as well as images, audio, and other types of data, can all be vectorized.[3]

These feature vectors may be computed from the raw data using machine learning methods such as feature extraction algorithms, word embeddings[4] or deep learning networks. The goal is that semantically similar data items receive feature vectors that are close to each other.
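As a toy illustration of this goal, the sketch below uses word counts over a small fixed vocabulary as a stand-in for feature extraction and compares the resulting vectors with cosine similarity. The vocabulary, the sample sentences, and the function names are assumptions made for this example. Word counts capture only shared vocabulary, whereas learned embeddings are intended to capture meaning more broadly.

```python
import math
from collections import Counter

# Hypothetical fixed vocabulary; each word is one dimension of the vector.
VOCABULARY = ["cat", "dog", "pet", "stock", "market", "price"]


def bag_of_words(text):
    """Toy feature extraction: count how often each vocabulary word occurs."""
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in VOCABULARY]


def cosine_similarity(a, b):
    """Cosine of the angle between a and b; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Invented sample sentences for illustration only.
pets_1 = bag_of_words("my cat is a pet")
pets_2 = bag_of_words("a dog is a pet too")
finance = bag_of_words("the stock market price fell")

similar = cosine_similarity(pets_1, pets_2)      # shares the word "pet"
unrelated = cosine_similarity(pets_1, finance)   # no shared vocabulary words
print(similar > unrelated)
```

The two sentences about pets end up closer to each other than either is to the finance sentence, which is the property a similarity search depends on.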

Vector databases can be used for similarity search, multi-modal search, recommendation engines, large language models (LLMs), etc.[5]

Vector databases are also used to implement retrieval-augmented generation (RAG), a method to improve domain-specific responses of large language models. Text documents describing the domain of interest are collected, and for each document a feature vector (known as an "embedding") is computed, typically using a deep learning network, and stored in a vector database. Given a user prompt, the feature vector of the prompt is computed, and the database is queried to retrieve the most relevant documents. These are then automatically added to the context window of the large language model, which generates a response to the prompt given this context.[6]
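The retrieval steps of this process can be sketched as follows. The `embed` function is a deliberately simple stand-in (normalized word counts) for a learned embedding model, a plain Python list plays the role of the vector database, and the call to the language model itself is left out. All function names and the sample documents are assumptions made for this illustration.

```python
import math
from collections import Counter


def embed(text, vocabulary):
    """Stand-in for a learned embedding model: unit-length word counts."""
    counts = Counter(text.lower().split())
    vec = [float(counts[word]) for word in vocabulary]
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def retrieve(prompt, documents, index, vocabulary, k=1):
    """Rank stored documents by dot product with the prompt embedding."""
    query = embed(prompt, vocabulary)
    scored = [
        (sum(a * b for a, b in zip(query, vec)), doc)
        for vec, doc in zip(index, documents)
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:k]]


def build_augmented_prompt(prompt, context_docs):
    """Place the retrieved documents ahead of the user's question."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return f"Context:\n{context}\n\nQuestion: {prompt}"


# Invented domain documents for illustration only.
documents = [
    "the warranty covers battery replacement for two years",
    "the device charges over usb-c in ninety minutes",
    "returns are accepted within thirty days of purchase",
]
vocabulary = sorted({word for doc in documents for word in doc.split()})
index = [embed(doc, vocabulary) for doc in documents]  # the "vector database"

question = "how long is the battery warranty"
top_docs = retrieve(question, documents, index, vocabulary, k=1)
augmented = build_augmented_prompt(question, top_docs)
print(augmented)
```

In a full system, the string in `augmented` would be sent to the language model, and `index` would be held in a vector database rather than in memory.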

List of vector databases

Name | License
Apache Cassandra[7][8] | Apache License 2.0
Azure Cosmos DB Vector Database Extension[9] | N/A (managed service)
LlamaIndex[10] | MIT License[11]
Milvus[12][13] | Apache License 2.0
MongoDB Atlas[14] | N/A (managed service)
Couchbase[15][16] | Unknown (preview)
Pinecone[17] | Closed source
Redis Cloud[18] | Managed service, Redis Source Available License
Postgres with pgvector[19] | PostgreSQL License[20]
Qdrant[21] | Apache License 2.0[22]
Weaviate[23] | BSD 3-Clause[24]
Chroma[25][26] | Apache License 2.0[27]
Elasticsearch[28] | Server Side Public License, Elastic License[29]
Vespa[30] | Apache License 2.0[31]
SurrealDB[32] | Business Source License, Apache License (after 4 years)[33]

References

  1. "What is a Vector Database & How Does it Work". Pinecone. https://www.pinecone.io/learn/vector-database/. 
  2. "What is a vector database". Elastic. https://www.elastic.co/what-is/vector-database. 
  3. "Vector database - Azure Cosmos DB". 2023-12-26. https://learn.microsoft.com/en-us/azure/cosmos-db/vector-database. 
  4. Evan Chaki (2023-07-31). "What is a vector database?". Microsoft. https://learn.microsoft.com/en-us/semantic-kernel/memories/vector-db. "A vector database is a type of database that stores data as high-dimensional vectors, which are mathematical representations of features or attributes." 
  5. "Vector database - Azure Cosmos DB". 2023-12-26. https://learn.microsoft.com/en-us/azure/cosmos-db/vector-database. 
  6. Lewis, Patrick; Perez, Ethan; Piktus, Aleksandra; Petroni, Fabio; Karpukhin, Vladimir; Goyal, Naman; Küttler, Heinrich (2020). "Retrieval-augmented generation for knowledge-intensive NLP tasks". Advances in Neural Information Processing Systems 33: 9459–9474. 
  7. "5 Hard Problems in Vector Search, and How Cassandra Solves Them" (in en-US). 2023-09-22. https://thenewstack.io/5-hard-problems-in-vector-search-and-how-cassandra-solves-them/. 
  8. "Vector Search quickstart". https://cassandra.apache.org/doc/latest/cassandra/vector-search/overview.html. 
  9. "Vector database - Azure Cosmos DB". https://learn.microsoft.com/azure/cosmos-db/vector-database. 
  10. Wiggers, Kyle (2023-06-06). "LlamaIndex adds private data to large language models" (in en-US). https://techcrunch.com/2023/06/06/llamaindex-adds-private-data-to-large-language-models/. 
  11. "llama_index/LICENSE at main · run-llama/llama_index" (in en). https://github.com/run-llama/llama_index/blob/main/LICENSE. 
  12. "Open Source Vector Database – Milvus – LFAI & DATA". https://milvus.io/. 
  13. Lunden, Ingrid; Liao, Rita (2022-08-24). "Zilliz raises $60M, relocates to SF" (in en-US). https://techcrunch.com/2022/08/24/zilliz-the-startup-behind-the-milvus-open-source-vector-database-for-ai-applications-raises-60m-and-relocates-to-sf/. 
  14. "Introducing Atlas Vector Search: Build Intelligent Applications with Semantic Search and AI Over Any Type of Data" (in en-US). 2023-06-22. https://www.mongodb.com/blog/post/introducing-atlas-vector-search-build-intelligent-applications-semantic-search-ai. 
  15. "Couchbase aims to boost developer database productivity with Capella IQ AI tool" (in en-US). 2023-08-30. https://venturebeat.com/ai/couchbase-aims-to-boost-developer-database-productivity-with-capella-iq-ai-tool/#h-next-on-the-roadmap-for-couchbase-is-vector-support. 
  16. "Investor Presentation Third Quarter Fiscal 2024" (in en-US). 2023-12-06. https://investors.couchbase.com/static-files/551e5b96-5307-4119-b225-19cfd8540242. 
  17. "Pinecone leads 'explosion' in vector databases for generative AI" (in en-US). 2023-07-14. https://venturebeat.com/ai/pinecone-leads-explosion-in-vector-databases-for-generative-ai/. 
  18. "Redis as a vector database quick start guide" (in en). https://redis.io/docs/get-started/vector-database/. 
  19. "pgvector" (in en-US). https://github.com/pgvector/pgvector. 
  20. "pgvector/License" (in en-US). https://github.com/pgvector/pgvector/blob/master/LICENSE. 
  21. Sawers, Paul (2023-04-19). "Qdrant, an open source vector database startup, wants to help AI developers leverage unstructured data" (in en-US). https://techcrunch.com/2023/04/19/qdrant-an-open-source-vector-database-startup-wants-to-help-ai-developers-leverage-unstructured-data/. 
  22. "qdrant/LICENSE at master · qdrant/qdrant" (in en). https://github.com/qdrant/qdrant/blob/master/LICENSE. 
  23. "Weaviate reels in $50M for its AI-optimized vector database" (in en-US). 2023-04-21. https://siliconangle.com/2023/04/21/weaviate-reels-50m-ai-optimized-vector-database/. 
  24. "weaviate/LICENSE at master · weaviate/weaviate" (in en). https://github.com/weaviate/weaviate/blob/master/LICENSE. 
  25. Palazzolo, Stephanie. "Vector database Chroma scored $18 million in seed funding at a $75 million valuation. Here's why its technology is key to helping generative AI startups." (in en-US). https://www.businessinsider.com/vector-database-startup-chroma-raises-seed-funding-generative-artificial-intelligence-2023-4. 
  26. MSV, Janakiram (2023-07-28). "Exploring Chroma: The Open Source Vector Database for LLMs" (in en-US). https://thenewstack.io/exploring-chroma-the-open-source-vector-database-for-llms/. 
  27. https://github.com/chroma-core/chroma/blob/main/LICENSE
  28. Kerner, Sean (23 May 2023). "Elasticsearch Relevance Engine brings new vectors to generative AI". VentureBeat. https://venturebeat.com/ai/elasticsearch-relevance-engine-brings-new-vectors-to-generative-ai/. 
  29. https://github.com/elastic/elasticsearch/blob/main/LICENSE.txt
  30. Riley, Duncan (4 October 2023). "Yahoo spins off AI scaling engine Vespa as an independent company". siliconANGLE. https://siliconangle.com/2023/10/04/yahoo-spins-off-ai-scaling-engine-vespa-independent-company/. 
  31. https://github.com/vespa-engine/vespa/blob/master/LICENSE
  32. Wiggers, Kyle (2023-01-04). "SurrealDB raises $6M for its database-as-a-service offering" (in en-US). https://techcrunch.com/2023/01/04/surrealdb-raises-6m-startup-funding-database-as-a-service/. 
  33. "SurrealDB | License FAQs | The ultimate multi-model database" (in en). https://surrealdb.com/license.