Vector stores
๐๏ธ Alibaba Cloud OpenSearch
Alibaba Cloud Opensearch is a one-stop platform to develop intelligent search services. OpenSearch was built on the large-scale distributed search engine developed by Alibaba. OpenSearch serves more than 500 business cases in Alibaba Group and thousands of Alibaba Cloud customers. OpenSearch helps develop search services in different search scenarios, including e-commerce, O2O, multimedia, the content industry, communities and forums, and big data query in enterprises.
๐๏ธ AnalyticDB
AnalyticDB for PostgreSQL is a massively parallel processing (MPP) data warehousing service that is designed to analyze large volumes of data online.
๐๏ธ Annoy
Annoy (Approximate Nearest Neighbors Oh Yeah) is a C++ library with Python bindings to search for points in space that are close to a given query point. It also creates large read-only file-based data structures that are mmapped into memory so that many processes may share the same data.
๐๏ธ Atlas
Atlas is a platform for interacting with both small and internet scale unstructured datasets by Nomic.
๐๏ธ AwaDB
AwaDB is an AI Native database for the search and storage of embedding vectors used by LLM Applications.
๐๏ธ Azure Cognitive Search
Azure Cognitive Search (formerly known as Azure Search) is a cloud search service that gives developers infrastructure, APIs, and tools for building a rich search experience over private, heterogeneous content in web, mobile, and enterprise applications.
๐๏ธ Cassandra
Apache Cassandraยฎ is a NoSQL, row-oriented, highly scalable and highly available database.
๐๏ธ Chroma
Chroma is a AI-native open-source vector database focused on developer productivity and happiness. Chroma is licensed under Apache 2.0.
๐๏ธ Clarifai
Clarifai is an AI Platform that provides the full AI lifecycle ranging from data exploration, data labeling, model training, evaluation, and inference. A Clarifai application can be used as a vector database after uploading inputs.
๐๏ธ ClickHouse Vector Search
ClickHouse is the fastest and most resource efficient open-source database for real-time apps and analytics with full SQL support and a wide range of functions to assist users in writing analytical queries. Lately added data structures and distance search functions (like L2Distance) as well as approximate nearest neighbor search indexes enable ClickHouse to be used as a high performance and scalable vector database to store and search vectors with SQL.
๐๏ธ Activeloop's Deep Lake
Activeloop's Deep Lake as a Multi-Modal Vector Store that stores embeddings and their metadata including text, jsons, images, audio, video, and more. It saves the data locally, in your cloud, or on Activeloop storage. It performs hybrid search including embeddings and their attributes.
๐๏ธ DocArrayHnswSearch
DocArrayHnswSearch is a lightweight Document Index implementation provided by Docarray that runs fully locally and is best suited for small- to medium-sized datasets. It stores vectors on disk in hnswlib, and stores all other data in SQLite.
๐๏ธ DocArrayInMemorySearch
DocArrayInMemorySearch is a document index provided by Docarray that stores documents in memory. It is a great starting point for small datasets, where you may not want to launch a database server.
๐๏ธ ElasticSearch
Elasticsearch is a distributed, RESTful search and analytics engine. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents.
๐๏ธ FAISS
Facebook AI Similarity Search (Faiss) is a library for efficient similarity search and clustering of dense vectors. It contains algorithms that search in sets of vectors of any size, up to ones that possibly do not fit in RAM. It also contains supporting code for evaluation and parameter tuning.
๐๏ธ Hologres
Hologres is a unified real-time data warehousing service developed by Alibaba Cloud. You can use Hologres to write, update, process, and analyze large amounts of data in real time.
๐๏ธ LanceDB
LanceDB is an open-source database for vector-search built with persistent storage, which greatly simplifies retrevial, filtering and management of embeddings. Fully open source.
๐๏ธ Marqo
This notebook shows how to use functionality related to the Marqo vectorstore.
๐๏ธ MatchingEngine
This notebook shows how to use functionality related to the GCP Vertex AI MatchingEngine vector database.
๐๏ธ Milvus
Milvus is a database that stores, indexes, and manages massive embedding vectors generated by deep neural networks and other machine learning (ML) models.
๐๏ธ MongoDB Atlas
MongoDB Atlas is a fully-managed cloud database available in AWS , Azure, and GCP. It now has support for native Vector Search on your MongoDB document data.
๐๏ธ MyScale
MyScale is a cloud-based database optimized for AI applications and solutions, built on the open-source ClickHouse.
๐๏ธ OpenSearch
OpenSearch is a scalable, flexible, and extensible open-source software suite for search, analytics, and observability applications licensed under Apache 2.0. OpenSearch is a distributed search and analytics engine based on Apache Lucene.
๐๏ธ pg_embedding
pgembedding is an open-source vector similarity search for Postgres that uses Hierarchical Navigable Small Worlds for approximate nearest neighbor search.
๐๏ธ PGVector
PGVector is an open-source vector similarity search for Postgres
๐๏ธ Pinecone
Pinecone is a vector database with broad functionality.
๐๏ธ Qdrant
Qdrant (read: quadrant ) is a vector similarity search engine. It provides a production-ready service with a convenient API to store, search, and manage points - vectors with an additional payload. Qdrant is tailored to extended filtering support. It makes it useful for all sorts of neural network or semantic-based matching, faceted search, and other applications.
๐๏ธ Redis
Redis (Remote Dictionary Server) is an in-memory data structure store, used as a distributed, in-memory keyโvalue database, cache and message broker, with optional durability.
๐๏ธ Rockset
Rockset is a real-time analytics database service for serving low latency, high concurrency analytical queries at scale. It builds a Converged Indexโข on structured and semi-structured data with an efficient store for vector embeddings. Its support for running SQL on schemaless data makes it a perfect choice for running vector search with metadata filters.
๐๏ธ SingleStoreDB
SingleStoreDB is a high-performance distributed SQL database that supports deployment both in the cloud and on-premises. It provides vector storage, and vector functions including dotproduct and euclideandistance, thereby supporting AI applications that require text similarity matching.
๐๏ธ scikit-learn
scikit-learn is an open source collection of machine learning algorithms, including some implementations of the k nearest neighbors. SKLearnVectorStore wraps this implementation and adds the possibility to persist the vector store in json, bson (binary json) or Apache Parquet format.
๐๏ธ StarRocks
StarRocks is a High-Performance Analytical Database.
๐๏ธ Supabase (Postgres)
Supabase is an open source Firebase alternative. Supabase is built on top of PostgreSQL, which offers strong SQL querying capabilities and enables a simple interface with already-existing tools and frameworks.
๐๏ธ Tair
Tair is a cloud native in-memory database service developed by Alibaba Cloud.
๐๏ธ Tigris
Tigris is an open source Serverless NoSQL Database and Search Platform designed to simplify building high-performance vector search applications.
๐๏ธ Typesense
Typesense is an open source, in-memory search engine, that you can either self-host or run on Typesense Cloud.
๐๏ธ Vectara
Vectara is a API platform for building LLM-powered applications. It provides a simple to use API for document indexing and query that is managed by Vectara and is optimized for performance and accuracy.
๐๏ธ Weaviate
Weaviate is an open-source vector database. It allows you to store data objects and vector embeddings from your favorite ML-models, and scale seamlessly into billions of data objects.
๐๏ธ Zilliz
Zilliz Cloud is a fully managed service on cloud for LF AI Milvusยฎ,