OpenSearchDocumentStore
A Document Store for storing and retrieval from OpenSearch.
OpenSearch is a fully open source search and analytics engine for use cases such as log analytics, real-time application monitoring, and clickstream analysis. For more information, see the OpenSearch documentation.
This Document Store is great if you want to evaluate the performance of different retrieval options (dense vs. sparse). It’s compatible with the Amazon OpenSearch Service.
OpenSearch provides support for vector similarity comparisons and approximate nearest neighbors algorithms.
Initialization
Install and run an OpenSearch instance.
If you have Docker set up, we recommend pulling the Docker image and running it.
docker pull opensearchproject/opensearch:2.11.0
docker run -p 9200:9200 -p 9600:9600 -e "discovery.type=single-node" -e "ES_JAVA_OPTS=-Xms1024m -Xmx1024m" opensearchproject/opensearch:2.11.0
As an alternative, you can go to OpenSearch integration GitHub and start a Docker container running OpenSearch using the provided docker-compose.yml
:
docker compose up
Once you have a running OpenSearch instance, install the opensearch-haystack
integration:
pip install opensearch-haystack
Then, initialize an OpenSearchDocumentStore
object that’s connected to the OpenSearch instance and writes documents to it:
from haystack_integrations.document_stores.opensearch import OpenSearchDocumentStore
from haystack import Document
document_store = OpenSearchDocumentStore(hosts="<http://localhost:9200>", use_ssl=True,
verify_certs=False, http_auth=("admin", "admin"))
document_store.write_documents([
Document(content="This is first"),
Document(content="This is second")
])
print(document_store.count_documents())
Supported Retrievers
OpenSearchBM25Retriever
: A keyword-based Retriever that fetches documents matching a query from the Document Store.
OpenSearchEmbeddingRetriever
: Compares the query and document embeddings and fetches the documents most relevant to the query.
Updated 7 months ago