Writes Documents to a DocumentStore.
Module document_writer
DocumentWriter
Writes documents to a DocumentStore.
Usage example
from haystack import Document
from haystack.components.writers import DocumentWriter
from haystack.document_stores.in_memory import InMemoryDocumentStore
docs = [
    Document(content="Python is a popular programming language"),
]
doc_store = InMemoryDocumentStore()
doc_store.write_documents(docs)
DocumentWriter.__init__
def __init__(document_store: DocumentStore,
             policy: DuplicatePolicy = DuplicatePolicy.NONE)
Create a DocumentWriter component.
Arguments:
- document_store: The instance of the document store where you want to store your documents.
- policy: The policy to apply when a Document with the same ID already exists in the DocumentStore.
- DuplicatePolicy.NONE: Default policy, relies on the DocumentStore settings.
- DuplicatePolicy.SKIP: Skips documents with the same ID and doesn't write them to the DocumentStore.
- DuplicatePolicy.OVERWRITE: Overwrites documents with the same ID.
- DuplicatePolicy.FAIL: Raises an error if a Document with the same ID is already in the DocumentStore.
DocumentWriter.to_dict
def to_dict() -> Dict[str, Any]
Serializes the component to a dictionary.
Returns:
Dictionary with serialized data.
DocumentWriter.from_dict
@classmethod
def from_dict(cls, data: Dict[str, Any]) -> "DocumentWriter"
Deserializes the component from a dictionary.
Arguments:
- data: The dictionary to deserialize from.
Raises:
- DeserializationError: If the document store is not properly specified in the serialization data or its type cannot be imported.
Returns:
The deserialized component.
DocumentWriter.run
@component.output_types(documents_written=int)
def run(documents: List[Document], policy: Optional[DuplicatePolicy] = None)
Run the DocumentWriter on the given input data.
Arguments:
- documents: A list of documents to write to the document store.
- policy: The policy to use when encountering duplicate documents.
Raises:
- ValueError: If the specified document store is not found.
Returns:
Number of documents written to the document store.
