The Ingest Library is a powerful tool designed to coordinate the process of pulling data from data providers, partitioning the content, and pushing that new content to a desired location. This technical documentation will provide an in-depth understanding of the Ingest Library, including its features, architecture, installation, configuration, usage, API reference, troubleshooting, examples, and more.
Elements
for processing and distribution.
pip install unstructured
to install the latest version of the unstructured library which include the ingest code and the cli.
pip install unstructured[CONNECTOR_DEPS]
where CONNECTOR_DEPS
references the extra dependency label for a particular connector. For example, pip install unstructured[s3]
will install the dependencies to interact with the s3 connectors. If these aren’t installed before hand, a convenient error message will be printed for you when you run the ingest CLI for the first time, prompting you with the correct pip command to run.
unstructured-ingest --help
to get all the available commands.