docs: add Data Prep Kit integration (#316)
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
This commit is contained in:
parent
777237ebc9
commit
93fc1be61a
13
docs/integrations/data_prep_kit.md
Normal file
13
docs/integrations/data_prep_kit.md
Normal file
@ -0,0 +1,13 @@
|
||||
## Get started
|
||||
|
||||
Docling is used by the [Data Prep Kit \[↗\]](https://ibm.github.io/data-prep-kit/) open-source toolkit for preparing unstructured data for LLM application development ranging from laptop scale to datacenter scale.
|
||||
|
||||
Below you find the Data Prep Kit modules powered by Docling.
|
||||
|
||||
## PDF ingestion to Parquet
|
||||
- 💻 [GitHub \[↗\]](https://github.com/IBM/data-prep-kit/tree/dev/transforms/language/pdf2parquet)
|
||||
- 📖 [API docs \[↗\]](https://ibm.github.io/data-prep-kit/transforms/language/pdf2parquet/python/)
|
||||
|
||||
## Document chunking
|
||||
- 💻 [GitHub \[↗\]](https://github.com/IBM/data-prep-kit/tree/dev/transforms/language/doc_chunk)
|
||||
- 📖 [API docs \[↗\]](https://ibm.github.io/data-prep-kit/transforms/language/doc_chunk/python/)
|
@ -1,6 +1,6 @@
|
||||
## Get started
|
||||
|
||||
Docling is available as an official LlamaIndex extension!
|
||||
Docling is available as an official [LlamaIndex \[↗\]](https://docs.llamaindex.ai/) extension.
|
||||
|
||||
To get started, check out the [step-by-step guide in LlamaIndex \[↗\]](https://docs.llamaindex.ai/en/stable/examples/data_connectors/DoclingReaderDemo/)<!--{target="_blank"}-->.
|
||||
|
||||
|
@ -81,8 +81,9 @@ nav:
|
||||
# - CLI: examples/cli.md
|
||||
- Integrations:
|
||||
- Integrations: integrations/index.md
|
||||
- "LlamaIndex 🦙 extension": integrations/llamaindex.md
|
||||
# - "LangChain 🦜🔗 extension": integrations/langchain.md
|
||||
- "Data Prep Kit": integrations/data_prep_kit.md
|
||||
- "LlamaIndex 🦙": integrations/llamaindex.md
|
||||
# - "LangChain 🦜🔗": integrations/langchain.md
|
||||
# - API reference:
|
||||
# - API reference: api_reference/index.md
|
||||
|
||||
|
Loading…
Reference in New Issue
Block a user