docs: add coming-soon section (#235)
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
This commit is contained in:
parent
d5e65aedac
commit
5ce02c5c59
13
README.md
13
README.md
@ -19,19 +19,22 @@
|
||||
|
||||
Docling parses documents and exports them to the desired format with ease and speed.
|
||||
|
||||
|
||||
## Features
|
||||
|
||||
* 🗂️ Reads popular document formats (PDF, DOCX, PPTX, Images, HTML, AsciiDoc, Markdown) and exports to Markdown and JSON
|
||||
* 📑 Advanced PDF document understanding including page layout, reading order & table structures
|
||||
* 🧩 Unified, expressive [DoclingDocument](https://ds4sd.github.io/docling/concepts/docling_document/) representation format
|
||||
* 📝 Metadata extraction, including title, authors, references & language
|
||||
* 🤖 Seamless LlamaIndex 🦙 & LangChain 🦜🔗 integration for powerful RAG / QA applications
|
||||
* 🤖 Easy integration with LlamaIndex 🦙 & LangChain 🦜🔗 for powerful RAG / QA applications
|
||||
* 🔍 OCR support for scanned PDFs
|
||||
* 💻 Simple and convenient CLI
|
||||
|
||||
Explore the [documentation](https://ds4sd.github.io/docling/) to discover plenty examples and unlock the full power of Docling!
|
||||
|
||||
### Coming soon
|
||||
|
||||
* ♾️ Equation & code extraction
|
||||
* 📝 Metadata extraction, including title, authors, references & language
|
||||
* 🦜🔗 Native LangChain extension
|
||||
|
||||
## Installation
|
||||
|
||||
@ -57,7 +60,6 @@ result = converter.convert(source)
|
||||
print(result.document.export_to_markdown()) # output: "## Docling Technical Report[...]"
|
||||
```
|
||||
|
||||
|
||||
Check out [Getting started](https://ds4sd.github.io/docling/).
|
||||
You will find lots of tuning options to leverage all the advanced capabilities.
|
||||
|
||||
@ -66,7 +68,6 @@ You will find lots of tuning options to leverage all the advanced capabilities.
|
||||
|
||||
Please feel free to connect with us using the [discussion section](https://github.com/DS4SD/docling/discussions).
|
||||
|
||||
|
||||
## Technical report
|
||||
|
||||
For more details on Docling's inner workings, check out the [Docling Technical Report](https://arxiv.org/abs/2408.09869).
|
||||
@ -95,5 +96,5 @@ If you use Docling in your projects, please consider citing the following:
|
||||
|
||||
## License
|
||||
|
||||
The Docling codebase is under MIT license.
|
||||
The Docling codebase is under MIT license.
|
||||
For individual model usage, please refer to the model licenses found in the original packages.
|
||||
|
@ -22,7 +22,12 @@ Docling parses documents and exports them to the desired format with ease and sp
|
||||
* 🗂️ Reads popular document formats (PDF, DOCX, PPTX, Images, HTML, AsciiDoc, Markdown) and exports to Markdown and JSON
|
||||
* 📑 Advanced PDF document understanding incl. page layout, reading order & table structures
|
||||
* 🧩 Unified, expressive [DoclingDocument](./concepts/docling_document.md) representation format
|
||||
* 📝 Metadata extraction, including title, authors, references & language
|
||||
* 🤖 Seamless LlamaIndex 🦙 & LangChain 🦜🔗 integration for powerful RAG / QA applications
|
||||
* 🤖 Easy integration with LlamaIndex 🦙 & LangChain 🦜🔗 for powerful RAG / QA applications
|
||||
* 🔍 OCR support for scanned PDFs
|
||||
* 💻 Simple and convenient CLI
|
||||
|
||||
### Coming soon
|
||||
|
||||
* ♾️ Equation & code extraction
|
||||
* 📝 Metadata extraction, including title, authors, references & language
|
||||
* 🦜🔗 Native LangChain extension
|
||||
|
Loading…
Reference in New Issue
Block a user