..
backend_xml_rag.ipynb
docs(backend XML): do not delete temp file in notebook ( #817 )
2025-01-27 18:53:39 +01:00
batch_convert.py
chore: make tests lighter ( #228 )
2024-11-04 14:02:28 +01:00
custom_convert.py
docs: fix correct Accelerator pipeline options in docs/examples/custom_convert.py ( #733 )
2025-01-19 16:55:26 +01:00
develop_formula_understanding.py
refactor: allow the usage of backends in the enrich models and generalize the interface ( #742 )
2025-01-15 09:52:38 +01:00
develop_picture_enrichment.py
feat: Code and equation model for PDF and code blocks in markdown ( #752 )
2025-01-24 16:54:22 +01:00
export_figures.py
fix: Update tests and examples for docling-core 2.5.1 ( #449 )
2024-11-27 13:07:00 +01:00
export_multimodal.py
feat!: Docling v2 ( #117 )
2024-10-16 21:02:03 +02:00
export_tables.py
feat!: Docling v2 ( #117 )
2024-10-16 21:02:03 +02:00
full_page_ocr.py
feat(ocr): added support for RapidOCR engine ( #415 )
2024-11-27 13:57:41 +01:00
hybrid_chunking.ipynb
docs: add LangChain docs ( #717 )
2025-01-09 14:12:05 +01:00
index.md
docs: add architecture outline ( #341 )
2024-11-15 12:52:41 +01:00
inspect_picture_content.py
docs: Add example for inspection of picture content ( #624 )
2025-01-29 10:39:00 +01:00
minimal.py
chore: various minor docs fixes ( #169 )
2024-10-22 15:29:36 +02:00
rag_azuresearch.ipynb
docs: typo ( #814 )
2025-01-27 11:24:26 +01:00
rag_haystack.ipynb
docs: add integrations, revamp docs ( #693 )
2025-01-07 14:15:54 +01:00
rag_langchain.ipynb
docs: add LangChain docs ( #717 )
2025-01-09 14:12:05 +01:00
rag_llamaindex.ipynb
docs: add integrations, revamp docs ( #693 )
2025-01-07 14:15:54 +01:00
rag_weaviate.ipynb
docs: add integrations, revamp docs ( #693 )
2025-01-07 14:15:54 +01:00
retrieval_qdrant.ipynb
docs: add integrations, revamp docs ( #693 )
2025-01-07 14:15:54 +01:00
run_md.py
feat: Support AsciiDoc and Markdown input format ( #168 )
2024-10-23 16:14:26 +02:00
run_with_accelerator.py
feat: Introduce support for GPU Accelerators ( #593 )
2024-12-13 17:45:22 +01:00
run_with_formats.py
fix: fix duplicate title and heading + add e2e tests for html and docx ( #186 )
2024-10-30 13:14:56 +01:00
tesseract_lang_detection.py
feat: Introduce automatic language detection in TesseractOcrCliModel ( #800 )
2025-01-26 08:07:56 +01:00
translate.py
docs: Example to translate documents ( #739 )
2025-01-15 06:51:15 +01:00