Docling/docs/examples
Matteo 3213b247ad
feat: Code and equation model for PDF and code blocks in markdown (#752)
* propagated changes for new CodeItem class

Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>

* Rebased branch on latest main. changes for CodeItem

Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>

* removed unused files

Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>

* chore: update lockfile

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* pin latest docling-core

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* update docling-core pinning

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* pin docling-core

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* use new add_code in backends and update typing in MD backend

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* added if statement for backend

Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>

* removed unused import

Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>

* removed print statements

Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>

* gt for new pdf

Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>

* Update docling/pipeline/standard_pdf_pipeline.py

Co-authored-by: Michele Dolfi <97102151+dolfim-ibm@users.noreply.github.com>
Signed-off-by: Matteo <43417658+Matteo-Omenetti@users.noreply.github.com>

* fixed doc comment of __call__ function of code_formula_model

Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>

* fix artifacts_path type

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* move imports

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* move expansion_factor to base class

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

---------

Signed-off-by: Matteo Omenetti <omenetti.matteo@gmail.com>
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Signed-off-by: Matteo <43417658+Matteo-Omenetti@users.noreply.github.com>
Co-authored-by: Christoph Auer <cau@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Michele Dolfi <97102151+dolfim-ibm@users.noreply.github.com>
2025-01-24 16:54:22 +01:00
File | Last commit | Date
batch_convert.py | chore: make tests lighter (#228) | 2024-11-04 14:02:28 +01:00
custom_convert.py | docs: fix correct Accelerator pipeline options in docs/examples/custom_convert.py (#733) | 2025-01-19 16:55:26 +01:00
develop_formula_understanding.py | refactor: allow the usage of backends in the enrich models and generalize the interface (#742) | 2025-01-15 09:52:38 +01:00
develop_picture_enrichment.py | feat: Code and equation model for PDF and code blocks in markdown (#752) | 2025-01-24 16:54:22 +01:00
export_figures.py | fix: Update tests and examples for docling-core 2.5.1 (#449) | 2024-11-27 13:07:00 +01:00
export_multimodal.py | feat!: Docling v2 (#117) | 2024-10-16 21:02:03 +02:00
export_tables.py | feat!: Docling v2 (#117) | 2024-10-16 21:02:03 +02:00
full_page_ocr.py | feat(ocr): added support for RapidOCR engine (#415) | 2024-11-27 13:57:41 +01:00
hybrid_chunking.ipynb | docs: add LangChain docs (#717) | 2025-01-09 14:12:05 +01:00
index.md | docs: add architecture outline (#341) | 2024-11-15 12:52:41 +01:00
minimal.py | chore: various minor docs fixes (#169) | 2024-10-22 15:29:36 +02:00
rag_azuresearch.ipynb | docs: fix minor typos (#801) | 2025-01-24 16:27:05 +01:00
rag_haystack.ipynb | docs: add integrations, revamp docs (#693) | 2025-01-07 14:15:54 +01:00
rag_langchain.ipynb | docs: add LangChain docs (#717) | 2025-01-09 14:12:05 +01:00
rag_llamaindex.ipynb | docs: add integrations, revamp docs (#693) | 2025-01-07 14:15:54 +01:00
rag_weaviate.ipynb | docs: add integrations, revamp docs (#693) | 2025-01-07 14:15:54 +01:00
retrieval_qdrant.ipynb | docs: add integrations, revamp docs (#693) | 2025-01-07 14:15:54 +01:00
run_md.py | feat: Support AsciiDoc and Markdown input format (#168) | 2024-10-23 16:14:26 +02:00
run_with_accelerator.py | feat: Introduce support for GPU Accelerators (#593) | 2024-12-13 17:45:22 +01:00
run_with_formats.py | fix: fix duplicate title and heading + add e2e tests for html and docx (#186) | 2024-10-30 13:14:56 +01:00
translate.py | docs: Example to translate documents (#739) | 2025-01-15 06:51:15 +01:00