Docling/docling/models
Michele Dolfi 6eaae3cba0
feat: add factory for ocr engines via plugins (#1010)
* add factory for ocr engines

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* apply pre-commit after rebase

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* add picture description factory

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* fix enable option

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* switch to create methods

Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>

* make `options` an explicit kwarg

Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>

* keep old lock of docling-core

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* fix lock

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* add allow_external_plugins option

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* add factory return and ignore options type

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

---------

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
Co-authored-by: Panos Vagenas <pva@zurich.ibm.com>
2025-03-18 13:58:05 +01:00
..
factories feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
plugins feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
__init__.py Initial commit 2024-07-15 09:42:42 +02:00
base_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
base_ocr_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
code_formula_model.py perf: New revision code formula model and document picture classifier (#1140) 2025-03-11 10:15:28 +01:00
document_picture_classifier.py perf: New revision code formula model and document picture classifier (#1140) 2025-03-11 10:15:28 +01:00
easyocr_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
hf_vlm_model.py feat: [Experimental] Introduce VLM pipeline using HF AutoModelForVision2Seq, featuring SmolDocling model (#1054) 2025-02-26 14:43:26 +01:00
layout_model.py refactor: use org--name in artifacts-path (#912) 2025-02-07 13:58:05 +01:00
ocr_mac_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
page_assemble_model.py feat: Implement new reading-order model (#916) 2025-02-20 17:51:17 +01:00
page_preprocessing_model.py feat: Add DoclingParseV4 backend, using high-level docling-parse API (#905) 2025-03-18 10:38:19 +01:00
picture_description_api_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
picture_description_base_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
picture_description_vlm_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
rapid_ocr_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
readingorder_model.py feat: Implement new reading-order model (#916) 2025-02-20 17:51:17 +01:00
table_structure_model.py feat: Add DoclingParseV4 backend, using high-level docling-parse API (#905) 2025-03-18 10:38:19 +01:00
tesseract_ocr_cli_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00
tesseract_ocr_model.py feat: add factory for ocr engines via plugins (#1010) 2025-03-18 13:58:05 +01:00