Docling/docling/models
Michele Dolfi b346faf622
feat: add coverage_threshold to skip OCR for small images (#161)
* feat: add coverage_threshold to skip OCR for small images
* filter individual boxes
* rename option

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
2024-10-18 13:58:23 +02:00
File                         Last commit                                                       Last updated
__init__.py                  Initial commit                                                    2024-07-15 09:42:42 +02:00
base_model.py                feat!: Docling v2 (#117)                                          2024-10-16 21:02:03 +02:00
base_ocr_model.py            feat: add coverage_threshold to skip OCR for small images (#161)  2024-10-18 13:58:23 +02:00
ds_glm_model.py              fix: fix legacy doc ref (#162)                                    2024-10-18 13:11:20 +02:00
easyocr_model.py             Ensure all models work only on valid pages (#158)                 2024-10-18 08:54:06 +02:00
layout_model.py              Ensure all models work only on valid pages (#158)                 2024-10-18 08:54:06 +02:00
page_assemble_model.py       Ensure all models work only on valid pages (#158)                 2024-10-18 08:54:06 +02:00
page_preprocessing_model.py  Ensure all models work only on valid pages (#158)                 2024-10-18 08:54:06 +02:00
table_structure_model.py     Ensure all models work only on valid pages (#158)                 2024-10-18 08:54:06 +02:00
tesseract_ocr_cli_model.py   Ensure all models work only on valid pages (#158)                 2024-10-18 08:54:06 +02:00
tesseract_ocr_model.py       Ensure all models work only on valid pages (#158)                 2024-10-18 08:54:06 +02:00