Docling/docling/models
Yorick Terweijden 53327552e8
feat(ocr): expose rec_keys_path in RapidOcrOptions to support custom dictionaries (#786)
* Expose `rec_keys_path` in RapidOcrOptions to support custom dictionaries

- Added `rec_keys_path` to `RapidOcrOptions` to align with RapidOCR's capability to use custom character dictionaries.
- Passed `rec_keys_path` to `RapidOcrModel` initialization, ensuring the recognition model can load the correct dictionary (e.g., for Latin characters).

Signed-off-by: Yorick Terweijden <yorick@spread.ai>

* style(rapidocr-options): fix alignment of `rec_keys_path` comment

Adjusted the alignment of the comment for `rec_keys_path` to maintain consistent formatting. No functional changes were made.

Signed-off-by: Yorick Terweijden <yorick@spread.ai>

---------

Signed-off-by: Yorick Terweijden <yorick@spread.ai>
2025-01-27 13:38:15 +01:00
..
__init__.py Initial commit 2024-07-15 09:42:42 +02:00
base_model.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
base_ocr_model.py fix: Improve OCR results, stricten criteria before dropping bitmap areas (#719) 2025-01-10 10:38:49 +01:00
code_formula_model.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
document_picture_classifier.py feat: New document picture classifier (#805) 2025-01-24 18:05:51 +01:00
ds_glm_model.py feat: Updated Layout processing with forms and key-value areas (#530) 2024-12-17 17:32:24 +01:00
easyocr_model.py feat: Updated Layout processing with forms and key-value areas (#530) 2024-12-17 17:32:24 +01:00
layout_model.py chore: expose draw_clusters function (#803) 2025-01-24 17:35:29 +01:00
ocr_mac_model.py feat: add support for ocrmac OCR engine on macOS (#276) 2024-11-20 12:51:19 +01:00
page_assemble_model.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
page_preprocessing_model.py feat: Add pipeline timings and toggle visualization, establish debug settings (#183) 2024-10-30 15:04:19 +01:00
rapid_ocr_model.py feat(ocr): expose rec_keys_path in RapidOcrOptions to support custom dictionaries (#786) 2025-01-27 13:38:15 +01:00
table_structure_model.py fix: Correct scaling of debug visualizations, tune OCR (#700) 2025-01-08 12:26:44 +01:00
tesseract_ocr_cli_model.py feat: Introduce automatic language detection in TesseractOcrCliModel (#800) 2025-01-26 08:07:56 +01:00
tesseract_ocr_model.py feat: Introduce automatic language detection in TesseractOcrCliModel (#800) 2025-01-26 08:07:56 +01:00