Docling/docling
Christoph Auer 34c7c79858
fix: improve handling of disallowed formats (#429)
* fix: Fixes and tests for StopIteration on .convert()

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* fix: Remove unnecessary case handling

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* fix: Other test fixes

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* improve handling of unsupported types

- Introduced new explicit exception types instead of `RuntimeError`
- Introduced new `ConversionStatus` value for unsupported formats
- Tidied up converter member typing & removed asserts

Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>

* robustify & simplify format option resolution

Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>

* rename new status, populate ConversionResult errors

Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>

---------

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
Co-authored-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>
2024-12-03 12:45:32 +01:00
..
backend fix: use correct image index in word backend (#442) 2024-11-27 13:45:07 +01:00
cli perf: prevent temp file leftovers, reuse core type (#487) 2024-12-03 10:40:28 +01:00
datamodel fix: improve handling of disallowed formats (#429) 2024-12-03 12:45:32 +01:00
models fix: ParserError EOF inside string (#470) (#472) 2024-12-03 11:21:18 +01:00
pipeline feat(ocr): added support for RapidOCR engine (#415) 2024-11-27 13:57:41 +01:00
utils feat: Add pipeline timings and toggle visualization, establish debug settings (#183) 2024-10-30 15:04:19 +01:00
__init__.py Initial commit 2024-07-15 09:42:42 +02:00
document_converter.py fix: improve handling of disallowed formats (#429) 2024-12-03 12:45:32 +01:00
exceptions.py fix: improve handling of disallowed formats (#429) 2024-12-03 12:45:32 +01:00