Docling/docling/backend
Cesar Berrospi Ramis 0cd81a8122
fix(docx): merged table cells not properly converted (#857)
* fix(docx): merged cells not properly converted

Fix conversion issue of merged cells in Word tables leading to repeated text.
Simplify Word table conversion code.
Add docx file with several table formats for regression tests.

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

* chore: add type hinting to docx backend

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>

---------

Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
2025-02-03 10:20:03 +01:00
..
json feat: add Docling JSON ingestion (#783) 2025-01-24 18:05:23 +01:00
xml docs: description of supported formats and backends (#788) 2025-01-26 08:10:33 +01:00
__init__.py Initial commit 2024-07-15 09:42:42 +02:00
abstract_backend.py feat: add Docling JSON ingestion (#783) 2025-01-24 18:05:23 +01:00
asciidoc_backend.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
docling_parse_backend.py refactor: allow the usage of backends in the enrich models and generalize the interface (#742) 2025-01-15 09:52:38 +01:00
docling_parse_v2_backend.py refactor: allow the usage of backends in the enrich models and generalize the interface (#742) 2025-01-15 09:52:38 +01:00
html_backend.py fix: use new add_code in html backend and add more typing hints (#850) 2025-01-31 09:54:17 +01:00
md_backend.py fix(markdown): fix empty block handling (#843) 2025-01-30 16:22:29 +01:00
msexcel_backend.py fix: added extraction of byte-images in excel (#804) 2025-01-24 18:48:02 +01:00
mspowerpoint_backend.py fix: Processing of placeholder shapes in pptx that have text but no bbox (#868) 2025-02-03 09:33:33 +01:00
msword_backend.py fix(docx): merged table cells not properly converted (#857) 2025-02-03 10:20:03 +01:00
pdf_backend.py feat: Code and equation model for PDF and code blocks in markdown (#752) 2025-01-24 16:54:22 +01:00
pypdfium2_backend.py refactor: allow the usage of backends in the enrich models and generalize the interface (#742) 2025-01-15 09:52:38 +01:00