Docling/docling/backend
Qiefan Jiang 13865c06f5
perf(msexcel): _find_table_bounds use iter_rows/iter_cols instead of Worksheet.cell (#1875)
* perf(msexcel): _find_table_bounds use iter_rows/iter_cols instead of sheet.cell

* DCO Remediation Commit for Qiefan Jiang <jiangqiefan@bytedance.com>

I, Qiefan Jiang <jiangqiefan@bytedance.com>, hereby add my Signed-off-by to this commit: 274102a8d4db5d2da8c7ca603e1eb039c1e07967

Signed-off-by: Qiefan Jiang <jiangqiefan@bytedance.com>

* fix lint

* DCO Remediation Commit for Qiefan Jiang <jiangqiefan@bytedance.com>

I, Qiefan Jiang <jiangqiefan@bytedance.com>, hereby add my Signed-off-by to this commit: b6b5b090a99ba7ba23c1facf0317f7e9f95039e5

Signed-off-by: Qiefan Jiang <jiangqiefan@bytedance.com>

---------

Signed-off-by: Qiefan Jiang <jiangqiefan@bytedance.com>
2025-07-03 13:12:06 +02:00
..
docx ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
json feat: add Docling JSON ingestion (#783) 2025-01-24 18:05:23 +01:00
xml chore: typo fix (#1465) 2025-04-28 08:52:09 +02:00
__init__.py Initial commit 2024-07-15 09:42:42 +02:00
abstract_backend.py feat: add Docling JSON ingestion (#783) 2025-01-24 18:05:23 +01:00
asciidoc_backend.py fix(asciidoc): set default size when missing in image directive (#1769) 2025-06-16 10:38:46 +02:00
csv_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
docling_parse_backend.py feat: Make Page.parsed_page the only source of truth for text cells, add OCR cells to it (#1745) 2025-06-13 19:01:55 +02:00
docling_parse_v2_backend.py feat: Make Page.parsed_page the only source of truth for text cells, add OCR cells to it (#1745) 2025-06-13 19:01:55 +02:00
docling_parse_v4_backend.py chore: Safer unloading of DPv4 backend (#1867) 2025-06-30 14:41:21 +02:00
html_backend.py feat: leverage new list modeling, capture default markers (#1856) 2025-06-27 16:37:15 +02:00
md_backend.py feat: leverage new list modeling, capture default markers (#1856) 2025-06-27 16:37:15 +02:00
msexcel_backend.py perf(msexcel): _find_table_bounds use iter_rows/iter_cols instead of Worksheet.cell (#1875) 2025-07-03 13:12:06 +02:00
mspowerpoint_backend.py feat: leverage new list modeling, capture default markers (#1856) 2025-06-27 16:37:15 +02:00
msword_backend.py feat: leverage new list modeling, capture default markers (#1856) 2025-06-27 16:37:15 +02:00
noop_backend.py feat: Support audio input (#1763) 2025-06-23 14:47:26 +02:00
pdf_backend.py ci: add coverage and ruff (#1383) 2025-04-14 18:01:26 +02:00
pypdfium2_backend.py feat: Make Page.parsed_page the only source of truth for text cells, add OCR cells to it (#1745) 2025-06-13 19:01:55 +02:00