Panos Vagenas
|
0533da1923
|
feat: leverage new list modeling, capture default markers (#1856)
* chore: update docling-core & regenerate test data
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
* update backends to leverage new list modeling
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
* repin docling-core
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
* ensure availability of latest docling-core API
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
---------
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
|
2025-06-27 16:37:15 +02:00 |
|
Panos Vagenas
|
7c5614a37a
|
fix(markdown): fix single-formatted headings & list items (#1820)
* fix(markdown): fix formatting & inline edge cases (show behavior before change)
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
* add change and updated test data
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
* update lock
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
* improve test case
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
---------
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
|
2025-06-25 13:05:06 +02:00 |
|
Ayraf
|
df140227c3
|
feat: support xlsm files (#1520)
* code for xlsm support
* updated support for xlsm
* updated code for xlsm support
* Update docling_parse_v4_backend.py
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
* Update docling_parse_v4_backend.py
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
* Update test_backend_msexcel_xlsm.py
Updated tests/test_backend_msexcel_xlsm.py:
  - have a function starting with test
  - removed all print statements
  - to do: add an explicit assert {test}=={pred}
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
* Update base_models.py
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
* Update test_backend_msexcel.py
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
* Update test_backend_msexcel_xlsm.py
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
* Update document_converter.py
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
* Delete tests/test_backend_msexcel_xlsm.py
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
* xlsm file
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
* run tests
* ran tests
* Fix tests, upgrade XLSM example to a valid file
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
---------
Signed-off-by: ShiroYasha18 <85089952+ShiroYasha18@users.noreply.github.com>
Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Co-authored-by: Christoph Auer <cau@zurich.ibm.com>
|
2025-06-10 16:55:59 +02:00 |
|
Panos Vagenas
|
61d0d6c755
|
test: mark flaky test (#1698)
* test: cleanse Word test file
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
* mark textbox file test as flaky
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
* fix path usage
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
---------
Signed-off-by: Panos Vagenas <pva@zurich.ibm.com>
|
2025-06-03 13:13:44 +02:00 |
|
Cesar Berrospi Ramis
|
106951e71e
|
test: add missing ground truth files (#1667)
Signed-off-by: Cesar Berrospi Ramis <75900930+ceberam@users.noreply.github.com>
|
2025-05-28 13:26:49 +02:00 |
|