Docling/docs
Christoph Auer c93e36988f
feat: Implement new reading-order model (#916)
* Implement new reading-order model, replacing DS GLM model (WIP)

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Update reading-order model branch

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Update lockfile [skip ci]

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Add captions, footnotes and merges [skip ci]

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Updates for reading-order implementation

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Updates for reading-order implementation

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Update tests and lockfile

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Fixes, update tests

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Add normalization, update tests again

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Update tests with code

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Push final lockfile

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* sanitize text

Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>

* Inlcude furniture, Update tests with furniture

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* Fix content_layer assignment

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>

* chore: Delete empty file docling/models/ds_glm_model.py

Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>

---------

Signed-off-by: Christoph Auer <cau@zurich.ibm.com>
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
Signed-off-by: Nikos Livathinos <nli@zurich.ibm.com>
Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>
Co-authored-by: Nikos Livathinos <nli@zurich.ibm.com>
2025-02-20 17:51:17 +01:00
..
assets feat: expose new hybrid chunker, update docs (#384) 2024-12-09 08:28:29 +01:00
concepts docs: fix links between docs pages (#697) 2025-01-20 09:52:59 +01:00
examples feat: Implement new reading-order model (#916) 2025-02-20 17:51:17 +01:00
integrations docs: add pointers to LangChain-side docs (#718) 2025-01-09 17:36:46 +01:00
overrides docs: extend integration docs & README (#456) 2024-11-28 09:41:21 +01:00
reference docs: specify docstring types (#702) 2025-01-08 09:05:18 +01:00
stylesheets docs: introduce docs site (#141) 2024-10-14 14:13:13 +02:00
faq.md feat: Python 3.13 support (#841) 2025-01-30 17:26:42 +01:00
index.md docs: updated the readme with upcoming features (#831) 2025-01-30 09:52:54 +01:00
installation.md feat(ocr): added support for RapidOCR engine (#415) 2024-11-27 13:57:41 +01:00
supported_formats.md feat(xml-jats): parse XML JATS documents (#967) 2025-02-17 10:43:31 +01:00
usage.md feat: Introduce the enable_remote_services option to allow remote connections while processing (#941) 2025-02-12 15:18:01 +01:00
v2.md fix: Test cases for RTL programmatic PDFs and fixes for the formula model (#903) 2025-02-07 08:43:31 +01:00