Docling

History

Gabe Goodhart c605edd8e9 feat: OllamaVlmModel for Granite Vision 3.2 (#1337 ) * build: Add ollama sdk dependency Branch: OllamaVlmModel Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> * feat: Add option plumbing for OllamaVlmOptions in pipeline_options Branch: OllamaVlmModel Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> * feat: Full implementation of OllamaVlmModel Branch: OllamaVlmModel Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> * feat: Connect "granite_vision_ollama" pipeline option to CLI Branch: OllamaVlmModel Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> * Revert "build: Add ollama sdk dependency" After consideration, we're going to use the generic OpenAI API instead of the Ollama-specific API to avoid duplicate work. This reverts commit bc6b366468cdd66b52540aac9c7d8b584ab48ad0. Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> * refactor: Move OpenAI API call logic into utils.utils This will allow reuse of this logic in a generic VLM model NOTE: There is a subtle change here in the ordering of the text prompt and the image in the call to the OpenAI API. When run against Ollama, this ordering makes a big difference. If the prompt comes before the image, the result is terse and not usable whereas the prompt coming after the image works as expected and matches the non-OpenAI chat API. Branch: OllamaVlmModel Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> * refactor: Refactor from Ollama SDK to generic OpenAI API Branch: OllamaVlmModel Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> * fix: Linting, formatting, and bug fixes The one bug fix was in the timeout arg to openai_image_request. Otherwise, this is all style changes to get MyPy and black passing cleanly. Branch: OllamaVlmModel Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> * remove model from download enum Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> * generalize input args for other API providers Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> * rename and refactor Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> * add example Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> * require flag for remote services Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> * disable example from CI Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> * add examples to docs Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> --------- Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> Signed-off-by: Michele Dolfi <dol@zurich.ibm.com> Co-authored-by: Michele Dolfi <dol@zurich.ibm.com>		2025-04-10 18:03:04 +02:00
..
assets	feat: expose new hybrid chunker, update docs (#384 )	2024-12-09 08:28:29 +01:00
concepts	docs: add plugins docs (#1319 )	2025-04-08 09:44:37 +02:00
examples	feat: OllamaVlmModel for Granite Vision 3.2 (#1337 )	2025-04-10 18:03:04 +02:00
faq	chore: move to docling-project org (#1160 )	2025-03-14 12:35:29 +01:00
installation	docs: Enrichment models (#1097 )	2025-03-04 14:24:38 +01:00
integrations	docs: move apify to docs (#1182 )	2025-03-18 16:43:55 +01:00
overrides	docs: extend integration docs & README (#456 )	2024-11-28 09:41:21 +01:00
reference	docs: specify docstring types (#702 )	2025-01-08 09:05:18 +01:00
stylesheets	docs: introduce docs site (#141 )	2024-10-14 14:13:13 +02:00
usage	feat(SmolDocling): Support MLX acceleration in VLM pipeline (#1199 )	2025-03-19 15:38:54 +01:00
index.md	feat(SmolDocling): Support MLX acceleration in VLM pipeline (#1199 )	2025-03-19 15:38:54 +01:00
v2.md	fix: Test cases for RTL programmatic PDFs and fixes for the formula model (#903 )	2025-02-07 08:43:31 +01:00