Commit Graph

  • 7e84533299
    fix: Upgrade docling-parse to 1.1.1, safety checks for failed parse on pages (#45) Christoph Auer 2024-08-23 12:51:02 +0200
  • 1930f08d4e chore: bump version to 1.7.0 [skip ci] github-actions[bot] 2024-08-22 12:00:25 +0000
  • a8c6b29a67
    feat: Upgrade docling-parse PDF backend and interface to use page-by-page parsing (#44) Christoph Auer 2024-08-22 13:49:37 +0200
  • f7c50c8b0e chore: bump version to 1.6.3 [skip ci] github-actions[bot] 2024-08-22 11:02:35 +0000
  • fac5745dc8
    fix: usage of bytesio with docling-parse (#43) Michele Dolfi 2024-08-22 12:59:49 +0200
  • 1347c01a9e chore: bump version to 1.6.2 [skip ci] github-actions[bot] 2024-08-22 07:32:54 +0000
  • 69952682ed
    fix: remove [ocr] extra to fix wheel install (#42) Michele Dolfi 2024-08-22 09:25:19 +0200
  • 47c6dab6d2 chore: bump version to 1.6.1 [skip ci] github-actions[bot] 2024-08-21 17:41:26 +0000
  • f19871a5a1
    fix: Add scipy as dependency (#40) Christoph Auer 2024-08-21 17:21:02 +0200
  • 4a1ceaf65c
    Update docling-ibm-models to v1.1.2 (#39) Christoph Auer 2024-08-21 17:12:38 +0200
  • 22a5c29c63 chore: bump version to 1.6.0 [skip ci] github-actions[bot] 2024-08-20 13:34:53 +0000
  • e94d317c02
    feat: Add adaptive OCR, factor out treatment of OCR areas and cell filtering (#38) Christoph Auer 2024-08-20 15:28:03 +0200
  • 47b8ad917e chore: bump version to 1.5.0 [skip ci] github-actions[bot] 2024-08-20 11:53:52 +0000
  • 78347bf679
    feat: allow computing page images on-demand with scale and cache them (#36) Michele Dolfi 2024-08-20 13:27:19 +0200
  • c253dd743a
    Add redbooks to test data, small additions (#35) Christoph Auer 2024-08-20 12:36:00 +0200
  • a13114bafd
    docs: add technical paper ref (#37) Michele Dolfi 2024-08-20 12:32:53 +0200
  • 778e51ef18 chore: bump version to 1.4.0 [skip ci] github-actions[bot] 2024-08-14 11:46:55 +0000
  • 349b0e914f
    fix: allow newer torch versions (#34) Michele Dolfi 2024-08-14 13:37:36 +0200
  • 90dd676422
    feat: update parser with bytesio interface and set as new default backend (#32) Michele Dolfi 2024-08-14 12:30:00 +0200
  • 61be78a875
    Fix class re-mapping for table of contents (#33) Christoph Auer 2024-08-14 11:32:30 +0200
  • dd0df9f094 chore: bump version to 1.3.0 [skip ci] github-actions[bot] 2024-08-12 16:29:05 +0000
  • 63d80edca2
    feat: output page images and extracted bbox (#31) Michele Dolfi 2024-08-12 18:25:45 +0200
  • 0bf4a43ed5 chore: bump version to 1.2.1 [skip ci] github-actions[bot] 2024-08-07 15:38:00 +0000
  • 79ef8d2f2f
    fix: update (vuln) deps (#29) Michele Dolfi 2024-08-07 17:29:36 +0200
  • 794b20a50a
    fix: type of path_or_stream in PdfDocumentBackend (#28) Michele Dolfi 2024-08-07 17:20:44 +0200
  • 9550db8e64
    docs: improve examples (#27) Michele Dolfi 2024-08-07 17:16:35 +0200
  • 20cbe7c24a chore: bump version to 1.2.0 [skip ci] github-actions[bot] 2024-08-07 14:35:03 +0000
  • b8f5e38a8c
    feat: introducing docling_backend (#26) Maxim Lysak 2024-08-07 16:22:36 +0200
  • 62ba4aaf31 chore: bump version to 1.1.2 [skip ci] github-actions[bot] 2024-07-31 12:35:59 +0000
  • d2d9543415
    fix: set page number using 1-based indexing (#22) Panos Vagenas 2024-07-31 14:28:44 +0200
  • e102827753 chore: bump version to 1.1.1 [skip ci] github-actions[bot] 2024-07-30 12:53:54 +0000
  • f4bf3d25b9
    fix: Correct text extraction for table cells (#21) Maxim Lysak 2024-07-30 14:51:47 +0200
  • b07c4a7a4a chore: bump version to 1.1.0 [skip ci] github-actions[bot] 2024-07-26 15:01:56 +0000
  • d603137383
    feat: add simplified single-doc conversion (#20) Panos Vagenas 2024-07-26 16:55:33 +0200
  • 3eca8b8485
    refactor(pypdfium2): just forward input to PdfDocument directly (#17) mara004 2024-07-25 08:54:57 +0200
  • 6db2b350dd chore: bump version to 1.0.2 [skip ci] github-actions[bot] 2024-07-24 12:18:21 +0000
  • 54b3dda141
    fix: add easyocr to main deps for valid extra (#19) Michele Dolfi 2024-07-24 14:11:26 +0200
  • 3e92f0bfba chore: bump version to 1.0.1 [skip ci] github-actions[bot] 2024-07-24 09:28:47 +0000
  • b0725e0aa6
    fix: expose ocr as extra (#18) Michele Dolfi 2024-07-24 11:14:17 +0200
  • 9f2add112f chore: bump version to 1.0.0 [skip ci] github-actions[bot] 2024-07-18 15:52:38 +0000
  • 71c3a9c8cd
    feat!: v1.0.0 release (#16) Michele Dolfi 2024-07-18 17:50:14 +0200
  • 7bc20adc16
    pin docling-ibm-models 1.1.0 with python 3.10 support (#15) Michele Dolfi 2024-07-18 17:27:48 +0200
  • eb0b208272
    chore: switch to docling-core Markdown export (#14) Panos Vagenas 2024-07-18 16:10:05 +0200
  • 28d1c746a6
    chore: update README (#13) Panos Vagenas 2024-07-18 11:23:23 +0200
  • f09ffcc8f4 chore: bump version to 0.4.0 [skip ci] github-actions[bot] 2024-07-17 14:26:50 +0000
  • e9526bb11e
    feat: Optimize table extraction quality, add configuration options (#11) Christoph Auer 2024-07-17 16:13:21 +0200
  • 3e2ede8107 chore: bump version to 0.3.1 [skip ci] github-actions[bot] 2024-07-17 13:58:51 +0000
  • d1d1724537
    fix: missing type for default values (#12) Michele Dolfi 2024-07-17 15:54:43 +0200
  • 2baa35c548
    docs: reflect supported Python versions, add badges (#10) Panos Vagenas 2024-07-17 15:49:26 +0200
  • 0dfa4548d3 chore: bump version to 0.3.0 [skip ci] github-actions[bot] 2024-07-17 12:11:15 +0000
  • fb72688ff7
    feat: enable python 3.12 support by updating glm (#8) Michele Dolfi 2024-07-17 14:03:26 +0200
  • 2803222ee1
    docs: Add setup with pypi to Readme (#7) Christoph Auer 2024-07-16 14:15:09 +0200
  • 5c88574d03 chore: bump version to 0.2.0 [skip ci] github-actions[bot] 2024-07-16 11:37:14 +0000
  • b1479cf4ec
    feat: build with ci (#6) Michele Dolfi 2024-07-16 13:34:42 +0200
  • b4f45ce96b
    disable docs build (#5) Michele Dolfi 2024-07-16 13:14:44 +0200
  • e45dc5d1a5
    ci: Add Github Actions (#4) Michele Dolfi 2024-07-16 13:05:04 +0200
  • b9dc892385
    Update convert.py (#3) Christoph Auer 2024-07-15 18:02:42 +0200
  • 05ab89f958
    doc: More documentation updates (#2) Christoph Auer 2024-07-15 14:59:53 +0200
  • 180f70c6e8
    docs: Update links, add GH repository to metadata (#1) Christoph Auer 2024-07-15 12:43:05 +0200
  • e2d996753b Initial commit Christoph Auer 2024-07-15 09:42:42 +0200