Docling

NeoAnd/Docling

Fork 0

Commit Graph

Author	SHA1	Message	Date
Maxim Lysak	b8f5e38a8c	feat: introducing docling_backend (#26 ) Uses our own docling_parse to reliably get PDF cells To get page images, this backend uses pypdfium2 Signed-off-by: Maxim Lysak <mly@zurich.ibm.com> Co-authored-by: Maxim Lysak <mly@zurich.ibm.com>	2024-08-07 16:22:36 +02:00
mara004	3eca8b8485	refactor(pypdfium2): just forward input to PdfDocument directly (#17 ) PdfDocument() should do accept strings, paths, bytes and byte streams. If not, please file a bug report. Signed-off-by: mara004 <geisserml@gmail.com>	2024-07-25 08:54:57 +02:00
Christoph Auer	e2d996753b	Initial commit	2024-07-15 09:42:42 +02:00

Author

SHA1

Message

Date

Maxim Lysak

b8f5e38a8c

feat: introducing docling_backend (#26 )

Uses our own docling_parse to reliably get PDF cells
To get page images, this backend uses pypdfium2

Signed-off-by: Maxim Lysak <mly@zurich.ibm.com>
Co-authored-by: Maxim Lysak <mly@zurich.ibm.com>

2024-08-07 16:22:36 +02:00

mara004

3eca8b8485

refactor(pypdfium2): just forward input to PdfDocument directly (#17 )

PdfDocument() should do accept strings, paths, bytes and byte streams. If not, please file a bug report.

Signed-off-by: mara004 <geisserml@gmail.com>

2024-07-25 08:54:57 +02:00

Christoph Auer

e2d996753b

Initial commit

2024-07-15 09:42:42 +02:00

3 Commits