This namespace contains 10 node(s).
Available Nodes
- PDF Extract Styled Text - Extract text spans with font name, size, bounding box, and color (always null…
- PDF Extract Tables - Detect and extract tables from a PDF by analysing text layout.
- PDF Extract Text - Extract plain text from a PDF, preserving line breaks based on layout position.
- PDF Extract Text (OCR) - Extract text from a PDF using OCR, suitable for scanned documents and image-b…
- PDF Extract Text Blocks - Extract text blocks with bounding boxes, useful for layout analysis.
- PDF Page Count - Get the total number of pages in a PDF document.
- PDF Page Metadata - Get dimensions and bounding box for each page.
- PDF Page Screenshot - Render PDF pages as PNG images.
- PDF Search Text - Search a PDF for a phrase and return each match with its page number and boun…
- PDF to Markdown - Convert PDF to Markdown, inferring headings from font size.