Type: lib.pdf.ExtractText
Namespace: lib.pdf
Description
Extract plain text from a PDF, preserving line breaks based on layout position. pdf, text, extract, read, content
Properties
| Property | Type | Description | Default |
|---|---|---|---|
| start_page | int |
First page (0-based) | 0 |
| end_page | int |
Last page (-1 for all) | -1 |
Outputs
| Output | Type | Description |
|---|---|---|
| output | str |
Related Nodes
Browse other nodes in the lib.pdf namespace.