PDF Extract Tables

Type: lib.pdf.ExtractTables

Namespace: lib.pdf

Description

Detect and extract tables from a PDF by analysing text layout. pdf, tables, extract, data, rows, columns

Property	Type	Description	Default
start_page	`int`	First page (0-based)	`0`
end_page	`int`	Last page (-1 for all)	`-1`
y_tolerance	`int`	Pixel tolerance for grouping text into rows	`3`

Output	Type	Description
output	`list[dict]`

Browse other nodes in the lib.pdf namespace.