Type: lib.pdf.ExtractTables

Namespace: lib.pdf

Description

Detect and extract tables from a PDF by analysing text layout.

Keywords: pdf, tables, extract, data, rows, columns

Properties

| Property | Type | Description | Default |
|---|---|---|---|
| start_page | int | First page (0-based) | 0 |
| end_page | int | Last page (-1 for all) | -1 |
| y_tolerance | int | Pixel tolerance for grouping text into rows | 3 |
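
To illustrate what a row-grouping tolerance such as y_tolerance typically does, here is a minimal sketch. It is not the node's actual implementation: the `Word` type, the `group_rows` function, and the rule of comparing each word against the first word of the current row are all assumptions made for illustration.

```python
from typing import NamedTuple


class Word(NamedTuple):
    """Hypothetical text fragment with its position on the page."""
    x: float   # horizontal position (left edge)
    y: float   # vertical position
    text: str


def group_rows(words: list[Word], y_tolerance: float = 3) -> list[list[str]]:
    """Group words into rows when their y positions are within y_tolerance."""
    rows: list[list[Word]] = []
    for word in sorted(words, key=lambda w: w.y):
        # Compare against the first word of the current row (the row anchor).
        if rows and abs(word.y - rows[-1][0].y) <= y_tolerance:
            rows[-1].append(word)
        else:
            rows.append([word])
    # Within each row, order cells left to right.
    return [[w.text for w in sorted(row, key=lambda w: w.x)] for row in rows]


words = [
    Word(10, 100.0, "Name"), Word(80, 101.5, "Qty"),
    Word(10, 120.0, "Apple"), Word(80, 118.0, "3"),
]
rows = group_rows(words)
print(rows)
```

In this sketch, a larger tolerance merges text that sits on slightly different baselines into one row, while a smaller one risks splitting a single visual row into several.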

Outputs

| Output | Type | Description |
|---|---|---|
| output | list[dict] | |
