Layout Analysis tools

Document layout analysis is the process of identifying and categorizing the regions of interest on an unstructured document page (e.g. a scanned page).

Intended for use in the Mapa76 processing pipeline: detecting the clusters of text in a PDF file so that named-entity (NE) detection can be run on the body text alone, excluding unrelated text lines (such as page numbers, titles, and footnotes).
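To make the goal concrete, here is a minimal sketch of the kind of filtering involved: keeping body text while dropping page numbers, titles, and footnotes. It is not the Mapa76 implementation. The `TextLine` type, the `body_lines` function, and the `margin` and `size_tolerance` thresholds are all hypothetical, and the heuristic (drop lines in the top or bottom margin, and lines whose height differs from the page's median) is only one simple approach among many.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class TextLine:
    """Hypothetical record for one extracted text line."""
    text: str
    top: float     # distance from the top of the page, in points
    height: float  # line height, used here as a proxy for font size


def body_lines(lines, page_height, margin=0.08, size_tolerance=0.15):
    """Return the lines that look like body text (assumed heuristic)."""
    if not lines:
        return []
    # Assume the most common line height on the page belongs to body text.
    typical = median(line.height for line in lines)
    kept = []
    for line in lines:
        in_header = line.top < page_height * margin
        in_footer = line.top + line.height > page_height * (1 - margin)
        odd_size = abs(line.height - typical) > typical * size_tolerance
        if not (in_header or in_footer or odd_size):
            kept.append(line)
    return kept
```

Only the lines returned by `body_lines` would then be passed on to NE detection. Real pages with multiple columns, tables, or figures need a proper layout analysis tool rather than this heuristic.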