Before the “character” recognition will take place, the logical structure of the document has to be be analyzed and defined. For example:
ABBYY technology contains several variants of Document Layout Analysis:
The Document Analysis (DA) searches and “finds” zones for recognition on the document images. Here how it works:
The following screenshot of ABBYY FineReader shows the result of a analyzed layout (text, image and table blocks) , as well as the reconstructed output.
or on a multi-column magazine page with intelligent layout analysis & reconstruciton
If there would be no intelligent layout analysis, but use only use one large text block, then the results of are by far not that useable for a human for example on a multi-column document, then the user would also get the text, but not
Automatic Document Analysis in the SDKs can work in the different modes available in the OCR-SDKs:
Note: It is possible to use ABBYY SDK without applying the document layout analysis. Then the developer has to create own blocks/recognition areas. Then this processing scenario is called Field-Level-OCR - Zonal OCR