Version: 1.0.0

[PageAnalysisParams] INI file section

The [PageAnalysisParams] INI file section defines parameters controlling how the Pdftools OCR Service analyzes page content during layout analysis, including detecting text, tables, images, barcodes, and layout structures.
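
For orientation, the section could be written out as follows with every key at the default documented on this page. This is only a sketch: it assumes the common `Key=value` INI syntax with `;` comments, which this page does not itself specify.

```ini
; Sketch: every key at its documented default.
; Assumes Key=value syntax and ';' comments (not confirmed by this page).
[PageAnalysisParams]
DetectText=true
EnableTextExtractionMode=false
DetectTables=true
AggressiveTableDetection=false
DetectBarcodes=false
DetectSeparators=true
DetectPictures=true
DetectVectorGraphics=true
; Additional settings
DetectMultipleBusinessCards=false
NoShadowsMode=false
DetectVerticalEuropeanText=false
ProhibitCJKColumns=false
ProhibitDoublePageMode=false
ProhibitModelAnalysis=false
```

Because these are the defaults, a key presumably only needs to appear in the file when its value should differ from the one shown.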


DetectText

| Key | Type | Default |
| --- | --- | --- |
| DetectText | Boolean | true |

If this property is true, text areas are detected during layout analysis.


EnableTextExtractionMode

| Key | Type | Default |
| --- | --- | --- |
| EnableTextExtractionMode | Boolean | false |

When set to true, the Pdftools OCR Service assumes that text blocks can be located anywhere on the page.
Isolated text blocks are detected during layout analysis, but tables are not.
Model analysis is not performed, as if the ProhibitModelAnalysis property were set to true.
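
A minimal sketch of enabling this mode, assuming the usual `Key=value` INI syntax with `;` comments (the exact file syntax is not shown on this page):

```ini
[PageAnalysisParams]
; Treat text blocks as possibly located anywhere on the page.
EnableTextExtractionMode=true
; Per the description above, tables are not detected in this mode and
; model analysis is skipped, as if ProhibitModelAnalysis were true.
```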


DetectTables

| Key | Type | Default |
| --- | --- | --- |
| DetectTables | Boolean | true |

If this property is true, tables are detected during layout analysis.


AggressiveTableDetection

| Key | Type | Default |
| --- | --- | --- |
| AggressiveTableDetection | Boolean | false |

This property manages the table detection mode. If you set it to true, the Pdftools OCR Service tries to find as many tables as possible on the page.

This setting is recommended only for documents that contain a lot of tables.
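
For a table-heavy document set, the configuration might look like the sketch below. It assumes `Key=value` INI syntax with `;` comments, and it assumes that aggressive mode only matters while DetectTables is true; neither assumption is stated on this page.

```ini
[PageAnalysisParams]
; Table detection itself stays on (documented default).
DetectTables=true
; Try to find as many tables as possible on each page.
AggressiveTableDetection=true
```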


DetectBarcodes

| Key | Type | Default |
| --- | --- | --- |
| DetectBarcodes | Boolean | false |

Specifies whether barcodes are detected and barcode blocks are created during layout analysis. If this property is false, barcodes may be detected as blocks of another type (for example, pictures).
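
Since this key is off by default, pages that carry barcodes would need it switched on explicitly. A minimal sketch, assuming `Key=value` INI syntax with `;` comments:

```ini
[PageAnalysisParams]
; Create barcode blocks instead of letting barcodes be
; classified as another block type such as pictures.
DetectBarcodes=true
```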


DetectSeparators

| Key | Type | Default |
| --- | --- | --- |
| DetectSeparators | Boolean | true |

If this property is true, separators are detected during layout analysis.


DetectPictures

| Key | Type | Default |
| --- | --- | --- |
| DetectPictures | Boolean | true |

If this property is true, pictures are detected during layout analysis.


DetectVectorGraphics

| Key | Type | Default |
| --- | --- | --- |
| DetectVectorGraphics | Boolean | true |

If this property is true, vector pictures are detected during layout analysis.
Vector picture blocks can appear in the layout only if this property was true when layout analysis ran.


Additional settings

DetectMultipleBusinessCards

| Key | Type | Default |
| --- | --- | --- |
| DetectMultipleBusinessCards | Boolean | false |

Specifies whether a page being processed can contain several business cards.


NoShadowsMode

| Key | Type | Default |
| --- | --- | --- |
| NoShadowsMode | Boolean | false |

When set to true, the Pdftools OCR Service presumes that an image has no shadows from scanning.


DetectVerticalEuropeanText

| Key | Type | Default |
| --- | --- | --- |
| DetectVerticalEuropeanText | Boolean | false |

When set to true, the Pdftools OCR Service looks for vertically oriented text.
It applies to all languages other than CJK.
For CJK languages, vertical text detection is managed by the ProhibitCJKColumns property.


ProhibitCJKColumns

| Key | Type | Default |
| --- | --- | --- |
| ProhibitCJKColumns | Boolean | false |

The text in CJK languages can be written vertically as well as horizontally.
Setting this property to true makes the Pdftools OCR Service ignore the possibility of vertical text and recognize the image on the assumption that all text is arranged horizontally.

This property applies only to CJK languages.
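
The two vertical-text keys govern different language groups, so they can be set independently. The sketch below, which assumes `Key=value` INI syntax with `;` comments, looks for vertical text in non-CJK languages while treating all CJK text as horizontal:

```ini
[PageAnalysisParams]
; Non-CJK languages: look for vertically oriented text.
DetectVerticalEuropeanText=true
; CJK languages: ignore the possibility of vertical text.
ProhibitCJKColumns=true
```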


ProhibitDoublePageMode

| Key | Type | Default |
| --- | --- | --- |
| ProhibitDoublePageMode | Boolean | false |

When set to true, the Pdftools OCR Service assumes that the image is not a scan of a double-page book spread.


ProhibitModelAnalysis

| Key | Type | Default |
| --- | --- | --- |
| ProhibitModelAnalysis | Boolean | false |

If this property is false, typical page layout variants are evaluated during page analysis, and the best variant is selected to improve recognition quality.
If no best variant can be selected, standard page layout analysis is performed.

Note: If EnableTextExtractionMode is set to true, this property is ignored and model analysis is not performed.
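
The interaction between the two keys can be sketched as follows, assuming `Key=value` INI syntax with `;` comments. Here model analysis is turned off explicitly while text extraction mode stays at its default:

```ini
[PageAnalysisParams]
; Skip evaluation of typical page layout variants.
ProhibitModelAnalysis=true
; Documented default. If this were true, ProhibitModelAnalysis
; would be ignored and model analysis skipped regardless.
EnableTextExtractionMode=false
```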