Version: 1.1.0

[SynthesisParamsForPage] INI file section

The [SynthesisParamsForPage] INI file section defines page-level synthesis settings for paragraph structure, font formatting, link detection, and color detection.


Main settings

ParagraphExtractionMode

Key | Type | Default
ParagraphExtractionMode | ParagraphExtractionModeEnum | PEM_NormalExtraction

Controls how paragraphs are identified and extracted from recognized text.

ParagraphExtractionModeEnum

  • PEM_NormalExtraction: Standard paragraph detection and extraction.
  • PEM_RoughExtraction: Produces the fewest paragraphs possible: either one per block, or only those beginning with an enlarged initial letter (drop cap).
  • PEM_SingleLineParagraphsWithSpaceFormatting: Treats each line as its own paragraph, preserving space-based formatting.
  • PEM_SingleLineParagraphsWithWordSeparationOnly: Treats each line as its own paragraph, using spaces only to separate words without additional formatting.
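
As an illustration, a fragment like the following would switch to line-by-line extraction, which may suit fixed-layout sources such as monospaced listings. It is a sketch only: it assumes the conventional `Key = Value` INI syntax and `;` comment lines, neither of which is specified in this section.

```ini
; Sketch only: "Key = Value" syntax and ';' comments are assumptions.
[SynthesisParamsForPage]
; Treat each recognized line as its own paragraph and keep space-based formatting.
ParagraphExtractionMode = PEM_SingleLineParagraphsWithSpaceFormatting
```

Leaving the key out keeps the default, PEM_NormalExtraction.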

DetectFontFormattingAtPageLevel

Key | Type | Default
DetectFontFormattingAtPageLevel | Boolean | false

When set to true, font formatting (including subscripts, superscripts, italics, and small caps) is identified during page synthesis rather than document synthesis. This also activates the [FontFormattingDetectionParams] section, which provides additional control over formatting detection. When this property is false, [FontFormattingDetectionParams] has no effect.

Info

By default, Pdftools OCR Service identifies font formatting at the document synthesis stage. If you set this property to true, you must turn off formatting detection during document synthesis by setting DetectFontFormatting to false.

Moving font detection to page synthesis can reduce processing time and memory consumption during document synthesis, but may produce less accurate formatting results.
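
The pairing described above might look like the following fragment. This is a hedged sketch: the section name [SynthesisParamsForDocument] is an assumption (this page names only the DetectFontFormatting key, not the section it belongs to), as are the `Key = Value` syntax and `;` comments.

```ini
; Sketch only: "Key = Value" syntax and ';' comments are assumptions.
[SynthesisParamsForPage]
; Detect font formatting during page synthesis.
; This also makes the [FontFormattingDetectionParams] section take effect.
DetectFontFormattingAtPageLevel = true

; ASSUMED section name for the document synthesis settings.
[SynthesisParamsForDocument]
; Turn off formatting detection at the document synthesis stage,
; as required when page-level detection is enabled.
DetectFontFormatting = false
```

Setting only the first key without the second would leave formatting detection enabled in both stages, which the note above says to avoid.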


DetectDocumentLinks

Key | Type | Default
DetectDocumentLinks | Boolean | true

When set to true, Pdftools OCR Service identifies internal document references such as cross-references during page synthesis.


DetectHyperlinks

Key | Type | Default
DetectHyperlinks | Boolean | true

When set to true, Pdftools OCR Service identifies hyperlinks (URLs, email addresses) in recognized text during page synthesis.
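
Because both link settings default to true, a configuration only needs to mention the one being turned off. The fragment below is a sketch that keeps hyperlink detection and disables internal reference detection; the `Key = Value` syntax and `;` comments are assumptions.

```ini
; Sketch only: "Key = Value" syntax and ';' comments are assumptions.
[SynthesisParamsForPage]
; Skip detection of internal document references (cross-references).
DetectDocumentLinks = false
; Keep detection of URLs and email addresses (this is already the default).
DetectHyperlinks = true
```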


SynthesizeBusinessCards

Key | Type | Default
SynthesizeBusinessCards | Boolean | false

When set to true, Pdftools OCR Service attempts to find and process business cards on the page.


Color settings

DetectBackgroundColor

Key | Type | Default
DetectBackgroundColor | ThreeStatePropertyValueEnum | TSPV_Auto

When set to TSPV_Yes, Pdftools OCR Service identifies background colors during page synthesis.

ThreeStatePropertyValueEnum

  • TSPV_Auto: Pdftools OCR Service decides automatically whether to apply the processing mode, based on factors such as image characteristics.
  • TSPV_No: The processing mode is not applied.
  • TSPV_Yes: The processing mode is applied.

AllowGrayBackgroundColor

Key | Type | Default
AllowGrayBackgroundColor | ThreeStatePropertyValueEnum | TSPV_Auto

When set to TSPV_Yes, gray shades are included in background color detection. Otherwise, backgrounds are classified as either black or white. This property only applies when DetectBackgroundColor is set to TSPV_Yes or TSPV_Auto.


DetectTextColor

Key | Type | Default
DetectTextColor | ThreeStatePropertyValueEnum | TSPV_Auto

When set to TSPV_Yes, Pdftools OCR Service identifies text colors during page synthesis.


AllowGrayTextColor

Key | Type | Default
AllowGrayTextColor | ThreeStatePropertyValueEnum | TSPV_Auto

When set to TSPV_Yes, gray shades are included in text color detection. Otherwise, text is classified as either black or white. This property only applies when DetectTextColor is set to TSPV_Yes or TSPV_Auto.
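
The four detection settings above can be combined in one fragment. The sketch below forces both kinds of color detection on, allows gray backgrounds, and excludes gray from text color detection; the `Key = Value` syntax and `;` comments are assumptions, and the chosen values are illustrative rather than recommended.

```ini
; Sketch only: "Key = Value" syntax and ';' comments are assumptions.
[SynthesisParamsForPage]
; Always detect background colors, including gray shades.
DetectBackgroundColor = TSPV_Yes
AllowGrayBackgroundColor = TSPV_Yes
; Always detect text colors, but classify text as black or white instead of gray.
DetectTextColor = TSPV_Yes
AllowGrayTextColor = TSPV_No
```

The AllowGray* keys only apply while the matching Detect* key is TSPV_Yes or TSPV_Auto, so setting DetectTextColor to TSPV_No would make AllowGrayTextColor irrelevant.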


CorrectDynamicRange

Key | Type | Default
CorrectDynamicRange | ThreeStatePropertyValueEnum | TSPV_Auto

When set to TSPV_Yes, image colors are adjusted to maximize contrast between text and background (for example, making the background white and text black, or the reverse). This improves image quality but increases processing time.

This property is most effective when DetectBackgroundColor and DetectTextColor are set to TSPV_Yes or TSPV_Auto.
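
A fragment along these lines would force dynamic range correction while leaving color detection at its automatic defaults, trading extra processing time for higher contrast. It is a sketch under the assumption of `Key = Value` syntax and `;` comments.

```ini
; Sketch only: "Key = Value" syntax and ';' comments are assumptions.
[SynthesisParamsForPage]
; Always adjust image colors to maximize text/background contrast.
CorrectDynamicRange = TSPV_Yes
; Keep color detection available so the correction is most effective
; (TSPV_Auto is already the default for both keys).
DetectBackgroundColor = TSPV_Auto
DetectTextColor = TSPV_Auto
```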