[SynthesisParamsForPage] INI file section
The [SynthesisParamsForPage] INI file section defines page-level synthesis settings for paragraph structure, font formatting, link detection, and color detection.
Main settings
ParagraphExtractionMode
| Key | Type | Default |
|---|---|---|
ParagraphExtractionMode | ParagraphExtractionModeEnum | PEM_NormalExtraction |
Controls how paragraphs are identified and extracted from recognized text.
ParagraphExtractionModeEnum
PEM_NormalExtraction: Standard paragraph detection and extraction.PEM_RoughExtraction: Produces the fewest paragraphs possible—either one per block or only those beginning with an enlarged initial letter (drop cap).PEM_SingleLineParagraphsWithSpaceFormatting: Treats each line as its own paragraph, preserving space-based formatting.PEM_SingleLineParagraphsWithWordSeparationOnly: Treats each line as its own paragraph, using spaces only to separate words without additional formatting.
DetectFontFormattingAtPageLevel
| Key | Type | Default |
|---|---|---|
DetectFontFormattingAtPageLevel | Boolean | false |
When set to true, font formatting—including subscripts, superscripts, italics, and small caps—is identified during page synthesis rather than document synthesis. This also activates the [FontFormattingDetectionParams] section, which provides additional control over formatting detection. When this property is false, [FontFormattingDetectionParams] has no effect.
By default, Pdftools OCR Service identifies font formatting at the document synthesis stage. If you set this property to true, you must turn off formatting detection during document synthesis by setting DetectFontFormatting to false.
Moving font detection to page synthesis can reduce processing time and memory consumption during document synthesis, but may produce less accurate formatting results.
DetectDocumentLinks
| Key | Type | Default |
|---|---|---|
DetectDocumentLinks | Boolean | true |
When set to true, Pdftools OCR Service identifies internal document references such as cross-references during page synthesis.
DetectHyperlinks
| Key | Type | Default |
|---|---|---|
DetectHyperlinks | Boolean | true |
When set to true, Pdftools OCR Service identifies hyperlinks (URLs, email addresses) in recognized text during page synthesis.
SynthesizeBusinessCards
| Key | Type | Default |
|---|---|---|
SynthesizeBusinessCards | Boolean | false |
When set to true, Pdftools OCR Service attempts to find and process business cards on the page.
Color settings
DetectBackgroundColor
| Key | Type | Default |
|---|---|---|
DetectBackgroundColor | ThreeStatePropertyValueEnum | TSPV_Auto |
When set to TSPV_Yes, Pdftools OCR Service identifies background colors during page synthesis.
ThreeStatePropertyValueEnum
TSPV_Auto: Automatically determine if this processing mode should be used, depending on the situation (image characteristics, etc.).TSPV_No: The processing mode in question will not be used.TSPV_Yes: The processing mode in question will be used.
AllowGrayBackgroundColor
| Key | Type | Default |
|---|---|---|
AllowGrayBackgroundColor | ThreeStatePropertyValueEnum | TSPV_Auto |
When set to TSPV_Yes, gray shades are included in background color detection. Otherwise, backgrounds are classified as either black or white. This property only applies when DetectBackgroundColor is set to TSPV_Yes or TSPV_Auto.
DetectTextColor
| Key | Type | Default |
|---|---|---|
DetectTextColor | ThreeStatePropertyValueEnum | TSPV_Auto |
When set to TSPV_Yes, Pdftools OCR Service identifies text colors during page synthesis.
AllowGrayTextColor
| Key | Type | Default |
|---|---|---|
AllowGrayTextColor | ThreeStatePropertyValueEnum | TSPV_Auto |
When set to TSPV_Yes, gray shades are included in text color detection. Otherwise, text is classified as either black or white. This property only applies when DetectTextColor is set to TSPV_Yes or TSPV_Auto.
CorrectDynamicRange
| Key | Type | Default |
|---|---|---|
CorrectDynamicRange | ThreeStatePropertyValueEnum | TSPV_Auto |
When set to TSPV_Yes, image colors are adjusted to maximize contrast between text and background (for example, making the background white and text black, or the reverse). This improves image quality but increases processing time.
This property is most effective when DetectBackgroundColor and DetectTextColor are set to TSPV_Yes or TSPV_Auto.