Pdftools OCR Service release notes
Learn about new updates in the OCR Service, including new features, improvements, and bug fixes.
Version 1.1.0
6 March 2026
Added
- Docker container support for Pdftools OCR Service. With this release, the Manager and Worker nodes can be deployed as Docker containers. For more details, review OCR Service in Docker.
- Added support for multiple OCR languages: Chinese, Japanese, Korean, Thai, Hebrew, and Arabic.
- Added predefined profiles:
DataExtraction,DocumentConversion_Normal,HighCompressedImageOnlyPdf,BusinessCardsProcessing, andMachineReadableZone. For details on each profile, review Predefined profiles. - Added INI file sections for custom profiles:
[DocumentStructureDetectionParams]for configuring headline, footnote, caption, column, list, and table of contents detection.[SortingBlocksParams]for controlling how Pdftools OCR Service combines recognized text blocks into groups.[TextLayerInjectionParams]for configuring searchable PDF creation with text layer injection.
- Persistent connections with automatic retry and reconnect logic for interservice communication between processes within a Worker node. Worker service processes maintain long-lived connections instead of establishing a new connection for each request, reducing connection overhead.
Changed
- Removed predefined profiles:
DocumentConversion_SpeedBookArchiving_AccuracyBookArchiving_Speed
- All predefined OCR profiles use updated parameter names and settings. If you use custom profiles that reference or extend predefined profiles, review and update your configurations after upgrading. For details, review Predefined profiles and the following section Parameter changes by INI file section.
Parameter changes by INI file section
The following sections describe changed, moved, and removed parameters in each INI file section.
[PrepareImageMode]
The following parameters moved to other INI file sections or have new replacements:
Removed [PrepareImageMode] parameters | Replacement |
|---|---|
Rotation | Use RotationType in [ImageProcessingParams] |
InvertImage, MirrorImage | Moved to [ImageProcessingParams] |
CorrectSkew, CorrectSkewMode | Use CorrectSkewMode in [PagePreprocessingParams] |
AutoOverwriteResolution, OverwriteResolution, XResolutionToOverwrite, YResolutionToOverwrite | Use OverwriteResolutionMode and ResolutionToOverwrite in [PagePreprocessingParams] |
DiscardColorImage, UseFastBinarization | Moved to [PagePreprocessingParams] |
EnhanceLocalContrast | Use BackgroundWhitening in [PagePreprocessingParams] |
PhotoProcessingMode | Use ImageSourceType in [PrepareImageMode] and DetectImageType in [PagePreprocessingParams] |
ImageCompression | Use CompressImageMode in [PrepareImageMode] |
BackgroundFillingColor, CreatePreview, PreviewHeight, PreviewWidth | Removed with no replacement |
New [PrepareImageMode] parameters:
CompressImageModeDocumentTypeImageSourceTypeRasterizeFreeText
[PagePreprocessingParams]
Removed [PagePreprocessingParams] parameters | Replacement |
|---|---|
CorrectOrientation (Boolean) | CorrectOrientationMode (enum: COM_Auto, COM_Clockwise, COM_CounterClockwise, COM_UpsideDown, COM_No) |
CorrectShadowsAndHighlights | Removed with no replacement |
CorrectSkew + CorrectSkewMode (6-value flag enum) | Single CorrectSkewMode with new enum values: CSM_Auto, CSM_Fast, CSM_Off |
GeometryCorrectionMode | CorrectGeometry (ThreeStatePropertyValueEnum) |
ResolutionCorrectionMode | OverwriteResolutionMode and ResolutionToOverwrite |
- Type change:
CorrectInvertedImagechanged fromBoolean(defaultfalse) toThreeStatePropertyValueEnum(defaultTSPV_Auto) - New
[PagePreprocessingParams]parameters:StraightenLinesModeBackgroundWhiteningCropImageDetectImageTypeDiscardColorImageUseFastBinarizationSplitTypeOverwriteResolutionModeResolutionToOverwrite
[PageAnalysisParams]
Removed [PageAnalysisParams] parameters:
EnableTextExtractionMode: replaced withAnalysisMode(set toPAM_TextExtraction).AggressiveTableDetection: Removed with no replacementDetectMultipleBusinessCards: Removed with no replacementProhibitModelAnalysis: Removed with no replacement
New [PageAnalysisParams] parameters:
AnalysisModeSpeedQualityModeDetectHandwrittenDetectCheckmarksDetectStampsPaperSizeDetectionModeDetectTextOnPictures(moved from[ObjectsExtractionParams])CollectPdfExportData
[RecognizerParams]
Removed [RecognizerParams] parameters:
BalancedMode(Boolean): Replaced withMode(set toRM_Normal)FastMode(Boolean): Replaced withMode(set toRM_Fast)WritingStyle(allWS_*values): Removed with no replacement
Enum rename:
TT_Handprintedrenamed toTT_HandwritteninTextTypeEnum.
New [RecognizerParams] parameters:
ModeDetectTextTypesIndependentlyProhibitSmallCaps
[BarcodeParams]
| Change | Details |
|---|---|
Removed [BarcodeParams] parameters | MinRatioToTextHeight, EnableAdvancedExtractionMode, IsCode39WithoutAsterisk |
| New parameter | EnableBarcodesCheck |
New BarcodeTypeEnum values | BT_AutodetectWithoutPostal, BT_Australia4State, BT_JapanPost, BT_KIX, BT_RoyalMail4State |
Removed BarcodeSupplementTypeEnum value | BS_Unknown |
New CodePageEnum values for added languages | CP_Hebrew, CP_Hebrew_ISO, CP_Hebrew_Macintosh, CP_Hebrew_MSDOS, CP_Japan_Mac, CP_Japan_SJIS, CP_Korean, CP_Korean_Johab, CP_Korean_Mac, CP_Thai, CP_Thai_Macintosh |
[FontFormattingDetectionParams]
- Moved parameter:
DetectDropCapsmoved to[PageAnalysisParams]. - Default change:
DetectSpacingdefault changed fromtruetofalse.
[ObjectsExtractionParams]
- Moved parameter:
DetectTextOnPicturesmoved to[PageAnalysisParams]. - New
[ObjectsExtractionParams]parameter:SourceContentReuseMode.
[SynthesisParamsForPage]
New [SynthesisParamsForPage] parameters:
DetectDocumentLinksDetectHyperlinksSynthesizeBusinessCardsAllowGrayTextColor
Removed [OrientationDetectionParams]
- The
[OrientationDetectionParams]INI file section was removed. UseCorrectOrientationModein[PagePreprocessingParams]instead.
Fixed
- Before this update, Worker service processes established a new connection to the main Worker service for each request. In rare cases, a race condition during connection setup caused request failures. With this update, persistent connections with automatic reconnect logic resolve this issue.
Version 1.0.2
10 November 2025
Fixed
- Fixed SDK corruption issues with ABBYY FineReader Engine that could cause the Pdftools OCR Service to become unstable or crash during processing. As of this update, the service automatically detects corruption and recreates the OCR engine when needed.
- Fixed failures when processing large files by increasing the default maximum request body size from 30 MB to 100 MB. As a result, processing of large TIFF files and multi-page documents no longer fails due to file size limitations.
- Improved error handling and logging for failed OCR jobs, with better exception tracking and proper error code reporting.
Changed
- The maximum request body size is now configurable using the
MaxRequestBodySizeBytessetting in the manager nodeappsettings.jsonfile, allowing administrators to adjust the limit based on their specific requirements. The configured default is 100 MB (specifically, it is defined as 104857600 bytes).
Version 1.0.1
15 September 2025
Fixed
- Fixed PostgreSQL database migration compatibility issues to ensure correct schema updates when upgrading from previous versions.
- Improved OCR validation and file creation processes to enhance reliability and error handling during document processing.
Version 1.0.0
30 July 2025
Added
- First release of the Pdftools OCR Service introduces a new OCR engine that is fully compatible with current 3-Heights® OCR Service workflows and configurations. It acts as a drop-in replacement for existing installations, requiring no changes to your integration or processing pipelines. Compared to the previous OCR configurations (the ABBYY FineReader using your license key and the legacy 3-Heights® OCR Service), the new Pdftools OCR Service also provides:
- Streamlined installation and license key configuration.
- Scalability options to support higher throughput.
Changed
- Compared to the 3-Heights® OCR Service, this release of the Pdftools OCR Service doesn’t include support for the
BusinessCardsProcessingpredefined OCR profile.