Pdftools OCR Service release notes

Learn about new updates in the OCR Service, including new features, improvements, and bug fixes.

Version 1.1.0

6 March 2026

Added

  • Docker container support for Pdftools OCR Service. With this release, the Manager and Worker nodes can be deployed as Docker containers. For more details, review OCR Service in Docker.
  • Added support for additional OCR languages: Chinese, Japanese, Korean, Thai, Hebrew, and Arabic.
  • Added predefined profiles: DataExtraction, DocumentConversion_Normal, HighCompressedImageOnlyPdf, BusinessCardsProcessing, and MachineReadableZone. For details on each profile, review Predefined profiles.
  • Added INI file sections for custom profiles:
  • Persistent connections with automatic retry and reconnect logic for interservice communication between processes within a Worker node. Worker service processes maintain long-lived connections instead of establishing a new connection for each request, reducing connection overhead.

Changed

  • Removed predefined profiles:
    • DocumentConversion_Speed
    • BookArchiving_Accuracy
    • BookArchiving_Speed
  • All predefined OCR profiles use updated parameter names and settings. If you use custom profiles that reference or extend predefined profiles, review and update your configurations after upgrading. For details, review Predefined profiles and the following section, Parameter changes by INI file section.

Parameter changes by INI file section

The following sections describe changed, moved, and removed parameters in each INI file section.

[PrepareImageMode]

The following parameters were moved to other INI file sections, replaced by new parameters, or removed:

| Removed [PrepareImageMode] parameters | Replacement |
|---|---|
| Rotation | Use RotationType in [ImageProcessingParams] |
| InvertImage, MirrorImage | Moved to [ImageProcessingParams] |
| CorrectSkew, CorrectSkewMode | Use CorrectSkewMode in [PagePreprocessingParams] |
| AutoOverwriteResolution, OverwriteResolution, XResolutionToOverwrite, YResolutionToOverwrite | Use OverwriteResolutionMode and ResolutionToOverwrite in [PagePreprocessingParams] |
| DiscardColorImage, UseFastBinarization | Moved to [PagePreprocessingParams] |
| EnhanceLocalContrast | Use BackgroundWhitening in [PagePreprocessingParams] |
| PhotoProcessingMode | Use ImageSourceType in [PrepareImageMode] and DetectImageType in [PagePreprocessingParams] |
| ImageCompression | Use CompressImageMode in [PrepareImageMode] |
| BackgroundFillingColor, CreatePreview, PreviewHeight, PreviewWidth | Removed with no replacement |

New [PrepareImageMode] parameters:

  • CompressImageMode
  • DocumentType
  • ImageSourceType
  • RasterizeFreeText
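
To find affected settings in a custom profile before upgrading, you can scan its [PrepareImageMode] section for the removed parameters listed above. The following Python sketch is illustrative only: the function name and sample values are hypothetical, and it assumes your profile parses as a standard INI file.

```python
import configparser

# Removed [PrepareImageMode] parameters and the replacement noted for each,
# transcribed from the release notes for version 1.1.0.
_GROUPS = [
    (("Rotation",), "use RotationType in [ImageProcessingParams]"),
    (("InvertImage", "MirrorImage"), "moved to [ImageProcessingParams]"),
    (("CorrectSkew", "CorrectSkewMode"),
     "use CorrectSkewMode in [PagePreprocessingParams]"),
    (("AutoOverwriteResolution", "OverwriteResolution",
      "XResolutionToOverwrite", "YResolutionToOverwrite"),
     "use OverwriteResolutionMode and ResolutionToOverwrite in [PagePreprocessingParams]"),
    (("DiscardColorImage", "UseFastBinarization"),
     "moved to [PagePreprocessingParams]"),
    (("EnhanceLocalContrast",), "use BackgroundWhitening in [PagePreprocessingParams]"),
    (("PhotoProcessingMode",),
     "use ImageSourceType in [PrepareImageMode] and DetectImageType in [PagePreprocessingParams]"),
    (("ImageCompression",), "use CompressImageMode in [PrepareImageMode]"),
    (("BackgroundFillingColor", "CreatePreview", "PreviewHeight", "PreviewWidth"),
     "removed with no replacement"),
]
REMOVED_PREPARE_IMAGE_MODE = {
    name: hint for names, hint in _GROUPS for name in names
}


def find_removed_parameters(ini_text: str) -> list[tuple[str, str]]:
    """Return (parameter, hint) pairs for removed [PrepareImageMode] keys."""
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.optionxform = str  # keep parameter names case-sensitive
    parser.read_string(ini_text)
    if not parser.has_section("PrepareImageMode"):
        return []
    return [
        (name, REMOVED_PREPARE_IMAGE_MODE[name])
        for name in parser["PrepareImageMode"]
        if name in REMOVED_PREPARE_IMAGE_MODE
    ]


# Hypothetical profile fragment; the values are placeholders.
sample = """
[PrepareImageMode]
InvertImage = false
CreatePreview = true
RasterizeFreeText = true
"""

for name, hint in find_removed_parameters(sample):
    print(f"{name}: {hint}")
```

The sketch only reports findings. Choosing the new values still requires the Predefined profiles documentation, because several replacements change the parameter type or enum values.
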
[PagePreprocessingParams]

| Removed [PagePreprocessingParams] parameters | Replacement |
|---|---|
| CorrectOrientation (Boolean) | CorrectOrientationMode (enum: COM_Auto, COM_Clockwise, COM_CounterClockwise, COM_UpsideDown, COM_No) |
| CorrectShadowsAndHighlights | Removed with no replacement |
| CorrectSkew + CorrectSkewMode (6-value flag enum) | Single CorrectSkewMode with new enum values: CSM_Auto, CSM_Fast, CSM_Off |
| GeometryCorrectionMode | CorrectGeometry (ThreeStatePropertyValueEnum) |
| ResolutionCorrectionMode | OverwriteResolutionMode and ResolutionToOverwrite |

  • Type change: CorrectInvertedImage changed from Boolean (default false) to ThreeStatePropertyValueEnum (default TSPV_Auto)
  • New [PagePreprocessingParams] parameters:
    • StraightenLinesMode
    • BackgroundWhitening
    • CropImage
    • DetectImageType
    • DiscardColorImage
    • UseFastBinarization
    • SplitType
    • OverwriteResolutionMode
    • ResolutionToOverwrite

[PageAnalysisParams]

Removed [PageAnalysisParams] parameters:

  • EnableTextExtractionMode: Replaced with AnalysisMode (set to PAM_TextExtraction)
  • AggressiveTableDetection: Removed with no replacement
  • DetectMultipleBusinessCards: Removed with no replacement
  • ProhibitModelAnalysis: Removed with no replacement

New [PageAnalysisParams] parameters:

  • AnalysisMode
  • SpeedQualityMode
  • DetectHandwritten
  • DetectCheckmarks
  • DetectStamps
  • PaperSizeDetectionMode
  • DetectTextOnPictures (moved from [ObjectsExtractionParams])
  • CollectPdfExportData

[RecognizerParams]

Removed [RecognizerParams] parameters:

  • BalancedMode (Boolean): Replaced with Mode (set to RM_Normal)
  • FastMode (Boolean): Replaced with Mode (set to RM_Fast)
  • WritingStyle (all WS_* values): Removed with no replacement

Enum rename:

  • TT_Handprinted renamed to TT_Handwritten in TextTypeEnum.

New [RecognizerParams] parameters:

  • Mode
  • DetectTextTypesIndependently
  • ProhibitSmallCaps
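
The BalancedMode and FastMode replacements described above can be sketched as a small migration helper. This is a minimal illustration, not an official tool: the function names are hypothetical, Boolean values are assumed to be written as true or false, and the handling of conflicting or pre-existing settings is an assumption of this sketch.

```python
def _is_true(value: str) -> bool:
    """Interpret an INI Boolean; assumes values are spelled true/false."""
    return value.strip().lower() == "true"


def migrate_recognizer_mode(params: dict[str, str]) -> dict[str, str]:
    """Return a copy of [RecognizerParams] using Mode instead of the old flags."""
    migrated = dict(params)
    fast = _is_true(migrated.pop("FastMode", "false"))
    balanced = _is_true(migrated.pop("BalancedMode", "false"))
    migrated.pop("WritingStyle", None)  # removed with no replacement

    if fast and balanced:
        # Assumption: this sketch refuses to guess when both flags are set.
        raise ValueError("FastMode and BalancedMode are both enabled; set Mode manually")
    if fast:
        migrated.setdefault("Mode", "RM_Fast")    # an existing Mode is kept
    elif balanced:
        migrated.setdefault("Mode", "RM_Normal")  # an existing Mode is kept
    return migrated


# Hypothetical input; the ProhibitSmallCaps value is a placeholder.
old = {"FastMode": "true", "BalancedMode": "false", "ProhibitSmallCaps": "false"}
print(migrate_recognizer_mode(old))
# prints {'ProhibitSmallCaps': 'false', 'Mode': 'RM_Fast'}
```

If neither flag was enabled, the helper leaves Mode unset so that the default applies.
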
[BarcodeParams]

| Change | Details |
|---|---|
| Removed [BarcodeParams] parameters | MinRatioToTextHeight, EnableAdvancedExtractionMode, IsCode39WithoutAsterisk |
| New parameter | EnableBarcodesCheck |
| New BarcodeTypeEnum values | BT_AutodetectWithoutPostal, BT_Australia4State, BT_JapanPost, BT_KIX, BT_RoyalMail4State |
| Removed BarcodeSupplementTypeEnum value | BS_Unknown |
| New CodePageEnum values for added languages | CP_Hebrew, CP_Hebrew_ISO, CP_Hebrew_Macintosh, CP_Hebrew_MSDOS, CP_Japan_Mac, CP_Japan_SJIS, CP_Korean, CP_Korean_Johab, CP_Korean_Mac, CP_Thai, CP_Thai_Macintosh |

[FontFormattingDetectionParams]

  • Moved parameter: DetectDropCaps moved to [PageAnalysisParams].
  • Default change: DetectSpacing default changed from true to false.

[ObjectsExtractionParams]

  • Moved parameter: DetectTextOnPictures moved to [PageAnalysisParams].
  • New [ObjectsExtractionParams] parameter: SourceContentReuseMode.

[SynthesisParamsForPage]

New [SynthesisParamsForPage] parameters:

  • DetectDocumentLinks
  • DetectHyperlinks
  • SynthesizeBusinessCards
  • AllowGrayTextColor

Removed [OrientationDetectionParams]

  • The [OrientationDetectionParams] INI file section was removed. Use CorrectOrientationMode in [PagePreprocessingParams] instead.

Fixed

  • Before this update, Worker service processes established a new connection to the main Worker service for each request. In rare cases, a race condition during connection setup caused request failures. With this update, persistent connections with automatic reconnect logic resolve this issue.

Version 1.0.2

10 November 2025

Fixed

  • Fixed SDK corruption issues with ABBYY FineReader Engine that could cause the Pdftools OCR Service to become unstable or crash during processing. As of this update, the service automatically detects corruption and recreates the OCR engine when needed.
  • Fixed failures when processing large files by increasing the default maximum request body size from 30 MB to 100 MB. As a result, processing of large TIFF files and multi-page documents no longer fails due to file size limitations.
  • Improved error handling and logging for failed OCR jobs, with better exception tracking and proper error code reporting.

Changed

  • The maximum request body size is now configurable using the MaxRequestBodySizeBytes setting in the Manager node appsettings.json file, so administrators can adjust the limit to their requirements. The default is 100 MB (104857600 bytes).
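
Because MaxRequestBodySizeBytes is expressed in bytes, it helps to compute the value rather than type it by hand. The following Python sketch shows the arithmetic behind the documented default (100 × 1024 × 1024 = 104857600). The helper name and the 250 MB example are hypothetical, and placing the key at the top level of appsettings.json is an assumption; check your installed file for the actual location.

```python
import json

BYTES_PER_MB = 1024 * 1024  # the documented default corresponds to binary megabytes


def request_body_limit_bytes(megabytes: int) -> int:
    """Convert a limit in MB to the byte value used by MaxRequestBodySizeBytes."""
    if megabytes <= 0:
        raise ValueError("The limit must be a positive number of megabytes")
    return megabytes * BYTES_PER_MB


# The documented default of 100 MB.
print(request_body_limit_bytes(100))  # prints 104857600

# Hypothetical fragment for a 250 MB limit. The top-level placement of the
# key is an assumption of this sketch, not confirmed by the release notes.
snippet = json.dumps({"MaxRequestBodySizeBytes": request_body_limit_bytes(250)}, indent=2)
print(snippet)
```
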

Version 1.0.1

15 September 2025

Fixed

  • Fixed PostgreSQL database migration compatibility issues to ensure correct schema updates when upgrading from previous versions.
  • Improved OCR validation and file creation processes to enhance reliability and error handling during document processing.

Version 1.0.0

30 July 2025

Added

  • The first release of the Pdftools OCR Service introduces a new OCR engine that is fully compatible with current 3-Heights® OCR Service workflows and configurations. It acts as a drop-in replacement for existing installations and requires no changes to your integration or processing pipelines. Compared to the previous OCR configurations (ABBYY FineReader used with your own license key, and the legacy 3-Heights® OCR Service), the Pdftools OCR Service also provides:
    • Streamlined installation and license key configuration.
    • Scalability options to support higher throughput.

Changed

  • Compared to the 3-Heights® OCR Service, this release of the Pdftools OCR Service doesn’t include support for the BusinessCardsProcessing predefined OCR profile.