Version: 1.0.0

Get started with OCR in the Conversion Service

Learn how to install and configure the Pdftools OCR Service with the Conversion Service.

Install Pdftools OCR Service

To install the Pdftools OCR Service, follow these steps:

  1. Log in to the Pdftools Portal.
  2. Go to the Products page, find the Pdftools OCR Service, and then click Get started or See product.
  3. In the Product kits section, find the PdftoolsOcrService-VERSION_NUMBER.msi package, and then click Download. The exact filename depends on the version you are installing.
  4. Follow the instructions in the installer.
    Screenshot of the Pdftools OCR Service Windows MSI installer.
  5. In the Pdftools Portal, on the Products page, next to the Pdftools OCR Service, click Get started or See product.
  6. Next to the Pdftools OCR Service license key, click Click to copy.
  7. Go to the installation folder of your worker nodes and open the appsettings.json file. Example of a full path:
    C:\Program Files\Pdftools\Pdftools OCR Service\PdftoolsOcrWorker\appsettings.json
  8. Replace the "<LICENSE_KEY>" placeholder with your license key, and then save the file.
    Tip: In its default configuration, the Pdftools OCR Service requires a network connection to validate the license key. For information about partially offline or fully offline solutions, review Pdftools OCR Service licensing.

  9. Open the Windows Start menu and search for Services. In the list of services, find Pdftools OCR Service Worker and start the service.
    Screenshot of the Windows services showing the registered Pdftools OCR Service nodes.
  10. Optional: Check the log files to verify that the installation succeeded. Example path to a worker-logYYYYMMDD.txt file:
    C:\ProgramData\Pdftools\OcrService\logs\worker-log20251007.txt
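
The log file name in step 10 follows the pattern worker-logYYYYMMDD.txt. As a minimal sketch, the following Python snippet builds the expected path for a given day. It assumes that the log directory matches the example path above, which may differ on your system. The function name worker_log_path is hypothetical and not part of the product.

```python
from datetime import date
from pathlib import PureWindowsPath

# Assumption: log directory copied from the example path in step 10.
LOG_DIR = PureWindowsPath(r"C:\ProgramData\Pdftools\OcrService\logs")


def worker_log_path(day: date) -> PureWindowsPath:
    """Build the worker log path for the given day (worker-logYYYYMMDD.txt)."""
    return LOG_DIR / f"worker-log{day:%Y%m%d}.txt"


# Reproduces the example path for 7 October 2025.
print(worker_log_path(date(2025, 10, 7)))
```

To look up the current log file, pass date.today() instead of a fixed date.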

To remove a trial license key or replace the current key with another one, repeat the procedure from step 7.
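
If you install several worker nodes, you might prefer to script steps 7 and 8. The following Python sketch replaces the "<LICENSE_KEY>" placeholder with a plain text substitution, so it makes no assumption about the structure of appsettings.json. It only covers the first-time case in which the placeholder is still present. The function names are hypothetical, and writing to a folder under Program Files usually requires administrator rights.

```python
import json
from pathlib import Path

# Placeholder shipped in appsettings.json, including its quotes (see step 8).
PLACEHOLDER = '"<LICENSE_KEY>"'


def insert_license_key(settings_text: str, license_key: str) -> str:
    """Return the settings text with the placeholder replaced by the key."""
    if PLACEHOLDER not in settings_text:
        raise ValueError("Placeholder not found; edit the file manually.")
    # json.dumps adds the surrounding quotes and escapes special characters.
    return settings_text.replace(PLACEHOLDER, json.dumps(license_key))


def update_settings_file(path: Path, license_key: str) -> None:
    """Rewrite the settings file in place with the license key inserted."""
    text = path.read_text(encoding="utf-8")
    path.write_text(insert_license_key(text, license_key), encoding="utf-8")
```

For example, update_settings_file(Path(r"C:\Program Files\Pdftools\Pdftools OCR Service\PdftoolsOcrWorker\appsettings.json"), "YOUR-KEY") updates the file from the path example in step 7. Restart the worker service afterwards so that it reads the new key.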

Supported platforms

The Pdftools OCR Service can be installed on Windows Server and Windows machines. Support for Docker and Linux is on our roadmap.

Set up Pdftools OCR Service with Conversion Service

Learn how to enable and configure the Pdftools OCR Service in the Conversion Service.

Before you start

Enable Pdftools OCR Service in the Conversion Service

The following steps explain how to enable and configure OCR in the Conversion Service profile:

  1. In the Conversion Service Configurator, go to Workflows & Profiles.
  2. Click the pen icon next to the workflow profile you want to edit. OCR is available in the Archive and Conversion workflows.
  3. Enable the OCR Settings toggle.
    Integration tab of the Conversion Service Configurator
  4. Go to the OCR Settings configuration section that is now displayed.
  5. In the OCR Settings section, click the Add Item button.
  6. Select Pdftools OCR Service (3H Legacy Compatible) as the OCR engine, and then click Next.
    Integration tab of the Conversion Service Configurator
  7. Optional: If the OCR service is on a different server, update the Service URL.
    Integration tab of the Conversion Service Configurator
  8. Click Apply.

The Pdftools OCR Service must be configured and accessible through HTTP from the Conversion Service. You can also configure several Pdftools OCR Service instances to distribute the OCR processing evenly. For more details, review Scale the Pdftools OCR Service.
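
Before you enter a Service URL for a remote server, it can help to confirm that the URL answers over HTTP from the machine that runs the Conversion Service. The following Python sketch treats any HTTP response, including an error status, as proof that the server is reachable. It does not call any specific OCR Service endpoint, because none is documented here. It also assumes a direct connection and therefore ignores proxy settings. The function name is_reachable is hypothetical.

```python
import urllib.error
import urllib.request

# Assumption: the OCR service is on the internal network, so connect
# directly and ignore any proxy configured in the environment.
_OPENER = urllib.request.build_opener(urllib.request.ProxyHandler({}))


def is_reachable(service_url: str, timeout: float = 5.0) -> bool:
    """Return True if the server behind service_url answers over HTTP."""
    try:
        with _OPENER.open(service_url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, even though the status indicates an error.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, timeout, and similar problems.
        return False
```

For example, is_reachable("http://ocr-host.example:8080/") returns False until a server answers at that address. The host name and port in this call are placeholders, so replace them with the Service URL from the Configurator.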

Configure OCR in the Conversion Service

You can configure parameters such as the languages that OCR identifies in documents, predefined profiles, the accuracy of text extraction, and more. To edit the configuration, follow these steps:

  1. In the Conversion Service Configurator, go to Workflows & Profiles.
  2. Click the pen icon next to the workflow profile you want to edit.
  3. Navigate to the OCR Settings section.
  4. Next to Engine, click the pen icon in the Pdftools OCR Service (3H Legacy Compatible) section.
    Integration tab of the Conversion Service Configurator
  5. After editing your configuration, click Apply.

You can edit parameters in the Parameters and Languages input fields:

  • In the Parameters input field, each key is joined to its value by an equal sign (=), and the key-value pairs are separated by semicolons (;). For more information about available parameters, review Parameters.
    • The default parameter set in the Conversion Service Configurator:
      PredefinedProfile=DocumentConversion_Accuracy
    • An example with additional parameters set in the Conversion Service Configurator:
      PredefinedProfile=DocumentArchiving_Accuracy;PreprocessingOnly=false;RemoveGarbage=0;RecognizeBlankPages=false;BlankPageMargin=0.02;DisableMaskEmbedding=false
  • In the Languages input field, you can set the languages that the OCR recognizes as one comma-separated string. For more information about available language recognition options, review Supported languages.
    • Default OCR language configuration in the Conversion Service Configurator:
      English,German,French
    • An example with additional natural and technical languages, separated by commas:
      English,German,French,Tagalog,Corsican,Spanish,Chemistry,Java
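
If you generate these values from a script or a template, the two formats above can be sketched in Python as follows. The helper names are hypothetical and not part of the Conversion Service. The sketch only handles the string syntax described above and does not check whether a parameter or language name is supported.

```python
def build_parameters(params: dict[str, object]) -> str:
    """Join key-value pairs as key=value, separated by semicolons."""
    parts = []
    for key, value in params.items():
        # Booleans are written in lowercase, as in the examples above.
        text = str(value).lower() if isinstance(value, bool) else str(value)
        if any(ch in key + text for ch in ";="):
            raise ValueError(f"Separator character in {key!r}={text!r}")
        parts.append(f"{key}={text}")
    return ";".join(parts)


def parse_parameters(text: str) -> dict[str, str]:
    """Split a Parameters string back into a dictionary of strings."""
    result = {}
    for pair in text.split(";"):
        if not pair.strip():
            continue  # tolerate a trailing semicolon
        key, sep, value = pair.partition("=")
        if not sep:
            raise ValueError(f"Missing '=' in {pair!r}")
        result[key.strip()] = value.strip()
    return result


def build_languages(languages: list[str]) -> str:
    """Join language names into one comma-separated string."""
    return ",".join(name.strip() for name in languages)


print(build_parameters({
    "PredefinedProfile": "DocumentArchiving_Accuracy",
    "PreprocessingOnly": False,
    "BlankPageMargin": 0.02,
}))
print(build_languages(["English", "German", "French"]))
```

The first call prints PredefinedProfile=DocumentArchiving_Accuracy;PreprocessingOnly=false;BlankPageMargin=0.02 and the second prints English,German,French. You can paste both results into the corresponding input fields.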

References

For more information about specific configuration options, review Pdftools OCR Service references.


Logo stating 'Powered by ABBYY' as the Pdftools OCR Service uses the ABBYY FineReader.