Conversion workflow

This workflow converts documents to PDF 1.x. Unlike the Archive PDF/A workflows, it converts files to PDF only (not PDF/A), does not validate the file format, and produces output documents that cannot be signed.

The workflow supports these features:

  • Optimization for speed or size
  • Optical character recognition (OCR)
  • Office file conversion (as required)
  • Configuration of attachment conversion

Supported file formats for Conversion workflow

This workflow supports these file formats:

Type              Extensions
Document formats  PDF 1.x, PDF 2.0, PDF/A-1, PDF/A-2, PDF/A-3
Email             EML, MSG (without encryption)
Word              DOC, DOT, DOCX, DOCM, DOTX, DOTM, RTF, XML (WordprocessingML 2003)
Excel             XLS, XLT, XLSX, XLSM, XLTX, XLTM, XML (SpreadsheetML 2003)
OpenOffice        ODT, ODS, ODP
Other             CSV, HTML, HTM (prepared for archiving), TXT, XML, ZIP (without password protection)

Compared to the Archive PDF/A workflows, the Conversion workflow offers these additional features:

Optimize for speed or size

The workflow's profile offers an option to optimize for processing time (speed) or for minimal document file size.

Convert mode configuration for child documents (Attachments)

Certain child documents can be skipped (removed) during conversion to PDF, such as attachments of emails or PDF documents. The convert mode can be specified based on the type of the child document, its filename, or the type of its parent document.

For example, executables attached to an email are removed by default. If desired, rules can be added so that files that cannot be converted (e.g. PDF documents containing unrendered XFA, HTML documents) are attached to the resulting output document in their original source format.
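The rule matching described above can be illustrated with a minimal sketch. It is not the product's actual profile format or API: the `Rule` class, the mode names ("convert", "skip", "attach"), the type labels, and the first-match-wins ordering are all assumptions made for illustration.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase
from typing import List, Optional


@dataclass(frozen=True)
class Rule:
    """Hypothetical convert-mode rule; a criterion left as None matches anything."""
    mode: str                               # assumed names: "convert", "skip", "attach"
    child_type: Optional[str] = None        # type of the child document
    filename_pattern: Optional[str] = None  # lowercase glob pattern for the filename
    parent_type: Optional[str] = None       # type of the parent document

    def matches(self, child_type: str, filename: str, parent_type: str) -> bool:
        if self.child_type is not None and self.child_type != child_type:
            return False
        if self.filename_pattern is not None and not fnmatchcase(
            filename.lower(), self.filename_pattern
        ):
            return False
        if self.parent_type is not None and self.parent_type != parent_type:
            return False
        return True


def resolve_mode(rules: List[Rule], child_type: str, filename: str,
                 parent_type: str, default: str = "convert") -> str:
    """Return the mode of the first matching rule (assumed first-match-wins)."""
    for rule in rules:
        if rule.matches(child_type, filename, parent_type):
            return rule.mode
    return default


# Example rule set mirroring the text: drop executables attached to emails,
# and attach HTML children in their original format instead of converting them.
RULES = [
    Rule(mode="skip", filename_pattern="*.exe", parent_type="email"),
    Rule(mode="attach", child_type="html"),
]

print(resolve_mode(RULES, "executable", "Setup.EXE", "email"))
print(resolve_mode(RULES, "html", "page.html", "email"))
print(resolve_mode(RULES, "word", "report.docx", "email"))
```

In this sketch a child that matches no rule falls back to ordinary conversion. The real workflow's defaults and rule precedence may differ.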

Collect mode configuration

The collect mode configuration defines how a converted document and its child documents are combined. The collect mode can be configured for each document type and also defines how errors are handled.

For example, an email can be converted by creating a PDF collection (Portfolio) of its body and attachments. Alternatively, when converting Word documents, all embedded files can be merged into the converted PDF.
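As a rough illustration of per-type collect modes, the following sketch models a lookup keyed by document type. The `CollectConfig` class, the mode names ("collection", "merge"), and the error policies ("fail", "skip") are hypothetical and are not taken from the product's configuration schema.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class CollectConfig:
    """Hypothetical collect-mode settings for one document type."""
    mode: str      # assumed names: "collection" (Portfolio) or "merge"
    on_error: str  # assumed names: "fail" (abort) or "skip" (drop the child)


# Assumed example configuration: emails become a collection, Word files are merged.
CONFIGS: Dict[str, CollectConfig] = {
    "email": CollectConfig(mode="collection", on_error="skip"),
    "word": CollectConfig(mode="merge", on_error="fail"),
}


def collect(doc_type: str, body: str, children: List[Tuple[str, bool]],
            configs: Dict[str, CollectConfig]) -> dict:
    """Combine a converted body with its children according to the type's config.

    Each child is a (name, converted_ok) pair. The result is only a description
    of the combination, not an actual PDF.
    """
    config = configs[doc_type]
    kept = []
    for name, converted_ok in children:
        if converted_ok:
            kept.append(name)
        elif config.on_error == "fail":
            raise RuntimeError(f"conversion of child {name!r} failed")
        # on_error == "skip": the failed child is left out of the result
    return {"mode": config.mode, "parts": [body] + kept}


result = collect("email", "body.pdf", [("a.pdf", True), ("b.pdf", False)], CONFIGS)
print(result)
```

Keeping the error policy next to the mode reflects the statement that the collect mode also defines how errors are handled. How the actual workflow reports or recovers from failed children is not specified here.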