Archive PDF/A-1 workflow

This workflow is engineered specifically for preparing documents for archiving, specifically where PDF/A-2 or PDF/A-3 conformance is not allowed.

This workflow is disabled by default because it is recommended you use Archive PDF/A 2 workflow for archiving. Since PDF/A-2 is based on a newer version of the PDF standard, more PDF features are allowed such as transparency, layers, embedded files, and a less restrictive internal file structure.

For these reasons, achieve fewer conversion errors and better conversion quality with the Archive PDF/A-2 workflow. Particularly, if you use the Conversion Service in a Docker container where document content rasterization is unavailable, use PDF/A-2 workflow instead to avoid visual differences, removed transparency, or other potential issues.

info

More information on PDF/A-2, read PDF/A-2 overview.

Supported file formats for Archive PDF/A-1 workflow

This workflow supports these file formats:

Content type	File type
Document formats	PDF 1.x, PDF 2.0, PDF/A-1, PDF/A-2, PDF/A-3
Image formats	JPEG, JPEG200, TIFF, BMP, GIF, JBIG2, PNG, HEIC, HEIF, WebP
Email	EML, MSG (without encryption)
Word	DOC, DOT, DOCX, DOCM, DOTX, DOTM, RTF, XML (WordprocessingML 2003)
Excel	XLS, XLT, XLSX, XLSM, XLTX, XLTM, XML (SpreadsheetML 2003)
PowerPoint	PPT, PPS, PPTX, PPTM, PPSX, PPSM
OpenOffice	ODT, ODS, ODP
Other	CSV, HTML, HTM (prepared for archiving), TXT, XML, ZIP (without password protection)

Note on OpenOffice formats

PDF conversion of OpenDocument Format depends on the rendering in Microsoft Word, Excel or PowerPoint. In particular, visual differences may occur with tables and tabs. The visual differences caused by the rendering of shapes are usually not acceptable.

Note on HTML format

HTML documents need to be self-contained (layout information and images are either inline or available on the web) and suited for portrait page layout. JavaScript content is disabled during processing.

Note on XML format

Layout information and images need to be available on the web.

The conversion of most file formats is enabled by default in the profile's Convert Mode for Child Documents (Attachments).

Features and limitations

Compared to the PDF/A-2 workflow, this workflow has the following features and limitations:

Collect mode

The PDF/A-1 standard does not allow embedded files. Therefore, the only collect mode configuration available is Merge.

Document content rasterization

Certain graphical features of PDF documents are not allowed in PDF/A-1; for example, transparency. The removal of these features can lead to severe visual differences that may render the page's content unreadable.

The Windows version of the product can preserve the visual appearance of such documents by converting the pages' content to images (content rasterization).When doing so, a content rasterized warning is generated. If possible, content rasterization preserves the pages' extractable text, links, outlines and viewer preferences.

The Docker version cannot rasterize pages. Therefore, the conversion of such files leads to transparency removed or visual differences warning depending on whether the visual differences are caused by the removal of transparency.

PDF/A-1 specific conversion warnings

The following warnings occur more frequently when converting to PDF/A-1:

Annotation removed
Content rasterized
Layers removed
Transparency removed
Visual differences So the profile's conversion settings for these warnings is particularly important.
Note: Content rasterized warning appears only in Windows where content rasterization is available; Transparency removed warning only appears in Docker image, where content rasterizaiton is not available.

Sign

Document signing is currently not implemented.

Job and document options for the PDF/A-1 workflow

The PDF/A-1 workflow lets you use job and document options to pass job- and document-specific values to be used when processing documents using the workflow.

Job options

Job options apply to all documents processed in the same job. Any subsequent jobs processed with the workflow profile use the profile's default settings.

Type	Option	Description
Document compression and optimization	`OPTIMIZE`	Turn on or off document compression and optimization. All settings must be previously set up in the profile. If `true`, documents included in the job are compressed and optimized according to the optimization profile set in the profile settings. If `false`, no compression and optimization is performed. Documents can be optimized according to five profiles: Web: Compresses the file without affecting viewing quality on digital devices Print: Compresses the file without affecting print quality Max: Removes redundant data and reduces image resolution to achieve a minimal viable file size MRC: Profile designed to process mixed raster content Archive: Prepares a document for archiving in PDF/A format
OCR	`OCR`	Turn on and off optical character recognition for the job. All settings must be previously set up in the profile. If `true`, documents included in the job are processed to recognize any images as text (as appropriate). If `false`, no OCR is performed.
Metadata	`META.AUTHOR`	The author of the document
Metadata	`META.TITLE`	The title of the document
Metadata	`META.SUBJECT`	The subject of the document
Metadata	`META.KEYWORDS`	Keywords that apply to the document

note

Apart from the standard metadata properties, you can also set extended metadata properties.

Document options

Document options apply only to a specific input. It allows you to determine specific properties based on an individual document, rather than as a global setting (either determined by the job or the profile). Any subsequent jobs processed with the workflow profile use the profile's default settings.

Type	Option	Description
Document property	`DOC.PASSWORD`	Set the password for the document.

Supported file formats for Archive PDF/A-1 workflow​

Features and limitations​

Collect mode​

Document content rasterization​

PDF/A-1 specific conversion warnings​

Sign​

Job and document options for the PDF/A-1 workflow​

Job options​

Document options​