Hier finden Sie Artikel, die Ihnen einen praktischen Einblick in die Welt der PDFs geben. Erfahren Sie alles über PDFs, von den Grundlagen bis hin zu den besten Verfahren.
The caveats of assembling PDF/A documents
Assembling PDF documents from various sources is a crucial part of an output management system. And, as the document needs to be archived in most cases, it should conform to the PDF/A standard. But, is there a way to ass ...
Digging for information by extracting data from a PDF document
Extracting text from a PDF document is one of the most popular information retrieval function. But how about other information such as images, metadata and more? It can be simple - but also tricky.
How to convert signed documents to PDF/A?
I often get the question whether it is possible to convert digitally signed documents to PDF/A. Because there's no short answer to this I thought it would be helpful to explore the topic a bit into more detail. What tec ...
Font subsetting - how it works and when to use
In order to reduce the file size PDF producers use a technique called font subsetting. What does exactly happen with the fonts and what are the consequences?
Digital signatures in PDF/A
Digital signatures are still not very widely used and the the knowledge about them is often fuzzy. This article tries to give an overview about this huge and complex topic.
Automating the conversion of Microsoft Office Documents to PDF/A
A central service to convert Microsoft Office documents to PDF or PDF/A has obvious advantages. The conversion is done on an enterprise wide platform with well defined software versions and conversion process configurati ...
Can I trust PDF validation software?
If I use validation software from different manufacturers I sometimes get different results. Why can this happen? Does it mean that I can't trust the software? What can I do about it? I hear these and more questions very ...
The discipline of converting PDF to PDF/A
We all know that the conversion one file format to another is not as easy as one might wish for and can lead to unpleasant surprises. However, it is hardly known that this is the case for the conversion from PDF to PDF/A ...
Splitting and merging pages of PDF documents
Single out pages from a number of input documents and re-arrange them in a set of output documents belongs to the daily routine in a document assembly application. At first glance, this seems to be a clear and understand ...
Why is the extraction of text from a PDF document such a hassle
When I use a text editing tool such as Microsoft Word then it is quite natural that I can select a portion of text and copy it to the clipboard and paste it in to a window of any other tool. Not so with PDF. At least not ...
Embedding fonts in PDF
I collect bad PDFs since the Reference Manual 1.0 was published in 1993 and today I have recourse to a data base of more than 100'000 real world PDF files with all kinds of faults in them. The vast majority of problems, ...
TIFF Mixed Raster Content (MRC) conforming to RFC 2301
Mixed Raster Content (MRC) is a process to reduce the size of raster images. It is well known since PDF/A is used to archive scanned documents. However, it has been used and standardized in RFC 2301 for TIFF files earlie ...