PDF Optimization horror-stories

Optimizing PDFs to contain just the right amount of structural information can sometimes be quite challenging. We would like to present autopsies of some of the more gruesome PDF Optimization issues that we’ve encountered. Exact case studies TBD, but potentially touching on Font optimization, weird resource trees, weird structure trees, and auditing PDF space usage.

Next Generation Forms for PDF

This session will present a summary of the progress of the forms technical working group. PDF forms require updates in several areas: Better accessibility Support for reflow Reduced reliance on JavaScript Modernization of the technology stack The presentation will elaborate on the motivations behind these changes and will summarize proposals to address each of them.

PDF in Variable Data Printing

In the concept of omni channel marketing, print is one channel amongst others in customer communication. Individualized print requires not only high speed printing machines – it also means that PDFs have to be processed (ripped) as quickly as these machines can print. And for that process the “inner qualities” of PDF files play an … Read more

Tagged and Accessible PDF with LaTeX

In Summer 2020 the LaTeX Project Team announced the start of a multi-year project [1, 2] to produce tagged and accessible PDF from existing  LaTeX sources with no or only minimal configuration adjustments. In this talk we describe the current state of the project, the existing achievements, and our plans for future. References [1] Frank … Read more

New Clay

We all know PDF from daily life. If you were to ask a room full of people whether they’ve come into contact with a PDF document, all would raise their hand. From invoices, receipts, to lab-results (keeping it current for 2020-2021), everyone has seen a PDF firsthand. But how many have seen a PDF that … Read more

How PDF/R helps transforming image capture for mobile and cloud

This presentation will summarize last year’s introduction and adaption of PDF/R and show how it fits into the TWAIN Working Group’s vision of mobile and de-centralized cloud based image capture. It will provide an outlook of the work in progress to include highly compressed images in a PDF/R 1.1 revision in the coming years.

Implementing a PDF Library in Ruby

HexaPDF (https://hexapdf.gettalong.org) is a PDF library written in Ruby. It aims to provide the full spectrum of PDF functionality with the exception of rendering. This presentation focuses on how HexaPDF implements the PDF specification and how Ruby is actually quite a good language for this task. It is said that Ruby, being a scripting language, … Read more

Ideas for interoperable self-updating PDF documents

One criticism that is commonly levelled at PDF is that when looking at a PDF document, the user generally has no easy way of knowing whether the document they’re looking at is current or not. While this isn’t an issue when browsing the web, it can be problematic for users viewing previously downloaded documents. Technical … Read more

Putting a squeeze on your PDF

There are many ways and tools to make a PDF. However, sometimes the resulting document may take up significantly more disk space than you would expect based on its content. It therefore makes sense to try to reduce the size of such PDFs, while preserving the quality of your documents. The goal of this talk … Read more

The PDF Detectives

PDF is one of the most used document formats in the world! This is a fact we should all be proud of. But the downside of this is that PDF files in all their glory and elegance can also be buggy or poorly constructed. Either through malice or ignorance of the specification. This leads to … Read more