Navigating the PDF ecosystem

Peter Wyatt, CTO of the PDF Association, will walk through the many free technical resources the Association provides. Understand: the roadmap and brief history across all PDF technical resources; how the technical resources are organized at https://pdfa.org/resources and on GitHub at https://github.com/pdf-association; where to locate specific kinds of technical information; why …

Deriving HTML from PDF – lessons learned

Two years after introducing the Deriving HTML from PDF document and implementing its core concept, and after processing countless authored and un-authored PDF files, we will share our experiences. To adopt the idea successfully, developers need to understand the implementation challenges, and authors have to change their habits in producing PDF files. We will discuss gaps …

Accessible PDF – How to tag content the right way

More and more PDFs need to be accessible and therefore PDF/UA compliant. Today it is technically possible to generate such files out of many tools without the tricks and hacks used years ago. But using such tools often poses one big challenge: how to tag content the right way? The PDF standard offers a set of …

Capturing the richness of page description languages

How can you carry over the attributes expressed in a non-PDF format into a PDF so that their original intent is not lost, especially when that detail may be required at another stage in the workflow? This talk looks at how the detail of a document can be retained in the most terse and efficient …

Automate office files to PDF

Most office files are created with Microsoft Office, and the quality of a conversion is defined by how closely the result matches what this suite creates, at least in visual appearance. What limitations do alternative converters have for Word, Excel or PowerPoint documents? What about emails created or received with Outlook? What …

Robotic Process Automation and PDF

Robotic Process Automation (RPA) uses software robots to mimic human actions and automate everyday tasks. This session will explain the role that PDF plays in the range of solutions that may be classified as Robotic Process Automation and how RPA fits into the wider spectrum of automation. Use cases will be discussed and demonstrated. Particular …

Support of complex scripts in PDF

Complex writing systems have always required special attention. Examples of such complex scripts are Arabic, Devanagari and Thai, but there are many more. In the case of the PDF graphics model, there are two key challenges when processing text in complex scripts: how to shape the correct visual representation out of glyphs and subglyphs, often …

Generating well-tagged PDF documents

The talk is devoted to typical problems in, and possible solutions for, generating PDF/UA compliant (or at least well-tagged) PDF documents from authoring applications such as rich-layout WYSIWYG text editors, web browsers, and others. We touch on some ambiguous PDF/UA requirements and the technical challenges one faces when converting declarative HTML-like documents to Tagged PDF. This …