ISO TC 171 SC 2 and TC 130: a summary

A survey of the major developments at “the ISO table” since the last in-person event in 2019. This session can help attendees to identify other PDF Days Europe 2020 sessions pertaining to specific technologies. This PDF Association session is presented by the Vice Chair, ISO Liaison Officer and Managing Director of callas software, Dietrich von Seggern … Read more

The future: re-imagining the best possible definition of PDF

The PDF specification is critical to the PDF technology ecosystem and its stakeholders. “The spec” is the “law” that defines what a valid PDF file must be. While PDF does not have a reference implementation, the common practice of relying on reverse-engineering the work of others, or examining PDFs “from the wild” tends to lead … Read more

Navigating the PDF ecosystem

The PDF Association’s CTO, Peter Wyatt, will walk through the many and various free technical resources provided by the PDF Association. Understand: the roadmap and brief history across all PDF technical resources; how the technical resources are organized and arranged at https://pdfa.org/resources and in GitHub https://github.com/pdf-association; where to locate specific kinds of technical information; why … Read more

Email Archiving in PDF

Email is one of the most ubiquitous and widely disseminated, and also one of the least preserved and difficult to manage, of electronic document formats.  While the  preservation community has developed email-specific tools and preservation pathways, PDF provides a new and scalable approach that can be used to archive email messages, folders, and even accounts. … Read more

Making more sense of PDF structures in the wild at scale

This is a follow-on talk from our 2021 PDF Days presentation on the File Observatory. Our team built the File Observatory to support Defense Advanced Research Projects Agency (DARPA)’s SafeDocs program by enabling parser developers to understand features of PDFs in the wild at scale. In the first part of our presentation, we’ll offer an overview … Read more

10 Years of PDF/A-3 Based Electronic Invoicing

Almost 10 years ago, ZUGFeRD 1.0, the first invoice data format based on the public standards UN/CEFACT CII and PDF/A-3, was published. The stated intention was to digitize invoice exchange and make the transition from paper to data for SMEs and single users as smooth as possible without losing efficiency. The idea of using PDF/A-3 … Read more

Smart Legal Contracts and PDF

Smarter contracts and blockchain technologies are already transforming business processes across commercial, legal and financial sectors and yet “We still approach contracts in much the same way as we have done for centuries. Decades old software such as PDF and Microsoft Word is used to create and store agreements in a digital format, but they … Read more

How document understanding can leverage your PDF workflow

Document understanding is a constantly addressed topic and has become on top of the scene these last years with Deep Learning and NLP evolution. The PDF format is by nature unstructured, which implies sophisticated processes to extract and qualify information from such documents. In this presentation, we will discuss four ways to address challenges brought … Read more

Leveling up RichMedia

The RichMedia Annotation has been embraced by the 3D PDF community for moving forward with PDF 2.0; But exploiting the potential for Audio and Video RichMedia Content has proven more challenging.  We will discuss the several generations of embedded multimedia mechanisms in the PDF specifications,  the current (sad) state of multimedia support in the PDF … Read more