Making sense of PDF structures in the wild at scale
PDFs in the wild vary bewilderingly in syntax, features, and structure. For those building or evaluating parsers, it is critical to have a broad-coverage corpus for discovering and assessing the distribution of issues “in the wild” or in specific client document sets. In this talk, our team will present …