Invented by Adobe Systems over 20 years ago, the Portable Document Format (PDF) is now an open standard for electronic document exchange maintained by the International Organization for Standardization (ISO). When you convert documents, forms, graphics, and web pages to PDF, they look just like they would if printed.

PDF

Testing PDF Merging: Performance and Content Integrity

We’ve been making progress on improving our PDF merge tool JPedal, and I was curious...
Jacob Collins
2 min read

How to extract text from PDF files as YAML…

This blog post shows you how to convert a PDF to YAML, by extracting marked content with structure tags. What is structured PDF? Some...
Jacob Collins
47 sec read

Choosing the Best Document Viewer for Your Web App

Choosing the right document viewer is a crucial step in developing a web application. The document viewer should be...
Leon Atherton
2 min read

How to convert PDF files to ePUB

Not all PDFs are created equal. Some go beyond simple visual layouts and include internal tags that describe the document’s structure. These are known...
Jacob Collins
1 min read

How to extract text from a PDF as JSON

Some PDF files can be “tagged”, which means they contain information about the structure of the file. This structure is embedded as metadata within...
Jacob Collins
1 min read

How to process PDFs for use with AI (Tutorial)

As artificial intelligence becomes more popular for processing large bodies of text, it is increasingly apparent that PDF files pose a challenge. PDF is a...
Jacob Collins
1 min read