Mark Stephens

  https://www.idrsolutions.com Mark founded the company and has worked with Java and PDF since 1997. The original creator of the core code, he is also a NetBeans enthusiast who enjoys speaking at conferences and reading. He holds an Athletics Blue and an MA in Mediaeval History from St. Andrews University.

   



196 Stories by Mark Stephens

Multiple trailers in a PDF file

A PDF file consists of a ‘dump’ of PDF objects and a reference table defining where they are located in the PDF file and...
1 min read

Table order in OTF fonts

As part of our TrueType to OpenType font conversion (we need this for PDF to HTML5 conversion to ensure fonts display on all browsers),...
49 sec read

How to extract Structured text from PDF files in Java (Tutorial)

Developers hoping to extract content from PDF documents whilst maintaining the structure of the text should follow this tutorial. Some (but not all) PDF...
1 min read

Avoid transparency when printing in Java

Java has a print mechanism called Java Print Services. In most cases this works brilliantly, but beware the use of transparency in anything you...
1 min read

How are Embedded CMAP tables defined in a PDF File?

Every glyf inside a PDF file can have a display value and a different extraction value. This is useful because often you need to...
2 min read

Replacing the deprecated Java JPEG classes for Java 7

In the early days of Java, Sun produced a really handy set of classes to handle JPEG images. These included some really nifty little...
1 min read