It’s impossible, even in principle, to imagine a world without HTML, isn’t it? Certainly, there are various ways to convey textual content… but textual information will itself be a means…
Does the world still run on paper documents? No… and yes. To be sure, vast quantities of documents and data reside on servers. This material is accessed worldwide, millions of times per second, 24 hours a day. It’s typically delivered from web server to browser in HTML, the core language of the web. We trust… Read more
I’ve been tracking the relative popularity of electronic document file-formats for several years – here’s the February 2014 survey. Along the way I’ve noticed that the .com domain (and at least some top-level country-specific domains) tend to have far higher proportions of HTML files compared with .gov, .edu and .org (see chart). Let’s go ahead… Read more
Last week I wrote about Acrobat’s export to HTML feature, how it was missing from Acrobat X and XI, and how Adobe has made it available once again. Today we’re going to talk about an interesting implementation based on the idea of converting well-tagged PDF to HTML.
Back in December I noticed that a feature in Adobe Acrobat I’d always thought very valuable was now missing: the ability to export tagged PDF to HTML or Word using the document’s structure (tags). What are “tags” in PDF files? Tags are the feature of PDF that provides reading order and semantic structure – headings,… Read more