Magic editing and creation: a primer 09 Aug 2012

The purpose of this post is to give a brief introduction to creating, editing and submitting format signatures (or ‘magic’ entries) for the well-known File tool. The occasion for this was some work I did last week on improving File’s identification of the JPEG 2000 formats. I had some difficulty finding any easy-to-follow documentation that describes how to do this. The information is all out there, but it’s pretty fragmented. So, I wrote this brief tutorial, which is intended as an accessible introduction to magic editing. It only covers the very basics, but hopefully this is enough to overcome some initial stumbling blocks.

More ...

PDF – Inventory of long-term preservation risks 26 Jul 2012

In this blog post I’ll be dusting off some old stuff for a change. The occasion for this is the following question, posted by Paul Wheatley on the Libraries and Information Science Stack Exchange website a few days ago:

What preservation risks are associated with the PDF file format?

More ...

EPUB for archival preservation 18 Jun 2012

Over the last few years, the EPUB format has gained widespread popularity in the consumer market. The KB has been approached by a number of publishers that wish to use EPUB for delivering some of their electronic publications. Surprisingly little information is available on the format’s suitability for archival preservation, apart from Library of Congress’ Sustainability of Digital Formats web pages, which contain entries on EPUB 2 and EPUB 3.

So, the KB’s Departments of Collection and Collection Care requested a more detailed investigation of EPUB’s preservation credentials. More specifically, answers were needed to the following questions:

  • What are the main characteristics of EPUB?

  • What functionality does EPUB provide, and is this sufficient for representing e.g. content with sophisticated layout and typography requirements?

  • How well is the EPUB supported by software tools that are used in (pre-)ingest workflows?

  • How suitable is EPUB for archival preservation? What are the main risks?

More ...

Update on jpylyzer 23 Apr 2012

Introduction

In this blog post I will give a brief update of the latest jpylyzer developments. Jpylyzer is a validation and feature extraction tool for the JP2 (JPEG 2000 Part 1) still image format.

More ...

Jpylyzer documentation 10 Jan 2012

This will be my shortest blog post ever. Following up on my previous blog post on a prototype JP2 validator and properties extractor (jpylyzer), there is now a comprehensive User Manual of the tool. Just follow the link below:

http://jpylyzer.openpreservation.org/userManual.html

Link to jpylyzer home page:

http://jpylyzer.openpreservation.org/

Meanwhile work on jpylyzer remains ongoing, so watch this space for any updates on this.

Update February 2019: updated links in original blog post


Originally published at the Open Preservation Foundation blog

More ...