digital preservation - file formats
WHEEL OUT THE DIGITAL DARK AGE KLAXON!!11!!
This page does not exist (404 error)!
Tags
Android
Towards a preservation workflow for mobile apps
Four Android emulators, two apps
Running archived Android apps on a PC: first impressions
Apache-Preflight
Why PDF/A validation matters, even if you don't have PDF/A - Part 2
Identification of PDF preservation risks: analysis of Govdocs selected corpus
Identification of PDF preservation risks with Apache Preflight: the sequel
Identification of PDF preservation risks with Apache Preflight: a first impression
Apache-Tika
Extracting text from EPUB files in Python
PDF processing and analysis with open-source tools
Towards a preservation workflow for mobile apps
Top 50 file formats in the KB e-Depot
APK
Towards a preservation workflow for mobile apps
Debian
Adventures in Debian packaging
Update on jpylyzer
digital-dark-age
How to preserve your personal Twitter archive
Wheel Out the Digital Dark Age Klaxon!
digital-preservation-day
Wheel Out the Digital Dark Age Klaxon!
disk-imaging
Writing yet another workflow tool for imaging portable media
Identification of physical storage media and devices with Python and the Windows API
A simple disk imaging workflow tool
diskimgr
A simple disk imaging workflow tool
DNS
Moving my Internet domains
DROID
Why can't we have digital preservation tools that just work?
Evaluation of identification tools: first results from SCAPE
Improved identification of XML: a Python experiment
e-depot
Top 50 file formats in the KB e-Depot
emulation
Four Android emulators, two apps
Running archived Android apps on a PC: first impressions
EPUB
Extracting text from EPUB files in Python
ISO/IEC TS 22424 standard on EPUB3 preservation
Valid, but not accessible: crazy fixed EPUB layouts
The future of EPUB? A first look at the EPUB 3.1 Editor’s draft
Policy-based assessment of EPUB with Epubcheck
EPUB for archival preservation: an update
EPUB for archival preservation
EPUBCheck
Policy-based assessment of EPUB with Epubcheck
EPUB for archival preservation: an update
ExifTool
Multi-image TIFFs, subfiles and image file directories
PDF processing and analysis with open-source tools
Fido
Why can't we have digital preservation tools that just work?
Evaluation of identification tools: first results from SCAPE
Improved identification of XML: a Python experiment
FITS
Why can't we have digital preservation tools that just work?
Evaluation of identification tools: first results from SCAPE
FLAC
Breaking WAVEs (and some FLACs too)
floppy-disks
Writing yet another workflow tool for imaging portable media
Identification of physical storage media and devices with Python and the Windows API
Offline digital data carriers in the KB deposit collection
A simple disk imaging workflow tool
format-identification
Towards a preservation workflow for mobile apps
Top 50 file formats in the KB e-Depot
Magic editing and creation: a primer
Evaluation of identification tools: first results from SCAPE
Improved identification of XML: a Python experiment
format-validation
VeraPDF parse status as a proxy for PDF rendering: experiments with the Synthetic PDF Testset
geodata
Mapping the Dutch web domain
Web domain geolocation and spatial analysis with QGIS
GitHub-Pages
Moving my Internet domains
GW-BASIC
A prototype JP2 validator and properties extractor
HFS
Introducing Isolyzer 1.4
Update on Isolyzer: UDF, HFS+ and more!
Imaging CD-Extra / Blue Book discs
High-Sierra
Introducing Isolyzer 1.4
ImageMagick
Multi-image TIFFs, subfiles and image file directories
PDF processing and analysis with open-source tools
internet
Moving my Internet domains
iOS
Towards a preservation workflow for mobile apps
IPA
Towards a preservation workflow for mobile apps
iromlab
Image and Rip Optical Media Like A Boss!
ISO-9660
Introducing Isolyzer 1.4
Update on Isolyzer: UDF, HFS+ and more!
Imaging CD-Extra / Blue Book discs
Detecting broken ISO images: introducing Isolyzer
Preserving optical media from the command-line
isolyzer
Introducing Isolyzer 1.4
A simple workflow tool for imaging optical media using readom and ddrescue
Update on Isolyzer: UDF, HFS+ and more!
Imaging CD-Extra / Blue Book discs
Detecting broken ISO images: introducing Isolyzer
JHOVE
Multi-image TIFFs, subfiles and image file directories
VeraPDF parse status as a proxy for PDF rendering: experiments with the Synthetic PDF Testset
Identification of PDF preservation risks with VeraPDF and JHOVE
PDF processing and analysis with open-source tools
Breaking WAVEs (and some FLACs too)
Why can't we have digital preservation tools that just work?
A simple JP2 file structure checker
JHOVE2
Why can't we have digital preservation tools that just work?
Evaluation of identification tools: first results from SCAPE
JP2
Generating lossy access JP2s from lossless preservation masters
Jpylyzer 2015 round-up
Response to report on JPEG 2000 expert round table
Six ways to decode a lossy JP2
Jpylyzer software finalist for digital preservation award
Optimising archival JP2s for the derivation of access copies
ICC profiles and resolution in JP2: update on 2011 D-Lib paper
Automated assessment of JP2 against a technical profile
Update on jpylyzer
Jpylyzer documentation
A prototype JP2 validator and properties extractor
A simple JP2 file structure checker
Paper on JPEG 2000 for preservation
Ensuring the suitability of JPEG 2000 for preservation
jpeg-2000
Generating lossy access JP2s from lossless preservation masters
Jpylyzer 2015 round-up
Response to report on JPEG 2000 expert round table
Six ways to decode a lossy JP2
Jpylyzer software finalist for digital preservation award
Optimising archival JP2s for the derivation of access copies
ICC profiles and resolution in JP2: update on 2011 D-Lib paper
Automated assessment of JP2 against a technical profile
Update on jpylyzer
Jpylyzer documentation
A prototype JP2 validator and properties extractor
A simple JP2 file structure checker
Paper on JPEG 2000 for preservation
Ensuring the suitability of JPEG 2000 for preservation
jpylyzer
Generating lossy access JP2s from lossless preservation masters
Jpylyzer 2015 round-up
Jpylyzer software finalist for digital preservation award
Adventures in Debian packaging
Automated assessment of JP2 against a technical profile
Update on jpylyzer
Jpylyzer documentation
A prototype JP2 validator and properties extractor
A simple JP2 file structure checker
magic
Magic editing and creation: a primer
Microsoft
Does Microsoft OneDrive export large ZIP files that are corrupt?
omimgr
A simple workflow tool for imaging optical media using readom and ddrescue
OneDrive
Does Microsoft OneDrive export large ZIP files that are corrupt?
optical-media
Identification of physical storage media and devices with Python and the Windows API
Introducing Isolyzer 1.4
Offline digital data carriers in the KB deposit collection
A simple workflow tool for imaging optical media using readom and ddrescue
Resurrecting the first Dutch web index: NL-menu revisited
Update on Isolyzer: UDF, HFS+ and more!
Image and Rip Optical Media Like A Boss!
Imaging CD-Extra / Blue Book discs
Detecting broken ISO images: introducing Isolyzer
Breaking WAVEs (and some FLACs too)
Preserving optical media from the command-line
packaging
Adventures in Debian packaging
Update on jpylyzer
PDF
VeraPDF parse status as a proxy for PDF rendering: experiments with the Synthetic PDF Testset
Identification of PDF preservation risks with VeraPDF and JHOVE
On The Significant Properties of Spreadsheets
PDF processing and analysis with open-source tools
Policy-based assessment with VeraPDF - a first impression
PDF/A as a preferred, sustainable format for spreadsheets?
Why PDF/A validation matters, even if you don't have PDF/A - Part 2
Why PDF/A validation matters, even if you don't have PDF/A
When (not) to migrate a PDF to PDF/A
Identification of PDF preservation risks: analysis of Govdocs selected corpus
Identification of PDF preservation risks with Apache Preflight: the sequel
What do we mean by "embedded" files in PDF?
Identification of PDF preservation risks with Apache Preflight: a first impression
PDF – Inventory of long-term preservation risks
preservation-risks
Multi-image TIFFs, subfiles and image file directories
Identification of PDF preservation risks with VeraPDF and JHOVE
On The Significant Properties of Spreadsheets
PDF processing and analysis with open-source tools
ISO/IEC TS 22424 standard on EPUB3 preservation
Does Microsoft OneDrive export large ZIP files that are corrupt?
Why PDF/A validation matters, even if you don't have PDF/A - Part 2
Why PDF/A validation matters, even if you don't have PDF/A
Measuring Bigfoot
Assessing file format risks: searching for Bigfoot?
PDF – Inventory of long-term preservation risks
EPUB for archival preservation
python
Extracting text from EPUB files in Python
Identification of physical storage media and devices with Python and the Windows API
Quattro-Pro
Quattro Pro for DOS: an obsolete format at last?
rant
Why can't we have digital preservation tools that just work?
schematron
Policy-based assessment with VeraPDF - a first impression
Why PDF/A validation matters, even if you don't have PDF/A - Part 2
Policy-based assessment of EPUB with Epubcheck
Automated assessment of JP2 against a technical profile
Siegfried
Towards a preservation workflow for mobile apps
significant-properties
On The Significant Properties of Spreadsheets
spreadsheets
On The Significant Properties of Spreadsheets
PDF/A as a preferred, sustainable format for spreadsheets?
Quattro Pro for DOS: an obsolete format at last?
tapeimgr
Recovering '90s Data Tapes - Experiences From the KB Web Archaeology project (iPres 2019 paper)
Roll the tape - recovering '90s data tapes in BitCurator
tapes
Identification of physical storage media and devices with Python and the Windows API
Offline digital data carriers in the KB deposit collection
Recovering '90s Data Tapes - Experiences From the KB Web Archaeology project (iPres 2019 paper)
Roll the tape - recovering '90s data tapes in BitCurator
TIFF
Multi-image TIFFs, subfiles and image file directories
On The Significant Properties of Spreadsheets
Twitter
How to preserve your personal Twitter archive
UDF
Introducing Isolyzer 1.4
Update on Isolyzer: UDF, HFS+ and more!
Imaging CD-Extra / Blue Book discs
unix-file
Towards a preservation workflow for mobile apps
Magic editing and creation: a primer
Evaluation of identification tools: first results from SCAPE
Improved identification of XML: a Python experiment
VeraPDF
VeraPDF parse status as a proxy for PDF rendering: experiments with the Synthetic PDF Testset
Identification of PDF preservation risks with VeraPDF and JHOVE
PDF processing and analysis with open-source tools
Policy-based assessment with VeraPDF - a first impression
Why PDF/A validation matters, even if you don't have PDF/A - Part 2
Why PDF/A validation matters, even if you don't have PDF/A
virtualization
Four Android emulators, two apps
Running archived Android apps on a PC: first impressions
WAVE
Breaking WAVEs (and some FLACs too)
web-archaeology
Restoring Liesbet's Virtual Home, a digital treasure from the early Dutch web
Recovering '90s Data Tapes - Experiences From the KB Web Archaeology project (iPres 2019 paper)
A simple disk imaging workflow tool
Roll the tape - recovering '90s data tapes in BitCurator
Crawling offline web content: the NL-menu case
Resurrecting the first Dutch web index: NL-menu revisited
web-archiving
How to preserve your personal Twitter archive
Mapping the Dutch web domain
Restoring Liesbet's Virtual Home, a digital treasure from the early Dutch web
Web domain geolocation and spatial analysis with QGIS
Crawling offline web content: the NL-menu case
Resurrecting the first Dutch web index: NL-menu revisited
Dutch newspaper wipes out articles citing fabricated sources - Internet Archive to the rescue!
Perdiep Ramesar in the Internet Archive
Demise of the Dutch Blogosphere
How to save a web page to the Internet Archive
XS4ALL
Restoring Liesbet's Virtual Home, a digital treasure from the early Dutch web
ZIP
Does Microsoft OneDrive export large ZIP files that are corrupt?
Archive
2024
March
Multi-image TIFFs, subfiles and image file directories
2023
June
VeraPDF parse status as a proxy for PDF rendering: experiments with the Synthetic PDF Testset
May
Identification of PDF preservation risks with VeraPDF and JHOVE
March
Extracting text from EPUB files in Python
February
Moving my Internet domains
January
Writing yet another workflow tool for imaging portable media
2022
November
How to preserve your personal Twitter archive
Wheel Out the Digital Dark Age Klaxon!
June
Identification of physical storage media and devices with Python and the Windows API
April
Introducing Isolyzer 1.4
March
Generating lossy access JP2s from lossless preservation masters
2021
September
On The Significant Properties of Spreadsheets
PDF processing and analysis with open-source tools
February
Towards a preservation workflow for mobile apps
Four Android emulators, two apps
2020
September
Mapping the Dutch web domain
June
Restoring Liesbet's Virtual Home, a digital treasure from the early Dutch web
April
ISO/IEC TS 22424 standard on EPUB3 preservation
March
Does Microsoft OneDrive export large ZIP files that are corrupt?
February
Offline digital data carriers in the KB deposit collection
Web domain geolocation and spatial analysis with QGIS
2019
September
Recovering '90s Data Tapes - Experiences From the KB Web Archaeology project (iPres 2019 paper)
April
A simple disk imaging workflow tool
March
A simple workflow tool for imaging optical media using readom and ddrescue
January
Roll the tape - recovering '90s data tapes in BitCurator
2018
July
Crawling offline web content: the NL-menu case
April
Resurrecting the first Dutch web index: NL-menu revisited
2017
July
Update on Isolyzer: UDF, HFS+ and more!
June
Image and Rip Optical Media Like A Boss!
Policy-based assessment with VeraPDF - a first impression
April
Imaging CD-Extra / Blue Book discs
January
Detecting broken ISO images: introducing Isolyzer
Breaking WAVEs (and some FLACs too)
2016
December
PDF/A as a preferred, sustainable format for spreadsheets?
April
Valid, but not accessible: crazy fixed EPUB layouts
March
The future of EPUB? A first look at the EPUB 3.1 Editor’s draft
2015
December
Jpylyzer 2015 round-up
November
Preserving optical media from the command-line
October
Response to report on JPEG 2000 expert round table
July
Why PDF/A validation matters, even if you don't have PDF/A - Part 2
Why PDF/A validation matters, even if you don't have PDF/A
April
Top 50 file formats in the KB e-Depot
March
Policy-based assessment of EPUB with Epubcheck
January
Dutch newspaper wipes out articles citing fabricated sources - Internet Archive to the rescue!
2014
December
Perdiep Ramesar in the Internet Archive
November
Demise of the Dutch Blogosphere
October
Quattro Pro for DOS: an obsolete format at last?
Running archived Android apps on a PC: first impressions
September
Six ways to decode a lossy JP2
Jpylyzer software finalist for digital preservation award
August
When (not) to migrate a PDF to PDF/A
How to save a web page to the Internet Archive
January
Why can't we have digital preservation tools that just work?
Identification of PDF preservation risks: analysis of Govdocs selected corpus
2013
October
Measuring Bigfoot
September
Assessing file format risks: searching for Bigfoot?
August
Optimising archival JP2s for the derivation of access copies
July
Identification of PDF preservation risks with Apache Preflight: the sequel
ICC profiles and resolution in JP2: update on 2011 D-Lib paper
May
EPUB for archival preservation: an update
April
Adventures in Debian packaging
January
What do we mean by "embedded" files in PDF?
2012
December
Identification of PDF preservation risks with Apache Preflight: a first impression
September
Automated assessment of JP2 against a technical profile
August
Magic editing and creation: a primer
July
PDF – Inventory of long-term preservation risks
June
EPUB for archival preservation
April
Update on jpylyzer
January
Jpylyzer documentation
2011
December
A prototype JP2 validator and properties extractor
September
Evaluation of identification tools: first results from SCAPE
A simple JP2 file structure checker
July
Improved identification of XML: a Python experiment
June
Paper on JPEG 2000 for preservation
2010
December
Ensuring the suitability of JPEG 2000 for preservation