Policy-based assessment of EPUB with Epubcheck

13 March 2015

Back in 2012 the KB conducted a first investigation of the suitability of the EPUB format for long-term preservation. The KB will soon start receiving publications in this format, and in anticipation of this, our Collection Care department has formulated a policy on the minimum requirements an EPUB must meet to ensure long-term accessibility. The policy largely follows the recommendations from the 2012 report. This blog post explores to what extent we can automatically assess the EPUBs we receive against our policy, using a combination of the Epubcheck tool and Schematron rules.


Dutch newspaper wipes out articles citing fabricated sources - Internet Archive to the rescue!

06 January 2015

Shortly before Christmas, Dutch daily newspaper Trouw removed 126 articles from its website. These articles were all authored by Perdiep Ramesar, a former journalist of the newspaper. Ramesar had been fired by Trouw in November, after it turned out that many of the sources cited in his articles had been fabricated. The most notorious example was a series of pieces about the so-called “Sharia Triangle”, a neighbourhood in the city of The Hague, which Ramesar claimed was being ruled by Sharia law. As it turned out, this story was largely based on fabricated sources. Nevertheless, it was taken at face value by most major Dutch news outlets at the time, and even prompted a parliamentary debate.

Trouw’s decision to remove the 126 articles overnight was met with considerable criticism. For example, historian Jan Dirk Snel noted that the removal of these articles makes it impossible to check what was wrong with them in the first place. Various other critics accused Trouw of trying to rewrite history.


Perdiep Ramesar in the Internet Archive

28 December 2014

Earlier this week, daily newspaper Trouw removed 126 articles from its website that had been written by dismissed journalist Perdiep Ramesar. The reason was the investigation into the “untraceable” sources that Ramesar had cited. Trouw’s decision to take the unreliable articles off the site met with quite a bit of criticism. Some called it falsification of history. Historian Jan Dirk Snel rightly noted that, now that the pieces have been removed, nobody can check any longer what may or may not be wrong with them.


Demise of the Dutch Blogosphere

13 November 2014

Back in 2006, Dutch weblog Sargasso started following the activity of about 260 Dutch blogs that were active at the time, mainly by looking at the frequency of new postings.


Quattro Pro for DOS: an obsolete format at last?

29 October 2014

While browsing ArchiveTeam’s File Formats Wiki earlier this week, I came across some entries I created there on Quattro Pro spreadsheets two years ago. At the time I had also contributed some old Quattro Pro for DOS spreadsheets (here and here) from my personal archives to the OPF format corpus. Seeing those files again, I decided to spend an afternoon trying to access them using modern-day software. This turned out to be more challenging than expected. It even made me wonder whether, at long last, I had finally run into a case of the much discussed (but rarely observed) phenomenon of format obsolescence. Yes, big words indeed, and if anyone would like to prove me wrong, the comments section below is your friend!


