This report describes the year 3 activities of the SCAPE project in the Characterisation Components work package and presents an evaluation of format identification tools for execution in a parallelised MapReduce environment. We report two general solutions that complement each other, each with its own advantages and drawbacks. We present a solution to the problem of different tools giving different results on the same data. We discuss the concept of policy-driven validation of digital objects according to an institutional preservation policy and give reference to a concrete proof-of-concept solution. We present an evaluation of deploying Apache Tika and DROID on the SCAPE Azure platform as an alternative to the general SCAPE Execution Platform. Finally, we present a research project on extracting semantic information from web-based text corpora and discuss how such a system could be utilised by the digital preservation community.