Tag Archives: hadoop

An Open Source Infrastructure for Quality Assurance and Preservation of a Large Digital Book Collection

Sven Schlarb: An Open Source Infrastructure for Quality Assurance and Preservation of a Large Digital Book Collection In: Archiving 2013, Washington, DC; April 2013; p. 234-238; ISBN / ISSN: 978-0-89208-304-6 Abstract This article presents an open source infrastructure for processing large collections … Continue reading

Publications Tagged , , 0

The Elephant in the Library

Clemens Neudecker and Sven Schlarb: The Elephant in the Library In: Hadoop Summit Europe, 20-21 March 2013, Amsterdam, the Netherlands. Abstract: Libraries collect books, magazines and newspapers. Yes, that’s what they always did. But today, the amount of digital information … Continue reading

Publications Tagged Comments Off on The Elephant in the Library

SCAPE & Hadoop

Libraries have to process a rapidly increasing amount of data as part of their day-to-day business and computing tasks like file format migration, text recognition, or the validation of technical metadata require significant computing resources.  Processing very large data sets … Continue reading

News Tagged 0