scholarly journals Migration Performance for Legacy Data Access

2008 ◽  
Vol 3 (2) ◽  
pp. 74-88 ◽  
Author(s):  
Kam Woods ◽  
Geoffrey Brown

We present performance data relating to the use of migration in a system we are creating to provide web access to heterogeneous document collections in legacy formats. Our goal is to enable sustained access to collections such as these when faced with increasing obsolescence of the necessary supporting applications and operating systems. Our system allows searching and browsing of the original files within their original contexts utilizing binary images of the original media. The system uses static and dynamic file migration to enhance collection browsing, and emulation to support both the use of legacy programs to access data and long-term preservation of the migration software. While we provide an overview of the architectural issues in building such a system, the focus of this paper is an in-depth analysis of file migration using data gathered from testing our software on 1,885 CD-ROMs and DVDs. These media are among the thousands of collections of social and scientific data distributed by the United States Government Printing Office (GPO) on legacy media (CD-ROM, DVD, floppy disk) under the Federal Depository Library Program (FDLP) over the past 20 years.

2009 ◽  
Vol 4 (2) ◽  
pp. 184-198 ◽  
Author(s):  
Kam Woods ◽  
Geoffrey Brown

Over the past 20 years, more than 100,000 CD-ROM titles have been published including thousands of collections of government documents and data. CD-ROMs present preservation challenges at the bit level and in ensuring usability of the preserved artifact. We present techniques we have developed to archive and support user access to a collection of approximately 2,900 CD-ROMs published under the Federal Depository Library Program (FDLP) by the United States Government Printing Office (GPO). The project provides web-based access to CD-ROM contents using both migration and emulation and supports remote execution of the raw CD-ROM images. Our project incorporates off-the-shelf, primarily open-source software. The raw data and (METS) metadata are made available through AFS, a standard distributed file system, to encourage sharing among libraries.


2005 ◽  
Vol 34 (2) ◽  

The United States Government Printing Office is at the epicenter of change in the ways humans create and use information to communicate, remain informed, research a topic and preserve a record.


Sign in / Sign up

Export Citation Format

Share Document