For close to two decades, the Bentley Historical Library has actively collected, processed, preserved, and provided access to born-digital archives from the University of Michigan as well as from private individuals and organizations from around the state. These experiences have provided a strong foundation for the planning and implementation work already underway with the grant.
Laying the Foundation (1997-2008)As early as 1979, archivists in the Bentley's University Archives and Records Program (UARP) were discussing the challenges posed by new technologies and machine readable records. A 1991 NHPRC grant, “Study on the Uses of Electronic Communication to Document an Academic Community,” provided an opportunity for the library to explore the topic in more depth. It was not until 1997, however, that the Bentley received its first significant collection of born-digital archives: the Macintosh personal computer of former University of Michigan President James J. Duderstadt.
At the time, Electronic Records Archivist Nancy Deromedi developed a preservation strategy for the approximately 2,100 files in the accession that included running virus scans, documenting file and folder naming conventions, and migrating content from the original MORE 3.1 and Microsoft Word 6.0 file formats to the Word 97 (and later PDF/A) format. Through the late 1990s and early 2000s, the Bentley continued to collect born-digital archives and Deromedi later published accounts of her strategies in a series of SAA Campus Case Studies:
- Gaps and Inconsistencies: Issues in the Dissemination of the University Bulletin at the University of Michigan (Campus Case Study #1)
- Defining and Formalizing a Procedure for Archiving the Digital Version of the Schedule of Classes at the University of Michigan (Campus Case Study #2)
- Generating and Archiving Records in Digital Form of the Promotion and Tenure Process at the University of Michigan (Campus Case Study #3)
These early efforts were instrumental in capturing historical and administrative records of long-term value, but each involved developing unique preservation strategies and relied upon heavily manual procedures. As the university's production of electronic records with archival value increased, the library faced challenges of scalability and sustainability: the Bentley lacked in-house IT staff and extensive technical expertise and Deromedi balanced numerous responsibilities in addition to her work with digital archives.
The planning, development, and implementation work associated with MeMail laid the foundations for the Bentley's current digital curation program. As Technical Lead, I explored software, procedures, and workflows required to ingest and preserve email and attachments (Office files, images, audio, video, etc.). I worked closely with McKay and others to identify rights and access issues associated with acquiring digital content and making it accessible and also developed policies and procedures to address sensitive personal information (SSNs, credit card numbers, etc.). In addition, the project gave us an impetus to review and enhance our infrastructure: we acquired secure server space to store our backlog and conduct ingest procedures and also negotiated for expanded use of Deep Blue, the University of Michigan's DSpace repository (with another copy of material stored in a local dark archive managed by ITS).
Digital Curation Division (2011-2014)
Hoping to overcome these constraints and enable more of our staff to work with digital content, I started to explore the possibility of automating workflow steps. In setting out, I was particularly influenced by the Archivematica digital preservation system and its 'microservice' design, whereby a specific tool is implemented to perform a specific function (and may be swapped out or replaced by another without impacting the rest of the system). After a successful proof of concept in automating our format migration procedures (a step that creates preservation copies of content based upon migration pathways that reflect professional standards and best practices), I set about revising other steps. By early 2012, I had produced the AutomatedProcessor (or AutoPro), a collection of 33 Visual Basic and Windows CMD.EXE shell scripts that moved content through an 11 step workflow.
|AutoPro splash screen|
Working Smarter (2014-)Since its introduction, AutoPro has been used to prepare more than 230 accessions of digital content (approx. 1.2 TB) for deposit in our Deep Blue repository. In helping us to address a growing backlog of digital archives in a standardized manner, the tool has been a smashing success. At the same time, AutoPro was never intended to be a final solution for the Bentley: the command line interface is not particularly intuitive or user friendly, the CMD.EXE scripts have poor error-handling functionality, and maintaining and updating the scripts and software on individual workstations often takes an inordinate amount of time. We also realized that we were entering the same descriptive and administrative metadata in numerous locations: once in our finding aids, again in our processing workflow (so that descriptions of content could be stored alongside materials in the Archival Information Package), and a third time when we manually uploaded material to the Deep Blue DSpace repository.
Given these complications and inefficiencies, Nancy Deromedi and I considered options for more than a year before deciding to explore integrating functionality of Archivematica and ArchivesSpace into a single workflow and automating the deposit of material into DSpace. While the idea of bringing together these systems (especially the former two) has been discussed in various circles for years, the Bentley was fortunate enough to secure grant funding to push development work forward. I've already described our basic goals and strategy in our first post—in the next one, I'll discuss the challenges and progress we've encountered thus far. Stay tuned!