Finding Light in Dark Archives (with AI) – recorded online presentation

We have another update on our AHRC-funded research project on Email Archives: we presented our work to an excellent group of UK and Irish scholars and professionals focusing on AI & Archives.

For more information on AURA and their events follow this link.

Update from the Contextualising Email Archives Research Project

As part of our AHRC-funded collaborative research project on “Historicising the dot.com boom and contextualising email archives”, we have recorded an introduction to our project and its aims. In case you are interested, you can find the presentation here.

Bauhaus digital archive

For those of you interested in archives available online, the Bauhaus Archive has made its digital archive available here: http://open-archive.bauhaus.de/eMuseumPlus?service=ExternalInterface&module=collection&moduleFunction=search

There are also digital history resources available for those interested in the history of the school, which celebrated its centenary in 2019, though the original website has now been changed to https://www.bauhauskooperation.com/ and does not feature quite as much useful historical detail anymore. Hopefully this will come back as they develop the site!

Crowdsourcing digitisation in archives

In another instalment of our digital history series, I wanted to highlight the growing amount of work that involves crowdsourcing. Many of you may have read the media reports that, during the lockdown, a crowdsourcing project to digitise rainfall records barrelled ahead as people enthusiastically engaged. The project is now complete. It does make you wonder what other archival resources might benefit from such an approach.

The National Archives UK has been exploring this recently as well, and the potential for expanding it was really not on my radar at all until recently. An interesting insight into what is happening is provided by a chapter by Alexandra Eveleigh, ‘Crowding out the Archivist?’, in an edited volume on Crowdsourcing our Cultural Heritage. I am not aware of any organization- or business-focused crowdsourcing projects underway, but that may just be my ignorance. Do let us know if this is something that is being explored or ongoing in an archive near you!

AHRC funding for digital business archives research

I am really pleased to announce that, together with a team of investigators including Dr Adam Nix (De Montfort University), Prof David Kirsch (University of Maryland and University of Oxford) and our heritage partners, The National Archives (UK) and the Hagley Museum & Library (USA), we have been awarded funding from the AHRC to investigate how historical researchers may be able to research emails as historical sources, and to use this resource to historicise the dot.com boom from the perspective of a software development company. See below for a description of our new project!

Historicizing the dot.com bubble and contextualizing email archives

Summary

Future researchers will have to engage with emails if they are to understand the lives of those who lived in the late twentieth and early twenty-first centuries. This is particularly true of organizations and their employees, for whom email has become the default form of internal and external communication. As it currently stands, publicly available email archives are rare, and there has been minimal engagement with them as a historical resource. Indeed, one of the most well-known examples, the Enron Email Corpus, only exists because of high-profile legal proceedings that followed the firm’s bankruptcy and has seen minimal historical investigation since its publication. While this is partly due to its comparative recency, the reading of emails as a historical source is a developing practice and requires particular skills and knowledge that are not traditionally associated with historical enquiry. Despite this, archives and other heritage organizations are increasingly collecting and preserving email data and we are fast moving into the period where the events of the 1990s are of historical interest. We believe that our project offers a timely opportunity to address the gap between current efforts to preserve email and the future requirements that will allow them to actually be read and engaged with.

To address this issue, we seek a better understanding of how email archives can be made more accessible for the purposes of historical learning and research. The problem we focus on here is that, while emails offer valuable insight to researchers, a lack of context often presents a challenge to those wishing to understand their content, inter-relationship and wider historical significance. This de-contextualization can represent a barrier to engagement, to both trained historians and general interest users. Furthermore, existing examples of email archives often purposefully remove personal information, further disconnecting emails from their authors, recipients and connection to related material. For these reasons, our project will make an email archive available in such a way that maintains the relational and network properties that emails hold, as these allow individual emails to be understood in terms of their connection to those that precede and follow them. Furthermore, we will bring the historical context back to otherwise de-contextualized data, allowing researchers to interpret isolated items of communication in a way that appreciates the wider historical circumstances in which they were created.

We will address this challenge through a UK-US collaboration between three universities (University of Bristol, De Montfort University, University of Maryland) and two heritage sector partners (The National Archives, UK, and Hagley Museum and Library, US). Through these collaborations, the project will focus on accessioning and re-contextualizing a worked example of an email archive from a failed US software company from the dot.com era, making it available in various forms to suit the diverse requirements of its potential readers. More specifically, the project has three overall work packages that together deliver on the project’s aim and objectives. The first aspect of the project centres around work linking the constituent emails in the archive together to retain the basic network structure of the communications and making relational links to otherwise disconnected emails based on their content. This will be combined with a user interface that allows the whole archive to be searched and read. The second aspect of the project provides a historical case study of the failed US company based on its archive and will require the development of both a narrative explanation of its history and an online platform for public engagement with it. The final package focuses on the project’s legacy and deals with issues of long-term preservation of the archive, description of best practice, and engagement with project stakeholders.
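To give a rough sense of what "retaining the basic network structure" of an email archive could involve, here is a minimal Python sketch. It is an illustration only, not the project's method: it assumes the emails still carry the standard Message-ID and In-Reply-To headers, and the function names and the sample messages are hypothetical.

```python
from collections import defaultdict
from email import message_from_string

# Hypothetical sample messages; a real archive would be read from files.
SAMPLE = [
    "Message-ID: <1@example.com>\nSubject: Launch plan\n\nDraft attached.",
    "Message-ID: <2@example.com>\nIn-Reply-To: <1@example.com>\n"
    "Subject: Re: Launch plan\n\nLooks good.",
    "Message-ID: <3@example.com>\nIn-Reply-To: <2@example.com>\n"
    "Subject: Re: Launch plan\n\nAgreed.",
]


def build_reply_links(raw_messages):
    """Map each parent Message-ID to the Message-IDs of its direct replies."""
    replies = defaultdict(list)
    for raw in raw_messages:
        msg = message_from_string(raw)
        msg_id = (msg.get("Message-ID") or "").strip()
        parent_id = (msg.get("In-Reply-To") or "").strip()
        # Only messages that name a parent contribute a link.
        if msg_id and parent_id:
            replies[parent_id].append(msg_id)
    return dict(replies)


def thread_from(root_id, replies):
    """List the Message-IDs of a conversation, depth-first from its root."""
    ordered = [root_id]
    for child in replies.get(root_id, []):
        ordered.extend(thread_from(child, replies))
    return ordered


links = build_reply_links(SAMPLE)
thread = thread_from("<1@example.com>", links)
```

Here `thread` lists the three sample messages in reply order, so each email can be read alongside those that precede and follow it. Linking otherwise disconnected emails based on their content, as the first work package also describes, would need a different technique and is not covered by this sketch.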