Research Output

Palimpsest: improving assisted curation of loco-specific literature.

  Text mining and information visualisation techniques applied to large-scale historical and literary document collections have enabled new types of humanities research. The assumption behind such efforts is often that trends will emerge from the analysis despite errors for individual data points and that noise will be dominated by the signal in the data.  However, for some text analysis tasks, the technology is unable to perform as well as domain experts, perhaps because it does not have sufficient world knowledge or metadata available.  However, the advantage of language processing technology is that it can process at scale, even if not perfectly accurately.  Geo-locating literary works is one example where human expert knowledge is invaluable when it comes to distinguishing between candidate works.  This was the underlying assumption in Palimpsest, an interdisciplinary digital humanities research project on mining literary Edinburgh. From the outset, the project adopted an assisted curation process whereby the automatic processing of large data collections was combined with manual checking to identify literary works set in Edinburgh.  In this article, we introduce the assisted curation process and evaluate how the feedback from literary scholars helped to improve the technology, thereby highlighting the importance of placing humanities research at the core of digital humanities projects.

  • Type:


  • Date:

    11 November 2016

  • Publication Status:


  • DOI:


  • ISSN:


  • Library of Congress:

    AZ History of Scholarship The Humanities

  • Dewey Decimal Classification:

    020 Library & information sciences


Alex, B., Grover, C., Oberlander, J., Thomson, T., Anderson, M., Loxley, J., …Zhou, K. (2016). Palimpsest: improving assisted curation of loco-specific literature. Digital Scholarship in the Humanities, 32(1), 4-16.



digital humanities,

Monthly Views:

Available Documents