Ted Underwood’s most recent book – 206 pages, published in February 2019 – was widely acclaimed and much discussed. Based on the assumption that digital methods and corpora show their strengths especially in diachronic observations over long periods of time, he has provided aspects of a digital literary historiography.
But this “computational monograph” – to quote Maciej Maryl – is not just something to read. Data and code have been published and can be replicated and questioned. This is not always easy and is not yet a living part of our practice in the Digital Humanities.
That’s why this blogpost attempts to bring together the REPLICATIONS made so far to put Underwood’s technical setup to the test. Such efforts to verify scientific claims should be encouraged more and can be exemplary for similar undertakings in the future.
I’m also gathering all REVIEWS of the book I found (as neither the publisher’s website nor the author’s personal page offers such an overview). If you have any additions, please lemmino (you can find me on Twitter).
Fig. 1. Ted Underwood: Distant Horizons. Digital Evidence and Literary Change. Chicago and
London: The University of Chicago Press 2019. (doi:10.7208/chicago/9780226612973.001.0001).
Underwood’s book has a somewhat mediating character. The machine-learning methods used for the analyses are not bleeding edge, he mostly relies on regularised logistic regression on several thousand features. Underwood refutes the claim that machine-learning models are always inscrutable black boxes. He spends a lot of time looking into these black boxes and interpreting the word frequencies and their distribution in order to craft narratives relevant to literary studies.
But we are still at the beginning, as Underwood makes clear: “Statistical models can help us understand the social forces shaping a long arc of literary change. But turning those models into fully satisfying stories could take several more decades.” (p. 109)
I originally wanted to write a review myself, but never found the time to finish it (and finally got discouraged late last year when I encountered Maciej Maryl’s excellent and comprehensive review 😊). The following list of reviews I have compiled may serve to better sort through the rich discussions surrounding “Distant Horizons” and find good entry points.
REVIEWS
- Dan Sinykin, Post45 (6 May 2019)
- Tess McNulty responds to his review, Post45 (6 May 2019)
- Mareike Schumacher, lebelieberliterarisch.de (25 Jul 2019) 🇩🇪 – followed by a comment from Fotis Jannidis
- Laura Jiménez Ríos, Revista de Humanidades Digitales, vol. 4 (2019), pp. 212–215 🇪🇸
- Daniel Rosenberg, Modern Philology, vol. 117, no. 3 (Feb 2020), publ. online 22 Nov 2019
- Katherine Bode, Modern Language Quarterly, vol. 81, issue 1 (1 Mar 2020), pp. 95–124 (preprint) – a more critical take
- Alison Booth, Victorian Studies, vol. 62, no. 3 (Spring 2020), pp. 547–550 – also somewhat critical
- Marie-Christine Boucher, KULT_online, no. 61 (April 2020)
- Maciej Maryl, JLTonline (17 Oct 2020) – assigns a new genre to the book, calling it a “computational monograph”
REPLICATIONS OF CODE AND/OR DATA
When Underwood submitted his book to the publisher, he published data and code in a ZIP package on Zenodo on 25 March 2018 (doi:10.5281/zenodo.1207277). There’s also a (slightly updated) Git Hub repo.
Portions of chapters 2, 3 and 4 are reprints of prior collaboratory work. This is also the reason why most of the listed replications could take place before the book even went to press.
- Andrew Goldstone, “Of Literary Standards and Logistic Regression: A Reproduction”, andrewgoldstone.com, 4 Jan 2016
- Sarah Allison wrote a short piece reflecting on Goldstone’s replication: “Other people’s data: humanities edition”, Journal of Cultural Analytics, vol. 1, issue 1 (18 Dec 2016)
- Jonathan Goodwin, “Darko Suvin’s Genres of Victorian SF Revisited”, jgoodwin.net, 17 Oct 2016
- Nan Z. Da, “The Computational Case against Computational Literary Studies”, vol. 45, no. 3 (Spring 2019) – among a total of 14 CLS papers, she also discusses Underwood’s “The Life Cycle of Genres”, which expanded into chapter 2 of “Distant Horizons”
- followed by Underwood’s reaction in CI’s response forum, critinq.wordpress.com, 1 Apr 2019
- and Da’s follow-up, “Errors”, critinq.wordpress.com, 2 Apr 2019
- Ted Underwood’s reply to Da, “The Theoretical Divide Driving Debates about Computation”, Critical Inquiry, vol. 46, no. 4 (Summer 2020), which re-analyses material in chapter 2 using different methods