Franco Moretti‘s work on “distant reading” (or macroanalysis) has been very inspiring as we have explored what kinds of research we can do using the ELMCIP Electronic Literature Knowledge Base, especially his book Graphs, Maps, Trees: Abstract Models for a Literary History. Moretti generously supported a grant application for developing visualizations of the data in the Knowledge Base last year (we didn’t get the grant, but are still working on it) and we were eager to visit the Stanford Literary Lab, which is led by Moretti and Matthew Jockers, to learn more about their work and hear some of their thoughts about our Knowledge Base.

After a beautiful drive up the Californian coastline from Santa Barbara we arrived at Stanford on Thursday afternoon and easily found the Literary Lab.

Untitled

The door of the Stanford Literary Lab

Matthew Jockers’ work is primarily on text mining of literary texts, and he immediately starting thinking about how and whether one could do text mining of electronic literature. This is something Scott and Jeremy Douglass also discussed earlier this week in San Diego. A problem is that works of electronic literature are very heterogenous in comparison to most traditional literary genres (Jockers noted that this is the case with modernism, too, which is why there is very little digital humanities work in the field, other than on individual authors) making it difficult to draw patterns out of large quantities of texts. On the other hand, as Scott and Jeremy Douglass had discussed, and Scott and Andrew Salway have discussed in Bergen, it would be possible to automatically extract and compare particular code structures, functions, use of visuals and so on. There are also genres of electronic literature that are (or have been) relatively stable, such as Flash poetry, interactive fiction or hypertext narratives.

Members of the Stanford Literary Lab listening to our presentation of the Knowledge Base. Matthew Jockers is on the left, Franco Moretti on the right, and grad student Ben Allen is next to Moretti.

But the ELMCIP Knowledge Base as it stands now is primarily focused on metadata rather than the texts themselves. We show the connections between creative works and events they were presented at, critical writing they are discussed by, organizations with which their authors are affiliated, journals in which they were published and so on, and being able to visualize all this in a social network graph would be immensely useful. And although the works themselves might be characterized as having “extreme morphological variation”, as Jockers or Moretti said, the network of connections is homogenous. For instance, as Moretti put it, all the authors in our database are humans, most are alive, and many live in Europe or the US.

Jockers suggested that simply showing a graph view where different categories of node were displayed in different colours (blue for creative works, green for critical works and so forth) would be useful. We’d also like to be able to see how nodes would cluster. How important are geographical locations in the community, for example?

Another way of viewing the data is in a timeline. When he saw our rather simple graph of when creative works in the Knowledge Base were published, Moretti immediately jumped up to take a closer look. 

“There’s not really a long enough time span to identify trends,” he said, “but would you say that there was a steep increase and then a stabilization?” The problem with our data, of course, is that we don’t really know how representative it is, other than that we most certainly do not have an entry for each actual work of electronic literature in the world. The data is skewed by the interests of contributors and by our knowledge. For instance, we know there is a great deal of Brazilian electronic literature but we have almost none of it registered in the Knowledge Base. Moretti pointed out that with enough data over time, you can link literary trends to events like wars or social upheaval, but you can’t really do that with a fifteen year time span.

Moretti did suggest comparing this graph to the publication of critical works, which we’ll certainly try next. Graphs like these are easy to generate by dumping a spreadsheet from the Knowledge Base and playing with it in Google Fusion or even Excel.

But if trends are difficult to track with such a brief timeframe, Moretti suggested that typologies would be easier to spot. While trends link literary history to social events outside of the texts themselves, typologies are internal to the data, and allow us to identify genres and themes. You need to work differently with the data with typologies than with trends, being much more precise.

Stanford Literary Lab mostly researches 19th century literature, because that is what they have good corpuses for. Until recently, we haven’t had access to corpuses or metadata for electronic literature, because there have been no systems for documenting these works. Libraries haven’t known what to do with it, apart from the few works that are published on disk or CD-ROM with an ISBN number. Hopefully the existence of data in the ELMCIP Knowledge Base will allow future scholars to engage more with electronic literature as well as the classics.


Discover more from Jill Walker Rettberg

Subscribe to get the latest posts sent to your email.

Leave A Comment

Recommended Posts

Top of a ransom note from Shinyhunters hacking group. Text reads: "SHINYHUNTERS rooting your systems since '19 ;) ShinyHunters has breached Instructure (again). Instead of contacting us to resolve it they ignored us and did some "security patches"."
Networked Politics University politics

UiB self-hosts the open source version of Canvas, so wasn’t affected by the breach

On May 1st Canvas announced a security breach, and then yesterday the system was hacked. The login page was replaced by a ransom note: if universities don’t pay up by 12 May, student data will be released. Here’s what the login page looked like yesterday: Way back in 2015, when […]

AI and algorithmic culture Networked Politics

AI-generated images, fascist aesthetics: Dieselbrølet and Heimatstrom

My German is pretty dodgy, so when I first saw Heimatstrom on Bluesky, shared by Roland Meyer, a professor of visual culture at Universität Zürich’s Digital Society Initiative, I misinterpreted it and thought it was a far-right campaign. But no, Heimatstrom is a group of left-wing environmentalists using fascist AI […]

Photo of a billboard ad at Oslo S train station showing a smiliing conductor and the text "Du må ikke sove. Joda, bare sov du."
AI STORIES

“Du må ikke sove”: a floating motif detached from its meaning (or: LLMs can write Norwegian but miss cultural references)

There’s a new ad for the train between Stavanger and Oslo in Norway that uses a line from Arnulf Øverland’s famous anti-fascist poem Du må ikke sove (“You must not sleep”). Du må ikke sove, you must not sleep, the ad says. And then it flips it, jovially, joda, bare […]

Academics in Norway: Sign this petition asking for research-based discussions of how to use AI in universities

I just signed a petition calling for Norwegian universities to use research expertise on AI when deciding how to implement it, rather than having decisions be made mostly administratively. ,  If you are a researcher in Norway, please read it and sign it if you agree – and share with anyone else who might be interested. The petition was written by three researchers at UiT: Maria Danielsen (a philosopher who completed her PhD in 2025 on AI and ethics, including discussions of art and working life), Knut Ørke (Norwegian as a second language), and Holger Pötzsch (a professor of media studies with many years of research on digital media, video games, disruption, and working life, among other topics).  This is not about preventing researchers from exploring AI methods in their research. It is about not uncritically accepting the hype that everyone must use AI everywhere without critical reflection. It is about not introducing Copilot as the default option in word processors, or training PhD candidates to believe they will fall behind if they do not use AI when writing articles, without proper academic discussion. Changes like these should be knowledge-based and discussed academically, not merely decided administratively, because they alter the epistemological foundations of research. Maria wrote to me a couple of months ago because she had read my opinion piece in Aftenposten in which I called for a strong brake on the use of language models in knowledge work. She was part of a committee tasked with developing UiT’s AI strategy and was concerned because there was so much hype and so few members of the committee with actual expertise in AI. I fully support the petition. There are probably some good uses for AI in research, but the uncritical, hype-driven insistence that we must simply adopt it everywhere is highly risky. There are many researchers in Norway with strong expertise in AI, language, ethics, working life, and culture. We must make use of this expertise. This is also partly about respect for research in the humanities, social sciences, psychology, and law. Introducing AI at universities and university colleges is not merely a technical issue, and perhaps not even primarily a technical one. It concerns much more: philosophy of science, methodological reflection, epistemology, writing, publishing, the working environment, and more. […]

screenshot of Grammarly - main text in the middle, names of experts on the left with reccomendations and on the right more info about the expert review feature
AI and algorithmic culture Teaching

Grammarly generated fake expert reviews “by” real scholars

Grammarly is a full on AI plagiarism machine now, generating text, citations (often irrelevant), “humanizing” the text to avoid AI checkers and so on. If you’re an author or scholar, they also have been impersonating and offering “feedback” in your name. Until yesterday, when they discontinued the Expert Review feature due to a class action lawsuit. Here are screenshots of how it worked.