This is fun: brand new words invented yesterday. The Norwegian Newspaper Corpus scrapes content from the online editions of Norwegian newspapers, matches every word against the words already recorded in this and other electronic corpuses (corpi?) they have access to, and lists the words that have never been used before. Norwegian, like German, uses compound words, so while the English “jazz festival” is two words, in Norwegian it’s “jazzfestival”, which means lots of the new words are freshly created compounds. Some are misspellings, others are anglicisms. (From Gisle Andersen’s presentation; this is a Bergen-based project.)
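
Just to make the idea concrete, here is a minimal sketch of that kind of new-word check. It is not the project's actual code: the function name `find_new_words`, the simple letters-only tokenizer, and the tiny `known` word list are all assumptions made up for illustration.

```python
import re


def find_new_words(text, known_words):
    """Return words in `text` that are not in `known_words`.

    Hypothetical helper: words are lowercased, and each new word is
    reported once, in the order it first appears.
    """
    # Runs of letters only (no digits or underscores); handles æ, ø, å.
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    new_words = []
    seen = set()
    for token in tokens:
        if token not in known_words and token not in seen:
            seen.add(token)
            new_words.append(token)
    return new_words


# Made-up stand-in for the words already recorded in earlier corpora.
known = {"jazz", "festival", "i", "bergen", "er"}
print(find_new_words("Jazzfestival i Bergen er jazzfestival", known))
```

Because the compound “jazzfestival” is written as a single token, it gets flagged as new even though “jazz” and “festival” are both already known, which is why compounds would dominate a list like this.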


6 thoughts on “words invented yesterday”

  1. Lars

    Hooray! One of yesterday’s new words is from Finnmarken: Helnestrål. Unfortunately, it’s just a boat name, and an old one at that. Guess we have to try harder.

  2. Jill

    Ooh, yes, game the system!!! Great idea!

  3. lesley

    corpora?

  4. […] ng it from online newspapers. Working with Knut Hofland here in Bergen. Generates lists of new words. Material is used by lexicographers, good tool for studying emergence and spread of new words. Political eve […]

  5. Jamie

    The plural of corpus is corpora (cuz u asked)

  6. Stephen Shimanek

    Longman gives stress on the first syllable in English, something I can’t seem to get into my head.
