Lancaster University Department of Linguistics and Modern English Language

Extra Practice


Do a keyword analysis for the Catholic corpus (under the directory "religion\catholic").

Before you begin, which words would you expect to occur in the Catholic corpus but not in FLOB?

  1. Make a wordlist for the Catholic corpus (name it "cath.lst") and compare it with FLOB.
  2. Batch-process the texts in the Catholic corpus and produce key keywords.
  3. Next, compare the wordlists of the Baptist corpus (baptist.lst) and the Catholic corpus (cath.lst).

Which words were key in the Catholic corpus but not in the Baptist corpus, and vice versa?
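
For readers who want to see what sits behind these comparisons, the following is a minimal Python sketch of a keyword analysis. It is not WordSmith's own implementation: it assumes log-likelihood as the keyness statistic (one common choice), a cut-off of 6.63 (p < 0.01 at one degree of freedom), and that only positive keywords are wanted. The function names and all frequency counts are hypothetical and serve only as illustration; real counts would come from your own wordlists.

```python
import math
from collections import Counter


def log_likelihood(a, b, n1, n2):
    """Log-likelihood for a word occurring a times in a study corpus of
    n1 tokens and b times in a reference corpus of n2 tokens."""
    e1 = n1 * (a + b) / (n1 + n2)  # expected frequency in the study corpus
    e2 = n2 * (a + b) / (n1 + n2)  # expected frequency in the reference corpus
    ll = 0.0
    if a > 0:
        ll += a * math.log(a / e1)
    if b > 0:
        ll += b * math.log(b / e2)
    return 2 * ll


def keywords(study, reference, threshold=6.63, min_freq=2):
    """Return {word: score} for words that are unusually frequent in the
    study wordlist compared with the reference wordlist."""
    n1, n2 = sum(study.values()), sum(reference.values())
    keys = {}
    for word, a in study.items():
        if a < min_freq:
            continue
        b = reference.get(word, 0)
        # Keep positive keywords only: relatively more frequent in the study list.
        if a / n1 > b / n2:
            score = log_likelihood(a, b, n1, n2)
            if score >= threshold:
                keys[word] = score
    return keys


def key_keywords(texts, reference, **kwargs):
    """Count, for each word, the number of individual texts in which it is key."""
    counts = Counter()
    for text in texts:
        counts.update(keywords(text, reference, **kwargs).keys())
    return counts


# Hypothetical toy wordlists (invented counts, not taken from any real corpus).
cath = Counter({"the": 500, "church": 50, "mass": 30, "pope": 20})
baptist = Counter({"the": 500, "church": 60, "baptism": 40})
flob = Counter({"the": 5000, "church": 20, "mass": 5})

# Step 1: Catholic wordlist against the reference wordlist.
cath_vs_flob = keywords(cath, flob)

# Step 2: key keywords across (hypothetical) individual Catholic texts.
cath_texts = [
    Counter({"pope": 20, "the": 100}),
    Counter({"pope": 10, "mass": 10, "the": 100}),
]
kk = key_keywords(cath_texts, flob)

# Step 3: the two religious wordlists compared directly, in both directions.
cath_only = sorted(keywords(cath, baptist))
baptist_only = sorted(keywords(baptist, cath))
print(cath_only, baptist_only)
```

With these toy counts, "church" is key against FLOB but drops out when the two religious wordlists are compared with each other, because both use it at a similar rate. This is the kind of contrast the final question asks you to look for.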