Extra Practice


Carry out a keyword analysis of the Catholic corpus (in the directory "religion\catholic").

Before you begin, which words would you expect to occur in the Catholic corpus but not in FLOB?

  1. Make a wordlist for the Catholic corpus (name it "cath.lst") and compare it with the FLOB wordlist.
  2. Batch-process the texts in the Catholic corpus and produce a list of key keywords.
  3. Next, compare the wordlists of the Baptist corpus ("baptist.lst") and the Catholic corpus ("cath.lst").
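
To make the idea behind steps 1 and 2 concrete, here is a minimal Python sketch of a keyness calculation. It is not the procedure of the keyword tool used in this exercise: it assumes a log-likelihood statistic (a common choice for keyness), a cut-off of 6.63 (roughly p < 0.01), and wordlists held as plain frequency counters. The function names `log_likelihood`, `keywords` and `key_keywords`, and the `min_texts` parameter, are all hypothetical.

```python
import math
from collections import Counter


def log_likelihood(a, b, c, d):
    """Log-likelihood keyness of one word.

    a, b: frequency of the word in the study and reference corpus.
    c, d: total number of tokens in the study and reference corpus.
    """
    # Expected frequencies if the word were equally common in both corpora.
    e1 = c * (a + b) / (c + d)
    e2 = d * (a + b) / (c + d)
    ll = 0.0
    if a > 0:
        ll += a * math.log(a / e1)
    if b > 0:
        ll += b * math.log(b / e2)
    return 2 * ll


def keywords(study, reference, threshold=6.63):
    """Return {word: score} for words unusually frequent in `study`.

    The threshold is an assumed cut-off, not a setting taken from the exercise.
    """
    c = sum(study.values())
    d = sum(reference.values())
    result = {}
    for word, a in study.items():
        b = reference.get(word, 0)
        score = log_likelihood(a, b, c, d)
        # Keep positive keywords only: relatively more frequent in the study corpus.
        if score >= threshold and a / c > b / d:
            result[word] = score
    return result


def key_keywords(texts, reference, min_texts=2):
    """Count in how many individual texts each word is key.

    `texts` is a list of token lists, one per text; a word counts as a
    key keyword here if it is key in at least `min_texts` texts.
    """
    counts = Counter()
    for tokens in texts:
        counts.update(keywords(Counter(tokens), reference).keys())
    return {word: n for word, n in counts.items() if n >= min_texts}
```

In this sketch a key keyword is simply a word that turns out to be key in several texts of the same corpus, which is why step 2 processes the texts one by one rather than as a single wordlist.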

Which words were key in the Catholic corpus but not in the Baptist corpus, and vice versa?
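
One way to approach this question, assuming you already have a list of keywords for each corpus (for example, each obtained against FLOB), is to treat the two lists as sets and take their differences. The sketch below is only an illustration of that idea; the function name `compare_key_lists` is hypothetical and the words in the usage example are placeholders, not results from either corpus.

```python
def compare_key_lists(cath_keys, baptist_keys):
    """Split two keyword lists into corpus-specific and shared words."""
    cath = set(cath_keys)
    baptist = set(baptist_keys)
    return {
        "catholic_only": sorted(cath - baptist),  # key in Catholic, not in Baptist
        "baptist_only": sorted(baptist - cath),   # key in Baptist, not in Catholic
        "shared": sorted(cath & baptist),         # key in both corpora
    }


# Placeholder words only, to show the shape of the output.
example = compare_key_lists(["alpha", "beta", "gamma"], ["beta", "delta"])
print(example)
```

The shared words are worth a look as well, since they point to vocabulary that distinguishes both religious corpora from general English rather than from each other.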