4. Data sources

The tables linked in this section list the sources of the text samples included in the ZCTC corpus. Each sample has approximately 2,000 words. Samples composed of short texts can have multiple sources. The following bibliographic details are given where such information is available and has been recorded in data collection: sample ID, title, author/source, translator, publisher/journal, year/volume, sample position, and URL.

A) Press reportage (44 text samples)

B) Press editorial (27 text samples)

C) Press review (17 text samples)

D) Religious writing (17 text samples)

E) Skill / trade / hobby (38 text samples)

F) Popular lore (44 text samples)

G) Biography and essay (77 text samples)

H) Miscellaneous - reports and official document (30 text samples)

J) Science - academic prose (80 text samples)

K) General fiction (29 text samples)

L) Mystery and detective fiction ( 24 text samples)

M) Science fiction (6 text samples)

N) Adventure fiction (29 text samples)

P) Romantic fiction (29 text samples)

R) Humour (9 text samples)