....L E R - B I M L.....L E R - B I M L.....L E R - B I M L.....L E R - B I M L.....L E R - B I M L.....L E R - B I M L.....

The Corpus

 

Corpus release and download details 

Scottish Gaelic corpus 

A beta version of the Scottish Gaelic corpus can now be downloaded here.

Corpus contents:

All files are encoded in UTF-8 format.


Welsh corpus 

A beta version of the Welsh corpus can now be downloaded here.

Corpus contents:

All files are encoded in UTF-8 format.

A part-of-speech tagged version of the Welsh corpus, and resources for Welsh part-of-speech tagging with the Brill Tagger, can be downloaded here.

A set of Welsh resources for tagging with Oliver Mason's QTag part-of-speech tagger can be downloaded here.