Korpukset

British National Corpus
The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of current British English, both spoken and written.
University of Oxford

Scottish Corpus of Texts and Speech
The website provides a large electronic corpus of both written and spoken texts for the languages of Scotland.

Corpus of Global Web-Based English
This corpus provide a window to the frequencies of words, phrases, or grammatical constructions in 20 different countries as well as the chance to compare features in two sets of dialects.

WebCorp
WebCorp is a suite of tools which allows access to the World Wide Web as a corpus – a large collection of texts from which facts about the language can be extracted.
Birmingham City University