The corpora from English-Corpora.org are free, but there are two ways to obtain increased access to the corpus data: purchasing full-text data, and obtaining an academic / site license. These are two very different options, and universities or other organizations typically choose just one of the two.


Academic / site license

Full-text data


Online access to the corpora, such as iWeb, COCA, COHA, NOW, GloWbE, or BYU-BNC.

Download the data to your own computer(s). Several corpora currently available: iWeb, COCA, COHA, GloWbE, Wikipedia, NOW, SOAP, the TV Corpus, the Movies Corpus, and the Corpus del Espaņol.

Typical users

Students, and teachers/professors who are fine with the web interface, and who do not need to manipulate the underlying corpus data.

Those who want to process the corpus data for their own purposes (typically those with programming skills, to manipulate lots of data), and those who do not want to be constrained by the web interface.

Effort involved

Essentially none; just use the web interface.

(Potentially) quite a bit: downloading the files, (possibly) formatting them for quick retrieval, mastering the use of text retrieval software, etc.

Format / queries

Queries available via the web interface.

Three formats (simple text, word/lemma/PoS, and database); many different uses.


$300 - $700 for a one year site license (more...)

$245 - $795, depending on the number of corpora and the number of users  (more...)


Typically one year, although there is the possibility of discounted two-year and three-year licenses.

One-time purchase; the license never expires.