Full-text data |
Download nearly one billion words of data from COCA, in any of three
different formats. Once downloaded, the data can be processed offline
in any way you want. |
Word Frequency |
Download lists of the top 60,000 lemmas in COCA, including each lemma's
frequency in the eight main genres and nearly 100 sub-genres. You
can also download a list with the frequency of individual word forms (e.g.
decide, decides, deciding, decided), as well as a list of the
top 219,000 words (not lemmas) in COCA, including frequency by
genre. |
Collocates |
Download lists with the top 200-300 collocates (nearby words) for each of
the top 60,000 lemmas in COCA -- 13,500,000 node/collocate pairs in all. |
N-grams |
Download lists (in various formats) of all 2-, 3-, 4-, and 5-word
strings that occur at least four times in COCA -- more than 40
million n-grams in total. |
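
To make the n-gram description concrete, here is a minimal Python sketch of counting 2- to 5-word strings in a tokenized text and keeping only those that occur at least four times. It is a toy illustration, not the procedure used to build the COCA lists; the function names (count_ngrams, frequent_ngrams) and the sample text are hypothetical.

```python
from collections import Counter


def count_ngrams(tokens, n):
    """Count every contiguous n-word string in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def frequent_ngrams(tokens, sizes=(2, 3, 4, 5), min_count=4):
    """Return n-grams of the given sizes that occur at least min_count times."""
    result = {}
    for n in sizes:
        counts = count_ngrams(tokens, n)
        result.update({gram: c for gram, c in counts.items() if c >= min_count})
    return result


# Toy input (an assumption for illustration): one phrase repeated four times.
tokens = ("to be or not to be " * 4).split()
frequent = frequent_ngrams(tokens)

for gram, count in sorted(frequent.items(), key=lambda item: (len(item[0]), item[0])):
    print(" ".join(gram), count)
```

On a real corpus the same threshold filter would be applied to counts gathered across all texts; the downloadable lists already contain the filtered result, so a sketch like this is only needed if you want to recompute n-grams from the full-text data yourself.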