English Corpora: most widely used online corpora. Billions of words of data: free online access

You can compare the collocates of two words, to see how they differ in meaning and usage. For example, compare the noun collocates of utter and complete + NOUN (note the negative collocates with utter) or warm / hot or small and little, or the adjectives near boys and girls or Democrats and Republicans, or the objects of destroy / ruin or sanction / approve. By comparing collocates, you can move far beyond the simplistic entries in a thesaurus, to "tease out" slight differences in words, or (as in the case of boy and girl ) what is the difference in what is being said about two different things.

The following are the first few lines from the results of a search comparing the nouns immediately after utter and complete in COCA. A different search (in another corpus) will of course yield different results, but the general concepts remain the same. Before you try to interpret the numbers, notice how much the collocates of utter are much more negative than those with complete (this is an example of semantic prosody).

The basic idea of the table is that we want to see how frequent a collocate is with two competing words, compared to the overall frequency of those two words. For example, if there are twice as many tokens of Word1 as Word2 in the corpus overall, but a given collocate occurs fifty times as much with Word1 as with Word2, then the ratio of Word1 to Word2 with that collocate is 25 times what would otherwise be "expected".

1, 2. The two words being compared
3, 4. The overall frequency for the two words. In this example, there are 5812 tokens of utter and 92087 tokens of complete.
5, 6. The ratio of the frequency of the two words. For example, there are .06 tokens of utter for every token of complete in the corpus, and 15.84 tokens of complete for every token of utter. In other words, because complete is about 16 times as frequent as utter, any collocate (all things being equal) should occur about 16 times more frequently with complete than with utter.
7. The rank-ordered list of words or phrases that occur with [1]. Click on the word or phrase to see the "Keyword in Context" display.
8. The frequency of [7] with [1]. In this case we looked for nouns after utter, so this indicates that there were 16 tokens of utter helplessness (the fourth entry on the left).
9. The frequency of [7] with the competing word [2]. In this case, it shows that there are just 3 cases of complete helplessness.
10. The ratio of [8] / [9]. In this case, there are 5.3 times as many cases of utter helplessness as there are complete helplessness (When the competing word has a frequency of 0, it is set to .5, to avoid division by 0.)
11. The ratio of [10], compared to [5]. Remember that there should be about .06 tokens with utter for every token with complete, since that is the overall ratio of the two words in the corpus. In the case of helplessness, though, the ratio of [utter/complete] is 5.3, which is 84.5 times the "expected" frequency of .06. The results are sorted by the decreasing figures in this column.

Note that in the example above, the entries are sorted by the "score", which is a function of the ratio of the two words. But if you just want to see which are the most frequent strings with each word (regardless of what is happening with the other word), then select OPTIONS / [SORT BY] = [FREQUENCY] in the search form.

English-Corpora.org