SORTING  1    [TYPE]  2  

Selects how the results will be sorted. The default is sorting by raw FREQUENCY, where the most frequent results appear first. You can also sort by RELEVANCE. Generally relevance is defined by the Mutual Information score, which is a measure of how "tightly" linked two words are. This takes into account the overall frequency of collocates, and sorts out high-frequency "noise" words. Compare the regular listing for hard * or * himself and the relevance-sorted listings of the same: hard * or * himself.

If you are comparing two sections of the corpus, then relevance shows what words are in Section 1 but not Section 2 and vice versa.

 2  Shows how relevance is being calculated (see above)