English-Corpora.org

QUERIES

All of the corpora have exactly the same architecture and interface, which allows users to carry out the following types of searches. One of the important advantages of our corpus architecture is that with one simply query and one click, users can analyze variation by comparing different sections of a corpus; e.g. genres in COCA or the BNC, dialects in GloWbE or NOW, or across time periods (COHA, TIME, recent changes in COCA or NOW, and Google Books (Advanced)).


Visualization. You can see (examples with end up V-ing): Limiting and comparing sections
1) a chart with the overall frequency of all matching strings
2) the individual strings (overall - all sections)
3) individual strings (in each section of the corpus: genre, dialect, or time period)
1. You can also limit the search to just particular sections of the corpus (e.g. hard NOUN in Fiction)
2. More importantly, you can compare between two sections of the corpus (e.g. hard NOUN in FIC vs ACAD) -- either by genre, dialect, or time period.


Note: click on any link on this page to see the corpus data, and then click on the "BACK" image (see left) at the top of the page to come back to this page.

Type of search COCA-General COCA-Genres GloWbE-Dialects COHA-Historical
Specific word or phrase I guess validity lah! of no little
Substring *al_j *al_j (MAG/ACAD) *ism (core/SAsia) *ism (earlier/later)
Lemma (forms of a word) CONJ PRON BE like ,
( and she was like , )
ADJ CHAIN (FIC/ACAD) BE different to HAVE quite V-ed
Part of speech ADJ eyes ADJ body (MAG/ACAD) went ADJ a most ADJ NOUN
Synonyms =strong =strong (FIC/ACAD) =beautiful WOMAN =beautiful =girl
User-defined lists @colors @clothes FEEL @emotions (FIC/ACAD) @colors @clothes felt @emotions
Sortable concordance lines fathom argue (ACAD) diametrically swell (1930s)
Collocates (nearby words) BREAK_v chair (FIC/ACAD) scheme (US/GB) gay (earlier/later)
-- Use Mutual Information score  BREAK_v      
-- Compare two words  utter / sheer