This corpus was created with a grant from the National Science Foundation (2013-2016, "A Linguistic Taxonomy of English Web Registers"). The main researchers are Douglas Biber, Mark Davies, and Jesse Egbert. More information: A, B.

This corpus contains more than 50 million words of text from the web. Unlike other corpora from the web, which are just big "blobs" of data, this is the first large web-based corpus that is carefully categorized into many different registers.

Click on any of the links in the search form on the search page for context-sensitive help and to see the range of queries that the corpus offers. You might pay special attention to two features: comparisons between registers, and virtual corpora, which allow you to create personalized collections of texts related to a particular area of interest.

Finally, the corpus is related to other corpora from English-Corpora.org, which are the most widely used corpora of English and which offer unparalleled insight into variation in English.

Examples of web registers