()   

 

Now available! At English-Corpora.org, we're introducing a new way to interact with corpus data. Using Large Language Models (LLMs) like GPT, Gemini, and Claude, users can now have collocates, phrases, and frequency data clustered, categorized, and explained automatically. The underlying corpus data remains unchanged — but AI provides an optional layer of analysis to help users spot patterns and connections more quickly.

Just click on any of the green links on a corpus results page to use these features.   [Sample searches | Get started]   [More]

Compare the BNC and COCA: academic, informal, web texts
Information on BNC 2014 Written

The British National Corpus (BNC) was originally created by Oxford University press in the 1980s - early 1990s, and it contains 100 million words of text from a wide range of genres (e.g. spoken, fiction, magazines, newspapers, and academic).

The BNC is related to other corpora from English-Corpora.org, which are the most widely used corpora of English and which offer unparalleled insight into variation in English.

Click on any of the links in the search form on the search page for context-sensitive help, and to see the range of queries that the corpus offers. You might pay special attention to the comparisons between genres and virtual corpora, which allow you to create personalized collections of texts related to a particular area of interest. And you might want to check out the new expanded help files.

Five minute tour