The following shows the number of words (in billions) for the
different
Google
Books datasets. After doing a search in one dataset, you can
quickly and easily re-do the same search in another database by clicking
on one of the links after COMPARE in the header above. This will allow
you, for example, to quickly compare a number of phenomena in
British
and American English. Note also that although all of these datasets are
available via this interface, nearly all of the examples in the help
files come from just the American English dataset.
As far as the content of each dataset, the American and British datsets
should be self-explanatory. "Fiction" is both American and British. The "One Million Books"
datset is a subset of the entire English set,
and contains just those books whose OCR quality is the best, and it t is also
more balanced by subject for the last 100 years or so. Please remember that the frequency listings
from the n-grams are for the particularly database that you are
searching. But when you then click to see the extracts from Google
Books, you are seeing the extracts from ALL datasets. There is
unfortunately no way around this. (More
information...) |