Corpora

A corpus is ‘a collection of pieces of language, selected and ordered according to explicit linguistic criteria in order to be used as a sample of language’ (Sinclair, 1996).

Welcome back!

Englicious is totally free for everyone to use!

But you will have to log in to see our library of teaching resources.

If you don’t have an account, that’s perfectly OK. You can register (for free).

It only takes a minute or two.

Corpora: Useful web tools

The following are corpus-related websites which we think are helpful for investigating language.

Wordle

Wordle is a simple-to-use site that lets you paste in your own data and then creates an attractive ‘word cloud’ based on the frequency of the words you’ve used. You can use Wordle as a very simple corpus tool for something like a poem, a song lyric, a political speech or a soliloquy from a play and get a visual representation of the language within it. (See also the lesson entitled 'Word clouds in action', which uses Wordle as a way in to analysing a poem).

Welcome back!

Englicious is totally free for everyone to use!

But you will have to log in to see our library of teaching resources.

If you don’t have an account, that’s perfectly OK. You can register (for free).

It only takes a minute or two.