Skip to main content

Largest Web-based, Genre-balanced English Corpus (COCA): A Tutorial and Suggestions for Classroom Use

Corpus of Contemporary American English was released on the web in 2008 with more than 400 million words of corpus, the largest free web-based corpus. However, what makes this corpus special is not its enormous size but its :
1- genre-balanced search ability , as you can look up the word or collocation in any genre available ( or in all) and compare between spoken, fiction, popular magazines, newspapers, and academic, or even between sub-genres (or domains), such as movie scripts, sports magazines, newspaper editorial, or scientific journals
2- over time search, as you can compare different years from 1990 to the present time
3- “.. semantically-based queries of the corpus. For example, you can contrast and compare the collocates of two related words (little/small, democrats/republicans, men/women), to determine the difference in meaning or use between these words [ also how each is used in different genres]”

4- interface, as you can easily conduct searches between all genres and years. You can have your search result listed as frequency  or relevance or as a chart. All of this is made possible because of the window frames.
Frame 1 : Navigation

Frame2:  Query Result (including frequency and word collocation)


Frame3: The query in context (Key Word in Context) and the source text extract.
5- Great tutorial that takes about 5 mins to complete. Help is always there if you need it.

The following video is a tutorial to show you how to use COCA and its great features.( Please use your headphones as the audio volume is low)

How to use it in the language classroom

1- You could do a vocabulary activity sheet same as the one embedded below. In such worksheet you turn the students’ attention to how words are used in authentic language and how each word behaves in a given sentence ( collocation with words but not with/more than others). You can also have students notice the nuance of difference between seemingly synonymous words and how each collocates with different words.
The worksheet below was constructed using COCA as a pre-reading vocabulary activity where students were introduced to  new lexis in context and asked to use context clues to figure out the meaning and asked about the collocation of each as well as semantic and syntactic significance of each.
vocabulary activity sheetgrade 8 mokey;s paw

2- You could do word part analysis worksheet where students practice on prefix or root words of new vocabulary to result in noticing of how words are formed. The document below was also constructed using the COCA and the wildcard (*) as a query ( I searched for the the prefix*, e.g. circum* to result in all words that start with circum-). After students formed the words ( working in pairs) they filled in the blanks  with the appropriate words in sentences cited from corpus. As such, students experimented with new vocabulary not in isolation but in authentic language.

3- You could have students derive a certain grammatical rule for the word “any” for example and when it is used. The options for using it in the classroom are endless.

Preparing corpus and concordancing for classroom use takes time and effort, and training for teachers/students too. The result is definitely one of satisfaction and enhancement of language learning as it is now learned using authentic language. Not only it changes the role of the students to a language explorer but also it aids in understanding English language ( from teachers’ and students’ perspective) like never before.
I hope this post was helpful enough :)
Any comments, suggestions, or queries are welcome in the comment feature.

Popular posts from this blog

Edmodo: A Microblogging Educational Platform

I’ve been aware of edmodo for quite a time now though I have never had the chance to use it with my students yet, as the scholastic year did not start yet.
What is Edmodo?
We all know twitter as a social networking platform and a microblogging platform for language learners right!!! The thing is that twitter does not have the security that our students need for safe microblogging. This is where edmodo comes in with its enhanced new features.

Simply put, Edmodo is a microblogging platform for education. You notice this on the home page of edmodo where there you can sign up as a teacher or a student.

Once you enter as a teacher you have to create an account to use edmodo. Your pesonal page contains all the features you need to connect with your students. You can upload assignments with files, link to urls, embed videos, or post a note.
The security in edmodo is that you have to create a group to connect to. Once this is done, you are given a code which in turn you give to your students.…

No More Text Comments: Giving Voice Feedback on Google Docs

One of the best features of Google Docs is that teachers can comment on students’ essays by highlighting the selected text and giving textual commentary. Students can in turn comment back on their teacher’s comment, making it a great formative feedback for essay writing. It does not only kill the red ink annotations, which are a real annoyance to students, but also targets the intended selection to comment on in an organized manner. However, if teachers want to take formative feedback to new levels of personalized feedback, a voice feedback would be a great solution. This is what Kaizena actually does, and more. Kaizena is a voice commentary online application that integrates fully with Google Drive to maintain the smooth workflow. You don’t even have to go to Kaizena website to install it. It works much like the Google Docs commentary but instead of textual commentary in the highlighted essay section, you include a voice commentary. Kaizena also supports text commentary and highlight…

Emerging Technologies, Key Trends,and Challenges in K- 12 Education

The NMC Horizon 2013 report  came out couple of weeks back with its time-to-adaption of emerging technologies in k-12 education. What New Media Consortium Horizon does is conduct extensive research in the domain of digital learning, and project their probability on the adoption of emerging learning technologies. The report features six technologies with three adoption horizons: 1 year, 2 to 3 years, and 4 to 5 years.The report also includes major trends in the area of digital learning in k-12 education and the major challenges facing education in terms of using technology in education.
Time-to-Adoption for K-12
New-term Horizon (Time-to-adoption 1 year)
Mobile Learning Mobile learning is becoming an essential part in k-12 education. There have already been many initiative programs like the one-to-one and the BYOD programs to help students learn anytime and everywhere. Mobile learning also has more affordance than laptops or PCs for combining the real world and virtual tools in what’s…