Authentic Assessment
Authentic assessment requires learners to perform real-world tasks that mirror actual language use, rather than answering decontextualised, discrete-point test items. Instead of circling the correct verb form, the learner writes an email, gives a presentation, or participates in a discussion — tasks that have a communicative purpose beyond demonstrating knowledge of the language system.
The concept gained prominence alongside Communicative Language Teaching in the 1980s–90s, as the field recognised that testing isolated grammar points and vocabulary items was misaligned with communicative teaching goals.
Authentic vs Traditional Assessment
| Feature | Traditional | Authentic |
|---|---|---|
| Task type | Multiple choice, gap-fill, matching | Writing tasks, role-plays, projects, portfolios |
| Context | Decontextualised | Situated in real or realistic scenarios |
| Skills tested | Discrete, isolated | Integrated, holistic |
| Response | Select from options | Construct a response |
| Criterion | Correct/incorrect | Quality judged against rubric or rating scale |
| Validity | May lack face and construct validity for communicative ability | Strong face validity; tests what it claims to test |
Principles
- Tasks resemble real-world language use — If the goal is communicative competence, the test should require communication
- Integration of skills — Real-world tasks rarely involve one skill in isolation; reading-into-writing or listening-into-speaking tasks are more authentic than pure receptive tests
- Meaningful context — Tasks are embedded in a scenario the learner might plausibly encounter (ordering food, summarising a report, negotiating a plan)
- Criteria reflect real-world success — Scoring evaluates whether the communicative purpose was achieved, not just linguistic accuracy
Authenticity Is a Continuum
No assessment is fully authentic — any test situation is artificial to some degree. Authenticity in assessment is better understood as a continuum:
- Low authenticity: Select the correct preposition from four options
- Moderate authenticity: Read a text and write a summary
- High authenticity: Role-play a job interview with an assessor
Bachman & Palmer (1996) use the term authenticity as one of their test usefulness qualities, defined as the degree to which test tasks correspond to the features of real-world target language use.
Challenges
- Scoring — Constructed responses require subjective judgment; rubrics and trained raters are essential
- Reliability — Subjective scoring introduces variance; inter-rater reliability must be actively managed
- Practicality — Authentic tasks take longer to administer and score than multiple-choice items
- Task equivalence — Ensuring different test forms are comparable is harder when tasks are complex
- Sampling — Authentic tasks assess performance on specific tasks; generalising to broader ability requires sufficient sampling
In Practice
Authentic assessment works best when integrated into the teaching cycle rather than reserved for final exams:
- Portfolio Assessment — collecting work over time
- Project presentations
- Simulated workplace tasks (writing reports, making phone calls)
- Peer interaction tasks
- Performance Assessment — any task requiring production rather than recognition
Key References
- Bachman, L. F. & Palmer, A. S. (1996). Language Testing in Practice. Oxford University Press.
- Wiggins, G. (1990). The case for authentic assessment. Practical Assessment, Research, and Evaluation, 2(1).
- McNamara, T. (2000). Language Testing. Oxford University Press.
See Also
- Validity — authentic tasks support construct validity for communicative ability
- Performance Assessment — a closely related concept
- CLT — the teaching approach that authentic assessment naturally complements
- Washback — authentic assessment encourages communicative teaching