Summative Assessment
Summative assessment measures what learners have achieved at the end of a defined period: a unit, course, term, or program. Its purpose is to certify, grade, or rank, not to inform ongoing instruction. Often called "assessment OF learning."
Characteristics
- Retrospective. It looks back at what has been learned, not forward at what to teach next.
- High-stakes (usually). Results carry consequences: grades, progression, certification, selection.
- Standardized (usually). Conditions, tasks, and marking criteria are fixed to allow fair comparison.
- Infrequent. End-of-unit tests, final exams, standardized proficiency tests.
Common Forms in ELT
| Type | Example | What it measures |
|---|---|---|
| End-of-course exam | Course final covering Units 1-12 | Achievement against course objectives |
| Standardized proficiency test | IELTS, TOEFL, Cambridge exams | General language proficiency |
| Portfolio evaluation | Collected work assessed against rubric | Development and achievement over time |
| Oral examination | Structured speaking test with rubric | Speaking proficiency |
| Exit test | Test at end of a program level | Readiness to move to next level |
Quality Criteria
A summative assessment is only as useful as its quality. The three pillars:
- Validity: Does it measure what it claims to measure? A writing test that is mostly multiple-choice grammar has questionable validity as a measure of writing ability.
- Reliability: Would different markers, or the same marker on different days, give the same score? Reliability is especially challenging with productive skills (writing, speaking) where subjective judgment is involved.
- Practicality: Can it be administered, scored, and interpreted within the available resources?
Summative vs Formative: A Spectrum
The distinction is about purpose, not format. A mid-term test used only for grading is summative. The same test, if its results are analyzed to adjust the second half of the course, serves a formative function. In practice, good assessment programs use both: Formative Assessment during the course to improve learning, summative assessment at the end to certify it.
Washback Effects
High-stakes summative tests powerfully shape what happens in classrooms. See Washback. If the final exam tests grammar translation, teachers will teach grammar translation regardless of what the syllabus says. This makes test design a curricular decision, not just an assessment one. Well-designed summative assessments create positive washback by testing the skills and knowledge that matter.
Pitfalls
- Over-reliance on summative data. A final grade tells you what a learner achieved but not why they succeeded or failed, nor what to do about it.
- Teaching to the test. When summative assessments dominate, instruction narrows to test preparation. The solution is not to eliminate summative assessment but to ensure the test is worth teaching to.
- One-shot measurement. A single exam on a single day is a limited sample of ability. Illness, anxiety, or a bad day can distort results. Multiple assessment points increase reliability.