Summative Assessment
Summative assessment measures what learners have achieved at the end of a defined period — a unit, course, term, or program. Its purpose is to certify, grade, or rank, not to inform ongoing instruction. Often called "assessment OF learning."
Characteristics
- Retrospective. It looks back at what has been learned, not forward at what to teach next.
- High-stakes (usually). Results carry consequences: grades, progression, certification, selection.
- Standardized (usually). Conditions, tasks, and marking criteria are fixed to allow fair comparison.
- Infrequent. End-of-unit tests, final exams, standardized proficiency tests.
Common Forms in ELT
| Type | Example | What it measures |
|---|---|---|
| End-of-course exam | Course final covering Units 1-12 | Achievement against course objectives |
| Standardized proficiency test | IELTS, TOEFL, Cambridge exams | General language proficiency |
| Portfolio evaluation | Collected work assessed against rubric | Development and achievement over time |
| Oral examination | Structured speaking test with rubric | Speaking proficiency |
| Exit test | Test at end of a program level | Readiness to move to next level |
Quality Criteria
A summative assessment is only as useful as its quality. The three pillars:
- Validity — Does it measure what it claims to measure? A writing test that is mostly multiple-choice grammar has questionable validity as a measure of writing ability.
- Reliability — Would different markers, or the same marker on different days, give the same score? Reliability is especially challenging with productive skills (writing, speaking) where subjective judgment is involved.
- Practicality — Can it be administered, scored, and interpreted within the available resources?
Summative vs Formative: A Spectrum
The distinction is about purpose, not format. A mid-term test used only for grading is summative. The same test, if its results are analyzed to adjust the second half of the course, serves a formative function. In practice, good assessment programs use both: Formative Assessment during the course to improve learning, summative assessment at the end to certify it.
Washback Effects
High-stakes summative tests powerfully shape what happens in classrooms. See Washback. If the final exam tests grammar translation, teachers will teach grammar translation — regardless of what the syllabus says. This makes test design a curricular decision, not just an assessment one. Well-designed summative assessments create positive washback by testing the skills and knowledge that matter.
Pitfalls
- Over-reliance on summative data. A final grade tells you what a learner achieved but not why they succeeded or failed, nor what to do about it.
- Teaching to the test. When summative assessments dominate, instruction narrows to test preparation. The solution is not to eliminate summative assessment but to ensure the test is worth teaching to.
- One-shot measurement. A single exam on a single day is a limited sample of ability. Illness, anxiety, or a bad day can distort results. Multiple assessment points increase reliability.