Rubric
A rubric is a scoring tool that articulates the expectations for a performance or product by listing criteria (what to evaluate) and performance level descriptions (what quality looks like at each level along a continuum). Unlike a simple rating scale that uses numerical or evaluative labels alone, a true rubric provides descriptive language at each level, making the basis for judgement transparent to both assessors and learners.
In language assessment, rubrics operationalise the construct by specifying exactly what features of language performance are being evaluated and what distinguishes one level from another. The IELTS Writing band descriptors, the Cambridge Writing assessment scales, and the CEFR illustrative descriptors are all rubrics.
Key Components
Every rubric contains two essential elements (Brookhart, 2013):
- Criteria — the dimensions of quality being assessed (e.g., Task Achievement, Coherence & Cohesion, Lexical Resource)
- Performance level descriptions — prose descriptions of what work looks like at each level (e.g., Band 5, Band 6, Band 7)
A checklist (yes/no) is not a rubric. A rating scale with only numerical labels ("1 = poor, 5 = excellent") is not a rubric. The defining feature is descriptive language depicting what performance actually looks like at each level.
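The distinction can be made concrete with a minimal data-structure sketch: a rubric criterion carries prose descriptors per level, whereas a bare rating scale carries only labels. The criterion name follows IELTS, but the level wording here is an invented paraphrase, not the official descriptor text.

```python
from dataclasses import dataclass

@dataclass
class Criterion:
    name: str                    # dimension of quality, e.g. "Lexical Resource"
    descriptors: dict[int, str]  # level -> prose description of performance

# Illustrative wording only -- not the official IELTS band descriptors.
lexical_resource = Criterion(
    name="Lexical Resource",
    descriptors={
        5: "Limited range of vocabulary; noticeable errors in word choice.",
        6: "Adequate range; attempts less common items with some inaccuracy.",
        7: "Flexible use of vocabulary, including some less common items.",
    },
)

# A rating scale with only numeric labels ("1 = poor, 5 = excellent") would
# have no descriptors dict at all -- the descriptive text attached to each
# level is what makes this a rubric.
print(lexical_resource.descriptors[6])
```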
Types of Rubric
Analytic vs Holistic
| Feature | Analytic rubric | Holistic rubric |
|---|---|---|
| Structure | Separate scores on each criterion | Single overall score |
| Feedback value | High — shows strengths and weaknesses per criterion | Low — only an overall impression |
| Scoring speed | Slower (multiple judgements) | Faster (one judgement) |
| Reliability | Generally higher — criteria constrain judgement | Lower — more subjective |
| Best for | Formative feedback, diagnostic purposes | Summative grading, large-scale screening |
| ELT example | IELTS Writing (4 criteria, each scored 0–9) | TOEFL independent writing (single 0–5 scale) |
See Analytic vs Holistic Scoring for a detailed comparison.
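The structural difference is easy to sketch in code: analytic scoring produces one judgement per criterion and then combines them, while holistic scoring is a single judgement. The combination rule below (mean of the criterion scores, rounded to the nearest half band) is an assumption for illustration, not the official IELTS procedure, and the scores are invented.

```python
def analytic_band(scores: dict[str, float]) -> float:
    """Combine per-criterion band scores into one overall band.

    Assumed rule: average the criteria, round to the nearest half band.
    """
    avg = sum(scores.values()) / len(scores)
    return round(avg * 2) / 2

# Four separate judgements, one per criterion (analytic) ...
writing_scores = {
    "Task Response": 6.0,
    "Coherence & Cohesion": 7.0,
    "Lexical Resource": 7.0,
    "Grammatical Range & Accuracy": 6.0,
}
print(analytic_band(writing_scores))  # -> 6.5

# ... versus a single overall judgement (holistic):
holistic_score = 4  # one impression mark on a 0-5 scale
```

Note what is lost in the holistic version: the 6.5 above conceals that cohesion and vocabulary outperformed grammar and task response, which is exactly the diagnostic detail the table credits analytic rubrics with providing.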
General vs Task-Specific
| Feature | General rubric | Task-specific rubric |
|---|---|---|
| Scope | Applies to a family of similar tasks | Designed for one particular task |
| Reusability | High — same rubric across assignments | Low — new rubric per task |
| Learning value | Higher — students internalise transferable standards | Lower — students learn specific content expectations |
| ELT example | IELTS Writing Task 2 descriptors (any essay topic) | A rubric for a specific class presentation on climate change |
Arter and Chappuis (2006) argue that general rubrics are preferable for learning because they help students develop a transferable understanding of quality that applies across tasks.
Designing Quality Rubrics
Brookhart (2013) identifies the following principles for effective rubric design:
- Use descriptive language, not evaluative language — write what the work looks like, not judgement words like "excellent" or "poor"
- Align criteria to learning outcomes — assess what the task is meant to develop, not compliance with procedures
- Ensure criteria are distinct — each criterion should capture a different dimension without overlap
- Write a clear progression — each level should describe a qualitatively different stage, not just "more" or "less"
- Keep the number of levels manageable — research suggests 3–5 levels work best; more than six become hard to distinguish reliably
Arter and Chappuis (2006) recommend a bottom-up development process: collect student work samples, sort them into quality groups, then describe what distinguishes each group. This inductive approach produces more authentic descriptors than top-down theorising.
Rubrics in Language Assessment
| Assessment | Type | Criteria | Levels |
|---|---|---|---|
| IELTS Writing | Analytic | Task Achievement/Response, Coherence & Cohesion, Lexical Resource, Grammatical Range & Accuracy | Bands 0–9 |
| IELTS Speaking | Analytic | Fluency & Coherence, Lexical Resource, Grammatical Range & Accuracy, Pronunciation | Bands 0–9 |
| Cambridge B2 First Writing | Analytic | Content, Communicative Achievement, Organisation, Language | 0–5 per criterion |
| CEFR | General descriptive | Multiple can-do descriptors across skills | A1–C2 |
Teaching Implications
- Share rubrics with learners before the task — research consistently shows this improves performance and reduces anxiety (Andrade, 2000; Brookhart, 2013)
- Use rubrics for self-assessment and peer assessment — rubrics make the criteria explicit enough for learners to evaluate their own and each other's work
- Rubrics shape washback — what the rubric emphasises, teachers and learners will focus on
- Standardisation training requires rubrics — raters cannot be calibrated without shared descriptors to anchor their judgements
- Rubric quality affects reliability — vague descriptors produce inconsistent scoring; precise descriptors improve inter-rater reliability
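Inter-rater reliability can be quantified; Cohen's kappa is one standard statistic for two raters, correcting raw agreement for chance. The sketch below uses invented band scores for eight scripts, purely to show the computation.

```python
from collections import Counter

def cohens_kappa(rater_a: list, rater_b: list) -> float:
    """Chance-corrected agreement between two raters on the same scripts."""
    n = len(rater_a)
    # Observed proportion of exact agreements
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Expected agreement if each rater assigned bands at their own base rates
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(
        counts_a[c] * counts_b[c] for c in counts_a.keys() | counts_b.keys()
    ) / n**2
    return (observed - expected) / (1 - expected)

# Invented bands awarded by two trained raters to the same eight scripts
rater_1 = [5, 6, 6, 7, 5, 6, 7, 7]
rater_2 = [5, 6, 7, 7, 5, 6, 6, 7]
print(round(cohens_kappa(rater_1, rater_2), 2))  # -> 0.62
```

Vague descriptors widen the gap between raters' interpretations and push kappa down; precise, behaviourally anchored descriptors (plus standardisation training) push it up.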
Key References
- Arter, J. A. & Chappuis, J. (2006). Creating & Recognizing Quality Rubrics. Pearson.
- Brookhart, S. M. (2013). How to Create and Use Rubrics for Formative Assessment and Grading. ASCD.
- Andrade, H. G. (2000). Using rubrics to promote thinking and learning. Educational Leadership, 57(5), 13–18.
- Allen, D. & Tanner, K. (2006). Rubrics: Tools for making learning goals and evaluation criteria explicit for both teachers and learners. CBE—Life Sciences Education, 5(3), 197–203.
- Bachman, L. F. & Palmer, A. S. (1996). Language Testing in Practice. Oxford University Press.
See Also
- Rating Scale — rubrics are a specific type of rating scale with descriptive levels
- Analytic vs Holistic Scoring — the two main rubric structures
- Validity — rubric criteria must reflect the construct
- Reliability — rubric quality directly affects scoring consistency
- Standardization — calibrating raters using rubrics