ELTiverse

Search Terms

Search for ELT terms and concepts

Criterion-referenced Testing

AssessmentCriterion-referenced AssessmentCRT

Criterion-referenced tests (CRTs) measure performance against fixed, predetermined criteria — specific descriptions of what a learner can or cannot do. The question is not "How does this learner compare to others?" but "Can this learner do X?"

How It Works

Performance is judged against a standard, not against other test-takers. Every learner who meets the criteria passes; every learner who does not, fails. In theory, 100% of test-takers could pass (or fail).

Example: A criterion-referenced writing test might specify: The learner can write a formal email of complaint that includes a clear statement of the problem, supporting details, and a request for action, using appropriate register.

A learner either demonstrates this ability or does not. Their score is meaningful on its own — it does not need comparison to a peer group.

CRT vs NRT

FeatureCriterion-referencedNorm-referenced
Reference pointFixed criteria/standardsOther test-takers
Score interpretation"Can do X" / "Cannot do X""Better/worse than Y% of peers"
Score distributionNot predeterminedDesigned to spread (bell curve)
PurposeCertification, mastery, achievementRanking, selection, placement
Item designItems match learning objectivesItems designed to discriminate

Where It Is Used

  • Classroom achievement tests — Most teacher-made tests are (or should be) criterion-referenced: "Can learners use the past simple to narrate events?"
  • CEFR-aligned assessments — The CEFR descriptors ("Can understand the main point of clear standard input on familiar matters") are criteria. Assessments aligned to CEFR levels are criterion-referenced.
  • Competency-based programs — Pass/fail decisions based on demonstrated ability.
  • Can-do checklists — Self-assessment and teacher assessment against specific skill descriptors.

Advantages

  • Scores are directly interpretable — you know what the learner can actually do
  • Aligns assessment with learning objectives (strong content validity)
  • Supports Formative Assessment — criteria make it clear what to work on next
  • Does not require a comparison group

Challenges

  • Setting cut scores. Where is the line between pass and fail? This is a judgment call, and different judges may disagree.
  • Defining criteria precisely. Vague criteria ("writes adequately") lead to unreliable assessment. Criteria must be specific and observable.
  • Limited discrimination at the top. CRTs do not differentiate well among learners who all exceed the criteria. If the purpose is ranking or selection, Norm-referenced Testing is more appropriate.

Related Terms