ELTiverse

Search Terms

Search for ELT terms and concepts

Between-Group Study

SLAbetween-group designbetween-subjects designcomparison study

A between-group study (also called between-subjects design) compares two or more groups that receive different treatments. In SLA research, this typically means a treatment group (e.g., receiving TBLT instruction) compared to a control or comparison group (e.g., receiving traditional instruction or no instruction).

Why It Matters

Between-group designs are the standard for causal claims about instructional effectiveness. The logic: if both groups start at the same level and only the treatment differs, post-treatment differences can be attributed to the treatment.

Contrast with within-group (pre-post) designs, which test the same group before and after treatment. Within-group designs cannot distinguish treatment effects from maturation, practice effects, or other factors.

Requirements for a Sound Between-Group Study

  1. Pre-test for both groups — establishes baseline equivalence. Without this, post-treatment differences might reflect pre-existing ability gaps.
  2. Random assignment (ideal) or demonstrated equivalence — ensures groups are comparable. In classroom research, true randomization is often impractical, making pre-testing even more critical.
  3. Clear description of both groups — you need to know what each group actually did. A vague "traditional instruction" label for the control group makes interpretation impossible.
  4. Same outcome measure — both groups must be assessed with the same instrument under the same conditions.

The Problem of Design Conflation

Pooling effect sizes from between-group and within-group studies in a single meta-analysis inflates the overall estimate. Within-group effect sizes tend to be larger because they measure change within individuals (which includes learning from any cause), while between-group effect sizes isolate the treatment effect.

This was one of the core problems in the The Bryfonski-McKay [[TBLT Meta-Analysis in [[SLA|Meta-Analysis in [[SLA|Meta-Analysis]]]] Controversy|Bryfonski & McKay (2019) meta-analysis]]: of 52 studies, only 27 were between-group designs, and even among those 27, many lacked pre-tests or adequate control group descriptions.

Effect Size

The standard measure for between-group comparisons is Cohen's d (or Hedges' g for small samples):

  • d = 0.2 — small effect
  • d = 0.5 — medium effect
  • d = 0.8 — large effect

Bryfonski & McKay reported d = 0.93 (large). Xuan et al.'s recalculation from better-screened studies found g = 0.61 (medium).

In ELT Research

Most classroom-based SLA studies use quasi-experimental between-group designs (intact classes, not random assignment). This makes pre-testing and transparency about comparison conditions especially important. When these are missing, the study cannot reliably answer the question it set out to answer.