Chunking
Chunking is the cognitive process of organising individual items into larger, meaningful units (chunks) to overcome working memory limitations. In SLA, chunking explains how learners acquire and store multi-word sequences as single processing units, and how fluency develops as the grain size of processing increases.
The Cognitive Basis
George Miller (1956) established that working memory can hold approximately 7 ± 2 items. Chunking circumvents this limit by recoding multiple items into a single unit. The sequence C-A-T-S-I-T-M-A-T (9 items) becomes CAT-SIT-MAT (3 chunks), freeing WM capacity for other operations.
In language, the principle operates at every level:
| Level | Individual items | Chunk |
|---|---|---|
| Phonological | /k/ /æ/ /t/ | /kæt/ |
| Lexical | pick + up | pick up (phrasal verb) |
| Phrasal | by + the + way | by the way (discourse marker) |
| Clausal | if + I + were + you | if I were you (formulaic frame) |
Chunking and L2 Acquisition
Formulaic Language: Much of native-like fluency rests on a large repertoire of pre-fabricated chunks, including collocations, idioms, sentence stems, and discourse markers. These are stored and retrieved as single units, requiring minimal WM resources. The Lexical Approach (Lewis, 1993) made this insight central to pedagogy.
Fluency development: As learners chunk more language, their speech becomes faster and more fluent, not because individual processes speed up, but because fewer processing operations are needed. A learner who produces I'd like to as a single chunk processes one unit instead of four.
Automaticity: Chunking is a mechanism through which automatisation occurs. Repeated co-occurrence of items leads to their storage as a single representation, reducing the processing load and enabling faster retrieval.
Developmental progression: Beginners rely heavily on memorised chunks (unanalysed wholes). As they develop, learners analyse chunks into components, extract rules, and then re-chunk at higher levels, a cycle that connects to U-shaped development and Restructuring.
Chunk Size and Proficiency
Research suggests that L2 proficiency correlates with chunk size: advanced learners process in larger units than beginners. Native speakers store vast numbers of multi-word chunks; estimates suggest hundreds of thousands of formulaic sequences alongside individual words.
Teaching Implications
- Vocabulary teaching should include multi-word units, not just individual words: collocations, phrases, and sentence frames
- Repetition and retrieval practice consolidate chunks in long-term memory
- Extensive reading and listening provide the frequency of exposure needed for natural chunking to occur
- Activities like dictogloss, sentence builders, and pattern drills promote chunking through repeated processing of multi-word sequences
- Learners should be encouraged to notice and record language in chunks rather than word by word
References
- Miller, G.A. (1956). The magical number seven, plus or minus two. Psychological Review, 63(2), 81–97.
- Lewis, M. (1993). The lexical approach. Language Teaching Publications.
- Wray, A. (2002). Formulaic language and the lexicon. Cambridge University Press.
- Ellis, N.C. (1996). Sequencing in SLA: Phonological memory, chunking, and points of order. Studies in Second Language Acquisition, 18(1), 91–126.