Naturalness vs Pedagogic Clarity
The naturalness–clarity tradeoff is the core design tension in writing and recording listening texts for the language classroom. A text high in naturalness retains the Features of Unplanned Spoken Discourse (false starts, fillers, ellipsis, vague language, reduced phonology, real speech rate) and represents the listening signal learners must eventually decode in the world. A text high in pedagogic clarity strips those features to expose target structures, control information density, and keep beginners afloat. The two pull against each other on every line of every script.
The shape of the tradeoff
Maximum naturalness, achieved through unedited authentic recording, presents at lower levels as undifferentiated noise. Even fluent speakers from the target community navigate spontaneous discourse using shared cultural and contextual knowledge that learners lack; the linguistic surface alone is opaque without that scaffolding. Maximum pedagogic clarity, achieved through fully scripted studio recording, presents as a clean signal but a foreign register. Learners decode every word and still fail to comprehend the same content delivered naturally, because the perceptual repair work, the discourse-marker tracking, and the ellipsis resolution required by real speech were never on the table.
Coursebook publishers historically resolved the tension toward clarity. Carter (1998) and McCarthy, working from CANCODE, documented the consequence: the gap between coursebook dialogue and corpus-evidenced spoken English is large and systematic. Mishan (2005) frames the problem in design terms: authenticity is not binary but a graded property of materials, and texts can be designed to retain target features of natural speech while editing for level.
Gilmore's reframing
Gilmore (2007) shifts the debate from binary opposition to graded engineering. Authenticity, in his account, is multi-dimensional: provenance (where the text came from), task (what learners do with it), interaction (whether the classroom use mirrors real-world use), and learner response (what counts as authentic engagement for a given group). A semi-scripted text recorded with corpus-informed briefs can be more authentic, on most of these dimensions, than a transcribed authentic recording stripped of its context and used for a comprehension quiz. The design question is not "scripted or authentic" but "which features of natural speech does this text need to retain, and which can be edited for clarity at this level?"
Operational consequence
The position now widely endorsed across materials-development authorities (Brian Tomlinson, Field, Buck, Mishan, Gilmore) is a phased blend. At lower levels, scripts retain enough naturalness features to keep the signal recognisable as speech (contractions, common discourse markers, reduced forms in high-frequency phrases) while editing aggressively for length and density. At intermediate levels, semi-scripted recording briefs replace fully-written scripts. At upper-intermediate and above, authentic recordings dominate, with task design carrying the load that script editing once carried at lower levels. The tradeoff does not disappear; it shifts from being absorbed by the script to being absorbed by the task.
References
- Carter, R. (1998). Orders of reality: CANCODE, communication, and culture. ELT Journal, 52(1), 43–56.
- Gilmore, A. (2007). Authentic materials and authenticity in foreign language learning. Language Teaching, 40(2), 97–118. PDF: https://www.alexandergilmore.com/wp-content/uploads/2022/04/Gilmore_2007.pdf
- Mishan, F. (2005). Designing Authenticity into Language Learning Materials. Intellect.