Pronunciation

PhonologySkillspronunciation teachingL2 pronunciation

Pronunciation as a teaching domain covers how learners produce and perceive the spoken form of a language, across the full range from individual phonemes up to the melodic and rhythmic organisation of connected utterances. It is not reducible to "getting the sounds right." Celce-Murcia, Brinton and Goodwin (2010) frame it as two interlocking levels: segmentals (consonants and vowels) and suprasegmentals (word stress, sentence stress, rhythm, intonation, and connected speech processes), all of which can affect whether a listener understands and how the speaker is heard socially.

Segmentals and suprasegmentals

Segmentals are the discrete phonemes of the language, typically taught through minimal pairs, articulatory description, and perceptual discrimination. Suprasegmentals are the prosodic features that operate over syllables, words, and phrases: stress placement, rhythm (including the stress-timed vs syllable-timed distinction), intonation contours, and the reductions, linking, and assimilations of connected speech.

Historically, pronunciation teaching was heavily segmental. The reformed method and audiolingualism treated pronunciation as phoneme drilling, while early communicative language teaching largely ignored pronunciation altogether (Levis, 2005). The shift came from empirical work showing prosody carried more functional weight than teachers had assumed. Munro and Derwing (1999) reported that prosodic errors affected intelligibility more than segmental errors in many cases, and from the late 1990s onward research and published coursebooks moved toward a more balanced treatment in which suprasegmentals were given parity with, and sometimes priority over, individual sounds.

Nativeness vs intelligibility

Levis (2005) named the two principles that had tacitly competed in the field. The nativeness principle holds that L2 learners should aim at native-speaker pronunciation, with residual accent treated as failure. The intelligibility principle holds that the goal is being understood, and that accent and intelligibility are empirically separable: heavily accented speech can be fully intelligible, and lightly accented speech can be hard to understand. Levis argued, and most researchers since have agreed, that nativeness is an unrealistic target for post-pubertal learners and an inappropriate one for most teaching contexts: intelligibility is the defensible goal, with nativeness relevant only in specific niches (actors, spies, some broadcasters).

This reframing has practical consequences. It licenses teachers to prioritise features with high functional load, tolerate features that do not impede understanding, and stop treating accent itself as a problem to be erased.

The Jenkins Lingua Franca Core

Jennifer Jenkins (2000) took the intelligibility principle a step further in the specific context of English as a lingua franca: non-native speakers talking to other non-native speakers. Working from a corpus of actual misunderstandings, she proposed the Lingua Franca Core (LFC): a minimal set of phonological features teachers should insist on for international intelligibility, with everything else treated as optional local variation.

The core includes most consonants (with British-based /t/ rather than the American flap), aspiration of word-initial /p, t, k/, rhotic syllable-final /r/, no simplification of word-initial consonant clusters, the quantitative contrast between long and short vowels (with pre-fortis clipping), consistent vowel quality, the NURSE vowel, and nuclear (sentence) stress placement to mark information focus. The non-core, explicitly demoted, includes /θ/ and /ð/ (substitutable), weak forms of function words, connected-speech assimilations, lexical (word) stress, pitch movement for attitudinal or grammatical meaning, and stress-timed rhythm.

Critiques have been sharp. Some argue the corpus was too narrow to license a global syllabus, that demoting word stress is pedagogically risky given its role in lexical access, and that the LFC privileges some varieties over others despite its pluralist framing (Dauer, 2005; see also Spicer in the IH Journal). Jenkins has replied that much of the criticism conflates her descriptive claims about ELF interactions with prescriptive claims about teaching. The LFC remains contested but has reshaped how the field talks about pronunciation targets.

Intelligibility, comprehensibility, accent

Derwing and Munro (1995, 2005) separated three listener-side constructs that earlier work had treated as one. Intelligibility is how much of the utterance the listener actually understands (often measured by transcription accuracy). Comprehensibility is how much effort understanding requires (a listener rating). Accent is how different the speech sounds from a reference variety (another listener rating). The three are partially independent: a speaker can be heavily accented yet fully intelligible, or lightly accented yet hard to comprehend. Keeping them apart lets teachers target the first two without treating the third as a deficit, and lets researchers ask which features of L2 speech affect which construct.

Classroom approaches

Day-to-day teaching draws on a standard toolkit mapped out in Pronunciation Teaching Approaches. Minimal-pair work trains perception and production of contrasting phonemes. Shadowing builds prosodic fluency by having learners echo model speech in near-real time. Adrian Underhill's Sound Foundations phonemic chart (Underhill, 2005) gives learners and teachers a physical workspace for the English sound system, with visual dictation and discovery activities that treat pronunciation as a skill to be felt and manipulated rather than memorised. High-variability phonetic training (HVPT), exposing learners to a target contrast produced by many talkers in many contexts, shows medium-to-large effects on perception and smaller but real transfer to production (Thomson, 2018; Sakai and Moorman, 2018). Gilbert's Clear Speech materials push rhythm, stress, and thought groups ahead of segmental drilling, in line with the suprasegmental turn.

Darcy (2018) characterises pronunciation as persistently the "Cinderella" of ELT (under-taught because of time pressure, thin methodology, and curricula that leave it out) and argues that effective instruction has to be explicit, integrated into other skills work, and informed by the intelligibility principle rather than left to osmosis.

References

Celce-Murcia, M., Brinton, D. M., & Goodwin, J. M. (with Griner, B.). (2010). Teaching Pronunciation: A Course Book and Reference Guide (2nd ed.). Cambridge University Press.
Darcy, I. (2018). Powerful and effective pronunciation instruction: How can we achieve it? The CATESOL Journal, 30(1), 13–45.
Dauer, R. M. (2005). The lingua franca core: A new model for pronunciation instruction? TESOL Quarterly, 39(3), 543–550. https://doi.org/10.2307/3588494
Derwing, T. M., & Munro, M. J. (2005). Second language accent and pronunciation teaching: A research-based approach. TESOL Quarterly, 39(3), 379–397. https://doi.org/10.2307/3588486
Derwing, T. M., & Munro, M. J. (2015). Pronunciation Fundamentals: Evidence-based Perspectives for L2 Teaching and Research. John Benjamins.
Gilbert, J. B. (2008). Teaching Pronunciation: Using the Prosody Pyramid. Cambridge University Press.
Jenkins, J. (2000). The Phonology of English as an International Language. Oxford University Press.
Levis, J. M. (2005). Changing contexts and shifting paradigms in pronunciation teaching. TESOL Quarterly, 39(3), 369–377. https://doi.org/10.2307/3588485
Munro, M. J., & Derwing, T. M. (1995). Foreign accent, comprehensibility, and intelligibility in the speech of second language learners. Language Learning, 45(1), 73–97.
Munro, M. J., & Derwing, T. M. (1999). Foreign accent, comprehensibility, and intelligibility in the speech of second language learners. Language Learning, 49(s1), 285–310.
Sakai, M., & Moorman, C. (2018). Can perception training improve the production of second language phonemes? A meta-analytic review of 25 years of perception training research. Applied Psycholinguistics, 39(1), 187–224.
Thomson, R. I. (2018). High variability [pronunciation] training (HVPT): A proven technique about which every language teacher and learner ought to know. Journal of Second Language Pronunciation, 4(2), 208–231.
Underhill, A. (2005). Sound Foundations: Learning and Teaching Pronunciation (2nd ed.). Macmillan.

Related Terms