ELTiverse

Search Terms

Search for ELT terms and concepts

Lexical Density

Language Analysislexical density

Lexical density is the ratio of content words (nouns, main verbs, adjectives, adverbs) to the total number of words in a text. It was first proposed by Ure (1971) and developed by Halliday (1985) as a key measure distinguishing written from spoken language.

Calculation

The basic formula: LD = (number of content words / total words) x 100

A spoken conversation might have a lexical density of 35-45%, while an academic journal article might reach 55-65%. The difference is not random — it reflects fundamental differences in how meaning is packaged in speech versus writing.

Written vs Spoken Language

Halliday (1985) identified a complementary relationship:

FeatureWritten languageSpoken language
Lexical densityHigh — meaning packed into content wordsLow — meaning spread across more function words
Grammatical intricacyLow — fewer clauses per sentenceHigh — more clause chaining and embedding

Written language says: The rapid industrialisation of previously agricultural regions... Spoken language says: These regions used to be agricultural but then they industrialised really quickly and...

The written version is lexically dense (many content words per clause) but grammatically simple (one clause). The spoken version is lexically sparse but grammatically intricate (multiple clauses). Both convey similar content through different packaging strategies.

Implications for Teaching

  • Text difficulty — lexical density is a reliable predictor of reading difficulty. Academic texts are hard partly because they pack so much meaning into so few words. Awareness of this helps teachers select and scaffold texts appropriately.
  • Writing development — as learners develop academic writing skills, their lexical density typically increases. They learn to nominalize (turning verbs and adjectives into nouns: They decided → The decision), which compresses meaning and raises density.
  • Register awareness — teaching learners to recognize that spoken and written registers package meaning differently helps them avoid two common problems: writing that sounds like speech (too low density, too many clauses) and speech that sounds like writing (unnaturally dense, over-nominalized).
  • Assessment — lexical density can be a useful diagnostic in writing analysis. Extremely low density may indicate the writer is padding with function words; extremely high density may indicate the text is impenetrable.

Limitations

Lexical density alone does not determine quality. A text can be dense but incoherent, or sparse but perfectly effective for its context. It is one dimension of register variation, best interpreted alongside other measures like grammatical intricacy, cohesive density, and clause complexity.

Related Terms