↓ Skip to main content

How Many Words Do We Know? Practical Estimates of Vocabulary Size Dependent on Word Definition, the Degree of Language Input and the Participant’s Age

Overview of attention for article published in Frontiers in Psychology, July 2016
Altmetric Badge

About this Attention Score

  • In the top 5% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (99th percentile)
  • High Attention Score compared to outputs of the same age and source (99th percentile)

Mentioned by

news
50 news outlets
blogs
7 blogs
twitter
76 X users
facebook
6 Facebook pages
reddit
1 Redditor
video
1 YouTube creator

Readers on

mendeley
243 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
How Many Words Do We Know? Practical Estimates of Vocabulary Size Dependent on Word Definition, the Degree of Language Input and the Participant’s Age
Published in
Frontiers in Psychology, July 2016
DOI 10.3389/fpsyg.2016.01116
Pubmed ID
Authors

Marc Brysbaert, Michaël Stevens, Paweł Mandera, Emmanuel Keuleers

Abstract

Based on an analysis of the literature and a large scale crowdsourcing experiment, we estimate that an average 20-year-old native speaker of American English knows 42,000 lemmas and 4,200 non-transparent multiword expressions, derived from 11,100 word families. The numbers range from 27,000 lemmas for the lowest 5% to 52,000 for the highest 5%. Between the ages of 20 and 60, the average person learns 6,000 extra lemmas or about one new lemma every 2 days. The knowledge of the words can be as shallow as knowing that the word exists. In addition, people learn tens of thousands of inflected forms and proper nouns (names), which account for the substantially high numbers of 'words known' mentioned in other publications.

X Demographics

X Demographics

The data shown below were collected from the profiles of 76 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 243 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 3 1%
Colombia 1 <1%
Unknown 239 98%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 47 19%
Student > Master 24 10%
Researcher 22 9%
Student > Bachelor 21 9%
Student > Doctoral Student 20 8%
Other 56 23%
Unknown 53 22%
Readers by discipline Count As %
Psychology 57 23%
Linguistics 48 20%
Social Sciences 18 7%
Computer Science 9 4%
Arts and Humanities 8 3%
Other 32 13%
Unknown 71 29%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 487. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 19 January 2024.
All research outputs
#55,271
of 25,744,802 outputs
Outputs from Frontiers in Psychology
#93
of 34,772 outputs
Outputs of similar age
#1,129
of 381,760 outputs
Outputs of similar age from Frontiers in Psychology
#1
of 394 outputs
Altmetric has tracked 25,744,802 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 99th percentile: it's in the top 5% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 34,772 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 13.4. This one has done particularly well, scoring higher than 99% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 381,760 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 99% of its contemporaries.
We're also able to compare this research output to 394 others from the same source and published within six weeks on either side of this one. This one has done particularly well, scoring higher than 99% of its contemporaries.