Title | How Many Words Do We Know? Practical Estimates of Vocabulary Size Dependent on Word Definition, the Degree of Language Input and the Participant’s Age |
---|---|
Published in | Frontiers in Psychology, July 2016 |
DOI | 10.3389/fpsyg.2016.01116 |
Pubmed ID | |
Authors | Marc Brysbaert, Michaël Stevens, Paweł Mandera, Emmanuel Keuleers |
Abstract | Based on an analysis of the literature and a large scale crowdsourcing experiment, we estimate that an average 20-year-old native speaker of American English knows 42,000 lemmas and 4,200 non-transparent multiword expressions, derived from 11,100 word families. The numbers range from 27,000 lemmas for the lowest 5% to 52,000 for the highest 5%. Between the ages of 20 and 60, the average person learns 6,000 extra lemmas or about one new lemma every 2 days. The knowledge of the words can be as shallow as knowing that the word exists. In addition, people learn tens of thousands of inflected forms and proper nouns (names), which account for the substantially high numbers of 'words known' mentioned in other publications. |
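
The abstract's figure of "about one new lemma every 2 days" can be checked with a short calculation. The sketch below is illustrative only and is not taken from the paper: the function name and the 365.25-day year are assumptions, while the 6,000 lemmas and the age range of 20 to 60 come from the abstract.

```python
# Figures quoted in the abstract.
EXTRA_LEMMAS = 6_000
START_AGE = 20
END_AGE = 60

# Assumption (not from the paper): average year length including leap years.
DAYS_PER_YEAR = 365.25


def days_per_new_lemma(extra_lemmas: int, start_age: int, end_age: int) -> float:
    """Average number of days between newly learned lemmas."""
    total_days = (end_age - start_age) * DAYS_PER_YEAR
    return total_days / extra_lemmas


rate = days_per_new_lemma(EXTRA_LEMMAS, START_AGE, END_AGE)
print(f"One new lemma every {rate:.1f} days")
```

Under these assumptions the rate comes out at roughly 2.4 days per lemma, consistent with the abstract's rounded "every 2 days".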
X Demographics
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 8 | 11% |
Japan | 7 | 9% |
United Kingdom | 4 | 5% |
Netherlands | 3 | 4% |
Germany | 2 | 3% |
Switzerland | 2 | 3% |
Brazil | 1 | 1% |
China | 1 | 1% |
Iceland | 1 | 1% |
Other | 9 | 12% |
Unknown | 38 | 50% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Members of the public | 61 | 80% |
Scientists | 12 | 16% |
Practitioners (doctors, other healthcare professionals) | 2 | 3% |
Science communicators (journalists, bloggers, editors) | 1 | 1% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 3 | 1% |
Colombia | 1 | <1% |
Unknown | 239 | 98% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Ph. D. Student | 47 | 19% |
Student > Master | 24 | 10% |
Researcher | 22 | 9% |
Student > Bachelor | 21 | 9% |
Student > Doctoral Student | 20 | 8% |
Other | 56 | 23% |
Unknown | 53 | 22% |

Readers by discipline | Count | As % |
---|---|---|
Psychology | 57 | 23% |
Linguistics | 48 | 20% |
Social Sciences | 18 | 7% |
Computer Science | 9 | 4% |
Arts and Humanities | 8 | 3% |
Other | 32 | 13% |
Unknown | 71 | 29% |