↓ Skip to main content

Is automatic speech-to-text transcription ready for use in psychological experiments?

Overview of attention for article published in Behavior Research Methods, April 2018
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (83rd percentile)
  • High Attention Score compared to outputs of the same age and source (85th percentile)

Mentioned by

blogs
1 blog
twitter
10 X users

Citations

dimensions_citation
22 Dimensions

Readers on

mendeley
67 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Is automatic speech-to-text transcription ready for use in psychological experiments?
Published in
Behavior Research Methods, April 2018
DOI 10.3758/s13428-018-1037-4
Pubmed ID
Authors

Kirsten Ziman, Andrew C. Heusser, Paxton C. Fitzpatrick, Campbell E. Field, Jeremy R. Manning

Abstract

Verbal responses are a convenient and naturalistic way for participants to provide data in psychological experiments (Salzinger, The Journal of General Psychology, 61(1),65-94:1959). However, audio recordings of verbal responses typically require additional processing, such as transcribing the recordings into text, as compared with other behavioral response modalities (e.g., typed responses, button presses, etc.). Further, the transcription process is often tedious and time-intensive, requiring human listeners to manually examine each moment of recorded speech. Here we evaluate the performance of a state-of-the-art speech recognition algorithm (Halpern et al., 2016) in transcribing audio data into text during a list-learning experiment. We compare transcripts made by human annotators to the computer-generated transcripts. Both sets of transcripts matched to a high degree and exhibited similar statistical properties, in terms of the participants' recall performance and recall dynamics that the transcripts captured. This proof-of-concept study suggests that speech-to-text engines could provide a cheap, reliable, and rapid means of automatically transcribing speech data in psychological experiments. Further, our findings open the door for verbal response experiments that scale to thousands of participants (e.g., administered online), as well as a new generation of experiments that decode speech on the fly and adapt experimental parameters based on participants' prior responses.

X Demographics

X Demographics

The data shown below were collected from the profiles of 10 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 67 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 67 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 14 21%
Researcher 7 10%
Student > Master 6 9%
Lecturer 5 7%
Student > Bachelor 4 6%
Other 14 21%
Unknown 17 25%
Readers by discipline Count As %
Psychology 17 25%
Computer Science 9 13%
Unspecified 3 4%
Engineering 3 4%
Nursing and Health Professions 2 3%
Other 13 19%
Unknown 20 30%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 13. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 08 December 2018.
All research outputs
#2,811,788
of 25,382,440 outputs
Outputs from Behavior Research Methods
#330
of 2,526 outputs
Outputs of similar age
#56,749
of 340,059 outputs
Outputs of similar age from Behavior Research Methods
#4
of 27 outputs
Altmetric has tracked 25,382,440 research outputs across all sources so far. Compared to these this one has done well and is in the 88th percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 2,526 research outputs from this source. They typically receive more attention than average, with a mean Attention Score of 8.2. This one has done well, scoring higher than 86% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 340,059 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 83% of its contemporaries.
We're also able to compare this research output to 27 others from the same source and published within six weeks on either side of this one. This one has done well, scoring higher than 85% of its contemporaries.