On the effect of dropping layers of pre-trained transformer models

Overview of attention for article published in Computer Speech & Language, January 2023

About this Attention Score

  • In the top 5% of all research outputs scored by Altmetric
  • One of the highest-scoring outputs from this source (#1 of 431)
  • High Attention Score compared to outputs of the same age (95th percentile)

Mentioned by

  • 1 news outlet
  • 1 blog
  • 59 X users
  • 1 Redditor

Citations

  • 24 Dimensions

Readers on

  • 59 Mendeley