↓ Skip to main content

Analyzing the field of bioinformatics with the multi-faceted topic modeling technique

Overview of attention for article published in BMC Bioinformatics, May 2017
Altmetric Badge

Mentioned by

twitter
1 X user

Citations

dimensions_citation
18 Dimensions

Readers on

mendeley
71 Mendeley
Title
Analyzing the field of bioinformatics with the multi-faceted topic modeling technique
Published in
BMC Bioinformatics, May 2017
DOI 10.1186/s12859-017-1640-x
Pubmed ID
Authors

Go Eun Heo, Keun Young Kang, Min Song, Jeong-Hoon Lee

Abstract

Bioinformatics is an interdisciplinary field at the intersection of molecular biology and computing technology. To characterize the field as convergent domain, researchers have used bibliometrics, augmented with text-mining techniques for content analysis. In previous studies, Latent Dirichlet Allocation (LDA) was the most representative topic modeling technique for identifying topic structure of subject areas. However, as opposed to revealing the topic structure in relation to metadata such as authors, publication date, and journals, LDA only displays the simple topic structure. In this paper, we adopt the Tang et al.'s Author-Conference-Topic (ACT) model to study the field of bioinformatics from the perspective of keyphrases, authors, and journals. The ACT model is capable of incorporating the paper, author, and conference into the topic distribution simultaneously. To obtain more meaningful results, we use journals and keyphrases instead of conferences and bag-of-words.. For analysis, we use PubMed to collected forty-six bioinformatics journals from the MEDLINE database. We conducted time series topic analysis over four periods from 1996 to 2015 to further examine the interdisciplinary nature of bioinformatics. We analyze the ACT Model results in each period. Additionally, for further integrated analysis, we conduct a time series analysis among the top-ranked keyphrases, journals, and authors according to their frequency. We also examine the patterns in the top journals by simultaneously identifying the topical probability in each period, as well as the top authors and keyphrases. The results indicate that in recent years diversified topics have become more prevalent and convergent topics have become more clearly represented. The results of our analysis implies that overtime the field of bioinformatics becomes more interdisciplinary where there is a steady increase in peripheral fields such as conceptual, mathematical, and system biology. These results are confirmed by integrated analysis of topic distribution as well as top ranked keyphrases, authors, and journals.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 71 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Mexico 1 1%
Unknown 70 99%

Demographic breakdown

Readers by professional status Count As %
Researcher 11 15%
Student > Master 11 15%
Student > Ph. D. Student 7 10%
Librarian 5 7%
Student > Bachelor 4 6%
Other 13 18%
Unknown 20 28%
Readers by discipline Count As %
Computer Science 15 21%
Agricultural and Biological Sciences 8 11%
Social Sciences 8 11%
Biochemistry, Genetics and Molecular Biology 3 4%
Business, Management and Accounting 3 4%
Other 13 18%
Unknown 21 30%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 07 June 2017.
All research outputs
#15,464,404
of 22,979,862 outputs
Outputs from BMC Bioinformatics
#5,391
of 7,308 outputs
Outputs of similar age
#198,815
of 316,429 outputs
Outputs of similar age from BMC Bioinformatics
#77
of 108 outputs
Altmetric has tracked 22,979,862 research outputs across all sources so far. This one is in the 22nd percentile – i.e., 22% of other outputs scored the same or lower than it.
So far Altmetric has tracked 7,308 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 18th percentile – i.e., 18% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 316,429 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 28th percentile – i.e., 28% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 108 others from the same source and published within six weeks on either side of this one. This one is in the 26th percentile – i.e., 26% of its contemporaries scored the same or lower than it.