↓ Skip to main content

A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration

Overview of attention for article published in BMC Bioinformatics, April 2018
Altmetric Badge

Mentioned by

twitter
1 X user

Citations

dimensions_citation
18 Dimensions

Readers on

mendeley
70 Mendeley
citeulike
1 CiteULike
Title
A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration
Published in
BMC Bioinformatics, April 2018
DOI 10.1186/s12859-018-2123-4
Pubmed ID
Authors

Pathima Nusrath Hameed, Karin Verspoor, Snezana Kusljic, Saman Halgamuge

Abstract

Drug repositioning is the process of identifying new uses for existing drugs. Computational drug repositioning methods can reduce the time, costs and risks of drug development by automating the analysis of the relationships in pharmacology networks. Pharmacology networks are large and heterogeneous. Clustering drugs into small groups can simplify large pharmacology networks, these subgroups can also be used as a starting point for repositioning drugs. In this paper, we propose a two-tiered drug-centric unsupervised clustering approach for drug repositioning, integrating heterogeneous drug data profiles: drug-chemical, drug-disease, drug-gene, drug-protein and drug-side effect relationships. The proposed drug repositioning approach is threefold; (i) clustering drugs based on their homogeneous profiles using the Growing Self Organizing Map (GSOM); (ii) clustering drugs based on drug-drug relation matrices based on the previous step, considering three state-of-the-art graph clustering methods; and (iii) inferring drug repositioning candidates and assigning a confidence value for each identified candidate. In this paper, we compare our two-tiered clustering approach against two existing heterogeneous data integration approaches with reference to the Anatomical Therapeutic Chemical (ATC) classification, using GSOM. Our approach yields Normalized Mutual Information (NMI) and Standardized Mutual Information (SMI) of 0.66 and 36.11, respectively, while the two existing methods yield NMI of 0.60 and 0.64 and SMI of 22.26 and 33.59. Moreover, the two existing approaches failed to produce useful cluster separations when using graph clustering algorithms while our approach is able to identify useful clusters for drug repositioning. Furthermore, we provide clinical evidence for four predicted results (Chlorthalidone, Indomethacin, Metformin and Thioridazine) to support that our proposed approach can be reliably used to infer ATC code and drug repositioning. The proposed two-tiered unsupervised clustering approach is suitable for drug clustering and enables heterogeneous data integration. It also enables identifying reliable repositioning drug candidates with reference to ATC therapeutic classification. The repositioning drug candidates identified consistently by multiple clustering algorithms and with high confidence have a higher possibility of being effective repositioning candidates.

X Demographics

X Demographics

The data shown below were collected from the profile of 1 X user who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 70 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 70 100%

Demographic breakdown

Readers by professional status Count As %
Student > Master 16 23%
Student > Ph. D. Student 16 23%
Researcher 7 10%
Professor 4 6%
Student > Bachelor 3 4%
Other 8 11%
Unknown 16 23%
Readers by discipline Count As %
Computer Science 18 26%
Pharmacology, Toxicology and Pharmaceutical Science 8 11%
Biochemistry, Genetics and Molecular Biology 6 9%
Medicine and Dentistry 4 6%
Engineering 4 6%
Other 8 11%
Unknown 22 31%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 16 April 2018.
All research outputs
#18,836,331
of 23,344,526 outputs
Outputs from BMC Bioinformatics
#6,430
of 7,387 outputs
Outputs of similar age
#256,592
of 329,993 outputs
Outputs of similar age from BMC Bioinformatics
#80
of 106 outputs
Altmetric has tracked 23,344,526 research outputs across all sources so far. This one is in the 11th percentile – i.e., 11% of other outputs scored the same or lower than it.
So far Altmetric has tracked 7,387 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.5. This one is in the 5th percentile – i.e., 5% of its peers scored the same or lower than it.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 329,993 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 11th percentile – i.e., 11% of its contemporaries scored the same or lower than it.
We're also able to compare this research output to 106 others from the same source and published within six weeks on either side of this one. This one is in the 16th percentile – i.e., 16% of its contemporaries scored the same or lower than it.