↓ Skip to main content

BiC2PAM: constraint-guided biclustering for biological data analysis with domain knowledge

Overview of attention for article published in Algorithms for Molecular Biology, September 2016
Altmetric Badge

Citations

dimensions_citation
24 Dimensions

Readers on

mendeley
15 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
BiC2PAM: constraint-guided biclustering for biological data analysis with domain knowledge
Published in
Algorithms for Molecular Biology, September 2016
DOI 10.1186/s13015-016-0085-5
Pubmed ID
Authors

Rui Henriques, Sara C. Madeira

Abstract

Biclustering has been largely used in biological data analysis, enabling the discovery of putative functional modules from omic and network data. Despite the recognized importance of incorporating domain knowledge to guide biclustering and guarantee a focus on relevant and non-trivial biclusters, this possibility has not yet been comprehensively addressed. This results from the fact that the majority of existing algorithms are only able to deliver sub-optimal solutions with restrictive assumptions on the structure, coherency and quality of biclustering solutions, thus preventing the up-front satisfaction of knowledge-driven constraints. Interestingly, in recent years, a clearer understanding of the synergies between pattern mining and biclustering gave rise to a new class of algorithms, termed as pattern-based biclustering algorithms. These algorithms, able to efficiently discover flexible biclustering solutions with optimality guarantees, are thus positioned as good candidates for knowledge incorporation. In this context, this work aims to bridge the current lack of solid views on the use of background knowledge to guide (pattern-based) biclustering tasks. This work extends (pattern-based) biclustering algorithms to guarantee the satisfiability of constraints derived from background knowledge and to effectively explore efficiency gains from their incorporation. In this context, we first show the relevance of constraints with succinct, (anti-)monotone and convertible properties for the analysis of expression data and biological networks. We further show how pattern-based biclustering algorithms can be adapted to effectively prune of the search space in the presence of such constraints, as well as be guided in the presence of biological annotations. Relying on these contributions, we propose BiClustering with Constraints using PAttern Mining (BiC2PAM), an extension of BicPAM and BicNET biclustering algorithms. Experimental results on biological data demonstrate the importance of incorporating knowledge within biclustering to foster efficiency and enable the discovery of non-trivial biclusters with heightened biological relevance. This work provides the first comprehensive view and sound algorithm for biclustering biological data with constraints derived from user expectations, knowledge repositories and/or literature.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 15 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Germany 1 7%
Unknown 14 93%

Demographic breakdown

Readers by professional status Count As %
Student > Master 2 13%
Professor > Associate Professor 2 13%
Student > Ph. D. Student 2 13%
Researcher 2 13%
Lecturer > Senior Lecturer 1 7%
Other 1 7%
Unknown 5 33%
Readers by discipline Count As %
Computer Science 7 47%
Arts and Humanities 1 7%
Biochemistry, Genetics and Molecular Biology 1 7%
Psychology 1 7%
Unknown 5 33%