Report for: A machine learning approach for predicting methionine oxidation sites

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	A machine learning approach for predicting methionine oxidation sites
Published in	BMC Bioinformatics, September 2017
DOI	10.1186/s12859-017-1848-9
Pubmed ID	28962549
Authors	Juan C. Aledo, Francisco R. Cantón, Francisco J. Veredas
Abstract	The oxidation of protein-bound methionine to form methionine sulfoxide, has traditionally been regarded as an oxidative damage. However, recent evidences support the view of this reversible reaction as a regulatory post-translational modification. The perception that methionine sulfoxidation may provide a mechanism to the redox regulation of a wide range of cellular processes, has stimulated some proteomic studies. However, these experimental approaches are expensive and time-consuming. Therefore, computational methods designed to predict methionine oxidation sites are an attractive alternative. As a first approach to this matter, we have developed models based on random forests, support vector machines and neural networks, aimed at accurate prediction of sites of methionine oxidation. Starting from published proteomic data regarding oxidized methionines, we created a hand-curated dataset formed by 113 unique polypeptides of known structure, containing 975 methionyl residues, 122 of which were oxidation-prone (positive dataset) and 853 were oxidation-resistant (negative dataset). We use a machine learning approach to generate predictive models from these datasets. Among the multiple features used in the classification task, some of them contributed substantially to the performance of the predictive models. Thus, (i) the solvent accessible area of the methionine residue, (ii) the number of residues between the analyzed methionine and the next methionine found towards the N-terminus and (iii) the spatial distance between the atom of sulfur from the analyzed methionine and the closest aromatic residue, were among the most relevant features. Compared to the other classifiers we also evaluated, random forests provided the best performance, with accuracy, sensitivity and specificity of 0.7468±0.0567, 0.6817±0.0982 and 0.7557±0.0721, respectively (mean ± standard deviation). We present the first predictive models aimed to computationally detect methionine sites that may become oxidized in vivo in response to oxidative signals. These models provide insights into the structural context in which a methionine residue become either oxidation-resistant or oxidation-prone. Furthermore, these models should be useful in prioritizing methinonyl residues for further studies to determine their potential as regulatory post-translational modification sites.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profiles of 4 X users who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
United States	1	25%
Unknown	3	75%

Demographic breakdown

Type	Count	As %
Members of the public	2	50%
Scientists	2	50%

Mendeley readers

The data shown below were compiled from readership statistics for 44 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
Unknown	44	100%

Demographic breakdown

Readers by professional status	Count	As %
Student > Ph. D. Student	7	16%
Student > Master	6	14%
Researcher	6	14%
Student > Bachelor	5	11%
Lecturer	3	7%
Other	7	16%
Unknown	10	23%

Readers by discipline	Count	As %
Biochemistry, Genetics and Molecular Biology	12	27%
Agricultural and Biological Sciences	7	16%
Computer Science	4	9%
Chemical Engineering	2	5%
Medicine and Dentistry	2	5%
Other	6	14%
Unknown	11	25%

Attention Score in Context

This research output has an Altmetric Attention Score of 1. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 15 October 2017.

All research outputs

#15,480,316

of 23,003,906 outputs

Outputs from BMC Bioinformatics

#5,394

of 7,312 outputs

Outputs of similar age

#200,975

of 321,103 outputs

Outputs of similar age from BMC Bioinformatics

#68

of 100 outputs

Altmetric has tracked 23,003,906 research outputs across all sources so far. This one is in the 22nd percentile – i.e., 22% of other outputs scored the same or lower than it.

So far Altmetric has tracked 7,312 research outputs from this source. They typically receive a little more attention than average, with a mean Attention Score of 5.4. This one is in the 18th percentile – i.e., 18% of its peers scored the same or lower than it.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 321,103 tracked outputs that were published within six weeks on either side of this one in any source. This one is in the 28th percentile – i.e., 28% of its contemporaries scored the same or lower than it.

We're also able to compare this research output to 100 others from the same source and published within six weeks on either side of this one. This one is in the 25th percentile – i.e., 25% of its contemporaries scored the same or lower than it.

A machine learning approach for predicting methionine oxidation sites

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown

Attention Score in Context