↓ Skip to main content

The GAAIN Entity Mapper: An Active-Learning System for Medical Data Mapping

Overview of attention for article published in Frontiers in Neuroinformatics, January 2016
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (85th percentile)
  • Good Attention Score compared to outputs of the same age and source (70th percentile)

Mentioned by

news
1 news outlet
twitter
2 X users

Citations

dimensions_citation
4 Dimensions

Readers on

mendeley
42 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
The GAAIN Entity Mapper: An Active-Learning System for Medical Data Mapping
Published in
Frontiers in Neuroinformatics, January 2016
DOI 10.3389/fninf.2015.00030
Pubmed ID
Authors

Naveen Ashish, Peehoo Dewan, Arthur W. Toga

Abstract

This work is focused on mapping biomedical datasets to a common representation, as an integral part of data harmonization for integrated biomedical data access and sharing. We present GEM, an intelligent software assistant for automated data mapping across different datasets or from a dataset to a common data model. The GEM system automates data mapping by providing precise suggestions for data element mappings. It leverages the detailed metadata about elements in associated dataset documentation such as data dictionaries that are typically available with biomedical datasets. It employs unsupervised text mining techniques to determine similarity between data elements and also employs machine-learning classifiers to identify element matches. It further provides an active-learning capability where the process of training the GEM system is optimized. Our experimental evaluations show that the GEM system provides highly accurate data mappings (over 90% accuracy) for real datasets of thousands of data elements each, in the Alzheimer's disease research domain. Further, the effort in training the system for new datasets is also optimized. We are currently employing the GEM system to map Alzheimer's disease datasets from around the globe into a common representation, as part of a global Alzheimer's disease integrated data sharing and analysis network called GAAIN. GEM achieves significantly higher data mapping accuracy for biomedical datasets compared to other state-of-the-art tools for database schema matching that have similar functionality. With the use of active-learning capabilities, the user effort in training the system is minimal.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 42 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 42 100%

Demographic breakdown

Readers by professional status Count As %
Student > Master 7 17%
Student > Bachelor 7 17%
Professor 5 12%
Student > Ph. D. Student 5 12%
Researcher 4 10%
Other 5 12%
Unknown 9 21%
Readers by discipline Count As %
Computer Science 7 17%
Medicine and Dentistry 4 10%
Engineering 4 10%
Agricultural and Biological Sciences 4 10%
Neuroscience 4 10%
Other 10 24%
Unknown 9 21%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 10. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 29 January 2016.
All research outputs
#3,070,436
of 22,837,982 outputs
Outputs from Frontiers in Neuroinformatics
#168
of 749 outputs
Outputs of similar age
#55,951
of 395,522 outputs
Outputs of similar age from Frontiers in Neuroinformatics
#3
of 10 outputs
Altmetric has tracked 22,837,982 research outputs across all sources so far. Compared to these this one has done well and is in the 86th percentile: it's in the top 25% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 749 research outputs from this source. They typically receive more attention than average, with a mean Attention Score of 8.3. This one has done well, scoring higher than 76% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 395,522 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 85% of its contemporaries.
We're also able to compare this research output to 10 others from the same source and published within six weeks on either side of this one. This one has scored higher than 7 of them.