Report for: Building a multi-scaled geospatial temporal ecology database from disparate data sources: fostering open science and data reuse

You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.

Title	Building a multi-scaled geospatial temporal ecology database from disparate data sources: fostering open science and data reuse
Published in	Giga Science, July 2015
DOI	10.1186/s13742-015-0067-4
Pubmed ID	26140212
Authors	Patricia A. Soranno, Edward G. Bissell, Kendra S. Cheruvelil, Samuel T. Christel, Sarah M. Collins, C. Emi Fergus, Christopher T. Filstrup, Jean-Francois Lapierre, Noah R. Lottig, Samantha K. Oliver, Caren E. Scott, Nicole J. Smith, Scott Stopyak, Shuai Yuan, Mary Tate Bremigan, John A. Downing, Corinna Gries, Emily N. Henry, Nick K. Skaff, Emily H. Stanley, Craig A. Stow, Pang-Ning Tan, Tyler Wagner, Katherine E. Webster
Abstract	Although there are considerable site-based data for individual or groups of ecosystems, these datasets are widely scattered, have different data formats and conventions, and often have limited accessibility. At the broader scale, national datasets exist for a large number of geospatial features of land, water, and air that are needed to fully understand variation among these ecosystems. However, such datasets originate from different sources and have different spatial and temporal resolutions. By taking an open-science perspective and by combining site-based ecosystem datasets and national geospatial datasets, science gains the ability to ask important research questions related to grand environmental challenges that operate at broad scales. Documentation of such complicated database integration efforts, through peer-reviewed papers, is recommended to foster reproducibility and future use of the integrated database. Here, we describe the major steps, challenges, and considerations in building an integrated database of lake ecosystems, called LAGOS (LAke multi-scaled GeOSpatial and temporal database), that was developed at the sub-continental study extent of 17 US states (1,800,000 km(2)). LAGOS includes two modules: LAGOSGEO, with geospatial data on every lake with surface area larger than 4 ha in the study extent (~50,000 lakes), including climate, atmospheric deposition, land use/cover, hydrology, geology, and topography measured across a range of spatial and temporal extents; and LAGOSLIMNO, with lake water quality data compiled from ~100 individual datasets for a subset of lakes in the study extent (~10,000 lakes). Procedures for the integration of datasets included: creating a flexible database design; authoring and integrating metadata; documenting data provenance; quantifying spatial measures of geographic data; quality-controlling integrated and derived data; and extensively documenting the database. Our procedures make a large, complex, and integrated database reproducible and extensible, allowing users to ask new research questions with the existing database or through the addition of new data. The largest challenge of this task was the heterogeneity of the data, formats, and metadata. Many steps of data integration need manual input from experts in diverse fields, requiring close collaboration.

View on publisher site Alert me about new mentions

X Demographics

The data shown below were collected from the profiles of 20 X users who shared this research output. Click here to find out more about how the information was compiled.

Geographical breakdown

Country	Count	As %
United States	5	25%
United Kingdom	2	10%
Colombia	1	5%
New Zealand	1	5%
India	1	5%
Brazil	1	5%
Ecuador	1	5%
Italy	1	5%
Hong Kong	1	5%
Other	0	0%
Unknown	6	30%

Demographic breakdown

Type	Count	As %
Members of the public	13	65%
Scientists	6	30%
Science communicators (journalists, bloggers, editors)	1	5%

Mendeley readers

The data shown below were compiled from readership statistics for 181 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country	Count	As %
United States	2	1%
Brazil	2	1%
Canada	1	<1%
Portugal	1	<1%
Unknown	175	97%

Demographic breakdown

Readers by professional status	Count	As %
Researcher	41	23%
Student > Ph. D. Student	32	18%
Student > Master	21	12%
Student > Bachelor	14	8%
Professor > Associate Professor	13	7%
Other	38	21%
Unknown	22	12%

Readers by discipline	Count	As %
Environmental Science	42	23%
Agricultural and Biological Sciences	35	19%
Computer Science	16	9%
Earth and Planetary Sciences	14	8%
Engineering	8	4%
Other	24	13%
Unknown	42	23%

Attention Score in Context

This research output has an Altmetric Attention Score of 23. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 05 May 2021.

All research outputs

#1,622,906

of 25,411,814 outputs

Outputs from Giga Science

#292

of 1,168 outputs

Outputs of similar age

#19,833

of 277,639 outputs

Outputs of similar age from Giga Science

of 13 outputs

Altmetric has tracked 25,411,814 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 93rd percentile: it's in the top 10% of all research outputs ever tracked by Altmetric.

So far Altmetric has tracked 1,168 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 21.8. This one has done well, scoring higher than 75% of its peers.

Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 277,639 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 92% of its contemporaries.

We're also able to compare this research output to 13 others from the same source and published within six weeks on either side of this one. This one has gotten more attention than average, scoring higher than 61% of its contemporaries.

Building a multi-scaled geospatial temporal ecology database from disparate data sources: fostering open science and data reuse

About this Attention Score

Mentioned by

Citations

Readers on

X Demographics

Geographical breakdown

Demographic breakdown

Mendeley readers

Geographical breakdown

Demographic breakdown

Attention Score in Context