Analytical guidelines to increase the value of citizen science data: using eBird data to estimate species occurrence

Mapping Intimacies ◽

10.1101/574392 ◽

2019 ◽

Cited By ~ 8

Author(s):

A Johnston ◽

WM Hochachka ◽

ME Strimas-Mackey ◽

V Ruiz Gutierrez ◽

OJ Robinson ◽

...

Keyword(s):

Data Processing ◽

Citizen Science ◽

Model Performance ◽

Species Distributions ◽

Ecological Knowledge ◽

Sample Sizes ◽

Science Data ◽

Science Projects ◽

Wide Range ◽

The Impact

Abstract. Citizen science data are valuable for addressing a wide range of ecological research questions, and there has been a rapid increase in the scope and volume of data available. However, data from large-scale citizen science projects typically present a number of challenges that can inhibit robust ecological inferences. These challenges include: species bias, spatial bias, and variation in effort. To demonstrate addressing key challenges in analysing citizen science data, we use the example of estimating species distributions with data from eBird, a large semi-structured citizen science project. We estimate two widely applied metrics of species distributions: encounter rate and occupancy probability. For each metric, we assess the impact of data processing steps that either degrade or refine the data used in the analyses. We also test whether differences in model performance are maintained at different sample sizes. Model performance improved when data processing and analytical methods addressed the challenges arising from citizen science data. The largest gains in model performance were achieved with: 1) the use of complete checklists (where observers report all the species they detect and identify); and 2) the use of covariates describing variation in effort and detectability for each checklist. Occupancy models were more robust to a lack of complete checklists and effort variables. Improvements in model performance with data refinement were more evident with larger sample sizes. Here, we describe processes to refine semi-structured citizen science data to estimate species distributions. We demonstrate the value of complete checklists, which can inform the design and adaptation of citizen science projects. We also demonstrate the value of information on effort.
The methods we have outlined are also likely to improve other forms of inference, and will enable researchers to conduct robust analyses and harness the vast ecological knowledge that exists within citizen science data.
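
As a rough illustration of the first refinement step named in the abstract above (restricting the analysis to complete checklists), the following sketch computes a naive encounter rate. It is not the authors' model: the `Checklist` class, its field names, and the `naive_encounter_rate` function are hypothetical, and the paper's analyses additionally use effort and detectability covariates that this simple proportion ignores.

```python
from dataclasses import dataclass, field


@dataclass
class Checklist:
    """Hypothetical record of one checklist (field names are assumptions)."""
    is_complete: bool          # observer reported every species detected
    duration_minutes: float    # one possible effort covariate (unused here)
    species: set = field(default_factory=set)


def naive_encounter_rate(checklists, species):
    """Share of complete checklists on which `species` was reported.

    Incomplete checklists are dropped because a missing species on them
    cannot be read as a non-detection.
    """
    complete = [c for c in checklists if c.is_complete]
    if not complete:
        raise ValueError("no complete checklists to analyse")
    detections = sum(1 for c in complete if species in c.species)
    return detections / len(complete)


# Invented example data, for illustration only.
example = [
    Checklist(True, 30.0, {"wood thrush", "blue jay"}),
    Checklist(True, 60.0, {"blue jay"}),
    Checklist(False, 10.0, {"wood thrush"}),  # excluded: incomplete
]
rate = naive_encounter_rate(example, "wood thrush")
```

With the invented data above, only the two complete checklists are counted, so the incomplete third record does not inflate the estimate.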


The impact of data quality filtering of opportunistic citizen science data on species distribution model performance

Ecological Modelling ◽

10.1016/j.ecolmodel.2021.109453 ◽

2021 ◽

Vol 444 ◽

pp. 109453

Author(s):

Camille Van Eupen ◽

Dirk Maes ◽

Marc Herremans ◽

Kristijn R.R. Swinnen ◽

Ben Somers ◽

...

Keyword(s):

Data Quality ◽

Citizen Science ◽

Species Distribution ◽

Species Distribution Model ◽

Model Performance ◽

Distribution Model ◽

Science Data ◽

Quality Filtering ◽

The Impact


Integrating citizen science data with expert surveys increases accuracy and spatial extent of species distribution models

10.1101/806547 ◽

2019 ◽

Cited By ~ 2

Author(s):

O.J. Robinson ◽

V. Ruiz-Gutierrez ◽

M.D. Reynolds ◽

G.H. Golet ◽

M. Strimas-Mackey ◽

...

Keyword(s):

Survey Data ◽

Citizen Science ◽

Species Distribution ◽

Central Valley ◽

Habitat Associations ◽

Biological Information ◽

Ecological Knowledge ◽

Science Data ◽

Trade Offs ◽

Wide Range

Abstract. Information on species’ habitat associations and distributions, across a wide range of spatial and temporal scales, is a fundamental source of ecological knowledge. However, collecting biological information at relevant scales is often cost-prohibitive, although it is essential for framing the broader context of more focused research and conservation efforts. Citizen-science data have been signaled as an increasingly important source of the biological information needed to fill data gaps and to make more comprehensive and robust inferences on species distributions. However, there are perceived trade-offs of combining highly structured, scientific survey data with largely unstructured, citizen-science data. As a result, the focus of most methodological advances to combine these sources of information has been on treating these sources as independent. The degree to which each source of information is allowed to directly inform a common underlying process (e.g. species distribution) depends on the perceived quality of the data. In this paper, we explore these trade-offs by applying a simplified approach of filtering citizen-science data to resemble structured survey data, and analyze both sources of data under a common framework. To accomplish this, we explored ways of integrating high-resolution survey data on shorebirds in the northern Central Valley of California with observations in eBird for the entire region that were filtered to improve their quality. The integration of survey data with the filtered citizen-science data in eBird resulted in improved inference and predictive ability, and increased the extent and accuracy of inferences on shorebirds for the Central Valley. The structured surveys were found to improve the overall accuracy of ecological inference based only on citizen-science data, by increasing the representation of data collected from high quality habitats for shorebirds (e.g. rice fields).
The practical approach we have shown for data integration can also be used to improve the efficiency of designing biological surveys in the context of larger, citizen-science monitoring efforts, ultimately reducing the financial and time expenditures typically required of monitoring programs and focused research. The simple processing and filtering method we present can be used to integrate other types of data (e.g. camera traps) with more localized efforts (e.g. research projects), ultimately improving our ecological knowledge on the distribution and habitat associations of species of conservation concern worldwide.


Using citizen science data to monitor the Sustainable Development Goals: a bottom-up analysis

Sustainability Science ◽

10.1007/s11625-021-01001-1 ◽

2021 ◽

Author(s):

Laura Ballerini ◽

Sylvia I. Bergh

Keyword(s):

Sustainable Development ◽

Citizen Science ◽

Sustainable Development Goals ◽

Comparative Case Study ◽

Science Data ◽

The Sustainable Development ◽

Case Study Analysis ◽

Wide Range ◽

Development Goals

Abstract. Official data are not sufficient for monitoring the United Nations Sustainable Development Goals (SDGs): they do not reach remote locations or marginalized populations and can be manipulated by governments. Citizen science data (CSD), defined as data that citizens voluntarily gather by employing a wide range of technologies and methodologies, could help to tackle these problems and ultimately improve SDG monitoring. However, the link between CSD and the SDGs is still understudied. This article aims to develop an empirical understanding of the CSD-SDG link by focusing on the perspective of projects which employ CSD. Specifically, the article presents primary and secondary qualitative data collected on 30 of these projects and an explorative comparative case study analysis. It finds that projects which use CSD recognize that the SDGs can provide a valuable framework and legitimacy, as well as attract funding, visibility, and partnerships. But, at the same time, the article reveals that these projects also encounter several barriers with respect to the SDGs: a widespread lack of knowledge of the goals, combined with frustration and political resistance towards the UN, may deter these projects from contributing their data to the SDG monitoring apparatus.


Can citizen science analysis of camera trap data be used to study reproduction? Lessons from Snapshot Serengeti program

10.1101/2020.11.30.400804 ◽

2020 ◽

Author(s):

Thel Lucie ◽

Chamaillé-Jammes Simon ◽

Keurinck Léa ◽

Catala Maxime ◽

Packer Craig ◽

...

Keyword(s):

Citizen Science ◽

Camera Trap ◽

Camera Traps ◽

Life History Trait ◽

List Type ◽

Breeding Phenology ◽

Science Data ◽

Morphological Criteria ◽

Wide Range ◽

Trained Observers

Abstract. Ecologists increasingly rely on camera trap data to estimate a wide range of biological parameters such as occupancy, population abundance or activity patterns. Because of the huge amount of data collected, the assistance of non-scientists is often sought after, but an assessment of the data quality is a prerequisite to their use. We tested whether citizen science data from one of the largest citizen science projects - Snapshot Serengeti - could be used to study breeding phenology, an important life-history trait. In particular, we tested whether the presence of juveniles (less than one or 12 months old) of three ungulate species in the Serengeti: topi Damaliscus jimela, kongoni Alcelaphus buselaphus and Grant’s gazelle Nanger granti could be reliably detected by the “naive” volunteers vs. trained observers. We expected a positive correlation between the proportion of volunteers identifying juveniles and their effective presence within photographs, assessed by the trained observers. We first checked the agreement between the trained observers for age classes and species and found a good agreement between them (Fleiss’ κ > 0.61 for juveniles of less than one and 12 month(s) old), suggesting that morphological criteria can be used successfully to determine age. The relationship between the proportion of volunteers detecting juveniles less than a month old and their actual presence plateaued at 0.45 for Grant’s gazelle and reached 0.70 for topi and 0.56 for kongoni. The same relationships were however much stronger for juveniles younger than 12 months, to the point that their presence was perfectly detected by volunteers for topi and kongoni. Volunteers’ classification allows a rough, moderately accurate, but quick, sorting of photograph sequences with/without juveniles. Obtaining accurate data however appears more difficult.
We discuss the limitations of using citizen science camera traps data to study breeding phenology, and the options to improve the detection of juveniles, such as the addition of aging criteria on the online citizen science platforms, or the use of machine learning.
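
The abstract above reports inter-observer agreement as Fleiss’ κ. For readers unfamiliar with the statistic, the sketch below shows the standard textbook computation in plain Python. The function name `fleiss_kappa` and the toy rating tables are assumptions made for illustration; they are not data or code from the study.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa for a table of rating counts.

    counts[i][j] is the number of raters who put subject i (e.g. one
    photograph) into category j (e.g. "juvenile" / "no juvenile").
    Every row must sum to the same number of raters (at least 2).
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each subject needs the same number (>= 2) of ratings")

    # Observed agreement: mean over subjects of the share of agreeing rater pairs.
    p_bar = sum(
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ) / n_subjects

    # Chance agreement from the overall proportion of ratings in each category.
    n_categories = len(counts[0])
    total = n_subjects * n_raters
    p_e = sum(
        (sum(row[j] for row in counts) / total) ** 2
        for j in range(n_categories)
    )
    if p_e == 1.0:
        raise ValueError("kappa is undefined when all ratings share one category")
    return (p_bar - p_e) / (1.0 - p_e)


# Invented toy tables: 2 photographs, 3 raters, 2 categories.
perfect = fleiss_kappa([[3, 0], [0, 3]])   # raters always agree
split = fleiss_kappa([[2, 1], [1, 2]])     # raters disagree on every photo
```

A value of 1 indicates complete agreement, while values at or below 0 indicate agreement no better than chance.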


Entomological citizen science in Canada

The Canadian Entomologist ◽

10.4039/tce.2017.48 ◽

2017 ◽

Vol 149 (6) ◽

pp. 774-785 ◽

Cited By ~ 4

Author(s):

John H. Acorn

Keyword(s):

Citizen Science ◽

Natural World ◽

Voluntary Participation ◽

Scientific Process ◽

Short Term ◽

Science Data ◽

Geographic Ranges ◽

Science Projects ◽

Online Databases ◽

Gathering Data

Abstract. Citizen science involves voluntary participation in the scientific process, typically by gathering data in order to monitor some aspect of the natural world. Entomological citizen science, as an extension of traditional amateur entomology, is an active field in Canada, with online databases such as eButterfly and BugGuide attracting both contributors and database users. As well, traditional amateur entomology continues to be important in Canada, as do short-term insect-themed educational events, the involvement of amateurs in entomological societies, and online crowdsourcing initiatives. Success of citizen science projects can be measured in many ways. In terms of published papers that analyse trends in citizen science data, Canadian projects have only begun to deliver. More valuable are particular records that improve our knowledge of geographic ranges and phenology. In terms of the endurance of particular projects, and the willingness of volunteers to participate, citizen science entomology in Canada is clearly a success. However, quality control of citizen science data remains an issue for some projects. As well, challenges remain with respect to balancing the goals of researchers, participants, and supporting institutions.


Rapid assessment of the suitability of multi-species citizen science datasets for occupancy trend analysis

10.1101/813626 ◽

2019 ◽

Cited By ~ 1

Author(s):

Michael J.O. Pocock ◽

Mark W. Logie ◽

Nick J.B. Isaac ◽

Charlotte L. Outhwaite ◽

Tom August

Keyword(s):

Great Britain ◽

Citizen Science ◽

Expert Elicitation ◽

List Type ◽

Occupancy Models ◽

Science Data ◽

Focal Species ◽

Wide Range ◽

Simple Rules ◽

Taxonomic Groups

Abstract. Species records from volunteers are a vast and valuable source of information on biodiversity for a wide range of taxonomic groups. Although these citizen science data are opportunistic and unstructured, occupancy analysis can be used to quantify trends in distribution. However, occupancy analysis of unstructured data can be resource-intensive and requires substantial expertise. It is valuable to have simple ‘rules of thumb’ to efficiently assess the suitability of a dataset for occupancy analysis prior to analysis. Our analysis was possible due to the production of trends, from our Bayesian occupancy analysis, for 10 967 species from 34 multi-species recording schemes in Great Britain. These schemes had an average of 500 visits to sites per year, and an average of 20% of visited sites received a revisit in a year. Occupancy trend outputs varied in their precision and we used expert elicitation on a subset of outputs to determine a precision threshold above which trends were suitable for further consideration. We then used classification trees with seven metrics to define simple rules explaining when the data would result in outputs that met the precision threshold. We found that the suitability of a species’ data was best described by (i) the number of records of the focal species in the 10% best-recorded years, and (ii) the proportion of recording visits for that taxonomic group with non-detections of the focal species. Surprisingly few data were required to be predicted to meet the precision threshold.
Specifically, for 98% confidence that our Bayesian occupancy models would produce outputs meeting the precision threshold, there needed to be ≥29 records of the focal species in the 10% best-recorded years (equivalent to an average of 12.5 records per year in our dataset), although only ≥10 records (equivalent to 4.5 records per year) were required for species recorded in less than 1 in 25 visits. We applied these rules to regional species data for Great Britain. Data from 32% of the species:region combinations met the precision threshold with 80% confidence, and 14% with 98% confidence. There was great variation between taxonomic groups (e.g. butterflies, moths and dragonflies were well recorded) and region (e.g. south-east England was best recorded). These simple criteria provide no indication of the accuracy or representativeness of the trend outputs: this is vital, but needs to be assessed individually. However, our criteria do provide a rapid, quantitative assessment of the predicted suitability of existing data for occupancy analysis and could be used to inform the design and implementation of multi-species citizen science recording projects elsewhere in the world.
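
The 98%-confidence rule of thumb quoted in the abstract above can be written as a short check. This is a minimal sketch that encodes only the two thresholds stated in the abstract; the function name, parameter names, and example values are hypothetical, and the classification trees in the paper itself may include conditions not captured here.

```python
def meets_98pct_rule(records_in_best_years: int, detection_rate: float) -> bool:
    """Apply the two thresholds quoted in the abstract.

    records_in_best_years: records of the focal species in the 10%
        best-recorded years.
    detection_rate: share of recording visits for the taxonomic group on
        which the focal species was recorded (0.0 to 1.0).
    """
    if not 0.0 <= detection_rate <= 1.0:
        raise ValueError("detection_rate must be between 0 and 1")
    # Species recorded in fewer than 1 in 25 visits need only >= 10 records;
    # all other species need >= 29 records.
    required = 10 if detection_rate < 1 / 25 else 29
    return records_in_best_years >= required


# Invented example values, for illustration only.
common_species_ok = meets_98pct_rule(records_in_best_years=29, detection_rate=0.10)
rare_species_ok = meets_98pct_rule(records_in_best_years=10, detection_rate=0.02)
```

As the abstract stresses, passing such a check says nothing about the accuracy or representativeness of the resulting trend, only about its predicted precision.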


Citizen science with colour blindness: A case study on the Forel-Ule scale

PLoS ONE ◽

10.1371/journal.pone.0249755 ◽

2021 ◽

Vol 16 (4) ◽

pp. e0249755

Author(s):

Olivier Burggraaff ◽

Sanjana Panchagnula ◽

Frans Snik

Keyword(s):

Citizen Science ◽

Data Entry ◽

Science Data ◽

Training Materials ◽

Colour Scale ◽

The Social ◽

Uncertainty Estimates ◽

Colour Blindness ◽

The Impact

Many citizen science projects depend on colour vision. Examples include classification of soil or water types and biological monitoring. However, up to 1 in 11 participants are colour blind. We simulate the impact of various forms of colour blindness on measurements with the Forel-Ule scale, which is used to measure water colour by eye with a 21-colour scale. Colour blindness decreases the median discriminability between Forel-Ule colours by up to 33% and makes several colour pairs essentially indistinguishable. This reduces the precision and accuracy of citizen science data and the motivation of participants. These issues can be addressed by including uncertainty estimates in data entry forms and discussing colour blindness in training materials. These conclusions and recommendations apply to colour-based citizen science in general, including other classification and monitoring activities. Being inclusive of the colour blind increases both the social and scientific impact of citizen science.


Maps in Citizen Science: A Preliminary Analysis of Use and User Issues

Abstracts of the ICA ◽

10.5194/ica-abs-1-339-2019 ◽

2019 ◽

Vol 1 ◽

pp. 1-2

Author(s):

Artemis Skarlatidou ◽

Marcos Moreu

Keyword(s):

Citizen Science ◽

Social Networking Sites ◽

Spatial Information ◽

Geographic Information ◽

Environmental Conservation ◽

Recent Analysis ◽

Skill Sets ◽

Fully Integrated ◽

Science Projects ◽

Wide Range

Abstract. Citizen Science involves a collaboration or partnership between scientists and amateur volunteers, which may take various forms; from simple data collection to a close collaboration where both parties jointly define their aims, methodologies and analysis approaches in the scientific endeavour. Although citizen science has existed for more than two centuries (Silvertown, 2009), the widespread use of information and communication technology (ICT) now plays a significant role in the way citizen science is currently shaped and utilised. At present, there are hundreds of citizen science applications available which engage thousands of volunteers in the disciplines of astronomy, environmental conservation, biology, marine science, geography and many others. A relatively recent analysis of 388 citizen science projects revealed that they have been used to engage 1.3 million volunteers, contributing up to US$2.5 billion in-kind annually (Theobald et al., 2015). Web 2.0 and its associated technologies, which have existed for almost 15 years now, have enabled the development of websites which supported content generation by their end users (aka crowdsourcing; Howe, 2008) and multiple interactions amongst them. Examples include web-based communities, social-networking sites, wikis, mashups, and others (Batty et al., 2010). In this context the term ‘Neogeography’ was coined (Eisnor, 2006) and since then it has been used within the geographic and cartographic circles to describe the multi-directional generation of geospatial contents and interactions, which enables non-GIS professionals to create and share maps and other geographic information online “on their own terms” simply using the “elements of an existing toolset” (Eisnor, 2006).
Map mashups started to be used not only for disseminating spatial information to a wider user audience; applications were also created which enabled the crowdsourcing of geographic information for the production of geospatial knowledge, a trend which is also known under the term Volunteered Geographic Information (Goodchild, 2007). OpenStreetMap (OSM) is perhaps one of the earliest examples that the literature cites to demonstrate how harnessing the power of the crowds for the collection of geographic information can result in the creation of a free, open-source map of the world (Goodchild, 2007; Haklay et al., 2008; Batty et al., 2010). We argue in this paper that the above developments from the geospatial context have massively contributed to the current state of citizen science. While interactive web maps made their appearance as mainly “way-finding” tools (Skarlatidou and Haklay, 2006), they quickly became part of digital interactions in a much broader context and they are currently a basic component of most citizen science projects. The relevance and significance of space has been fully exploited by technological features such as geotagging and GPS-enabled mobile devices fully integrated with other sensors, which have made the collection and sharing of data much easier (Haklay, 2013). Sinton (2018) argues that such is the power of maps in citizen science that “it would be difficult to pursue a project in biological conservation, for example, without incorporating mapping”. The breadth of citizen science applications is so wide that we observe an extremely wide range of potential users, with very different skill sets, backgrounds, literacy levels and user needs.


An investigation into the spatiotemporal patterns of the Nymphalid butterfly Vagrans egista sinha (Kollar, [1844])

10.1101/2022.01.02.474748 ◽

2022 ◽

Author(s):

Paul Pop ◽

Kuldeep Singh Barwal ◽

Randeep Singh ◽

Puneet Pandey ◽

Harminder Pal Singh ◽

...

Keyword(s):

Seasonal Variation ◽

Citizen Science ◽

Range Shift ◽

Range Extension ◽

Science Data ◽

The West ◽

North West ◽

The North ◽

Spatial Differences ◽

The Impact

Vagrans egista sinha (Kollar, [1844]), the Himalayan Vagrant, is a subspecies of Nymphalid (Brush-footed) butterflies spread across Asia, whose western limit is in north-west India. Observations of this subspecies have considerably increased over the past half-decade, with a spike in new sightings to the west of their previously known range. This has been considered a range extension. The current study reports new records of this subspecies from Bilaspur District, Himachal Pradesh, India (which are the first records for the district), through systematic and opportunistic sampling. This raises the question of whether the purported range extension towards the west could instead be a range shift or vagrancy, and whether there is any shift in elevational ranges in the populations across their known range. Questions pertaining to spatial differences in elevational ranges and seasonal variation, across their range, also piqued our curiosity. Using data from academic sources (such as published literature and museum collections), supplemented by data from public participation in scientific research and personal observations, these research questions are addressed. The accuracy of results when using citizen science data is also explored using the same dataset, focused on the impact of the method of extraction of coordinates, and the elevation derived from them under different scenarios. It was discovered that there has not been a range shift (either longitudinal or latitudinal) and observations do not suggest vagrancy but a case of range extension. Other results indicated that there was no climb of population to higher elevations, no spatial differences in elevational ranges in the populations, and no seasonal variation in activities across their range. It was also discovered that the method of data collection by, and extraction from, citizen science databases can influence the accuracy of the results.
Some problems involved in collecting data are discussed, and remedial solutions are suggested.


Increasing research impact with citizen science: The influence of recruitment strategies on sample diversity

Public Understanding of Science ◽

10.1177/0963662519840934 ◽

2019 ◽

Vol 28 (5) ◽

pp. 606-621 ◽

Cited By ~ 7

Author(s):

Stijn Brouwer ◽

Laurens K. Hessels

Keyword(s):

Citizen Science ◽

Research Impact ◽

Recruitment Strategy ◽

Recruitment Strategies ◽

Water Research ◽

Research And Innovation ◽

Science Projects ◽

Strategy Versus ◽

The Impact ◽

Insight Into

Although citizen engagement in research is widely practised and regarded as one of the keys to maximizing the impact of research and innovation, empirical evidence on the value, potential and possibilities of engaging a broad diversity of citizens in practice is scant. The purpose of our article is twofold: (1) to provide more insight into the value and opportunities of engaging audiences that typically are not engaged with science and (2) to explore the effect of a targeted recruitment strategy versus a generic recruitment strategy on the profile, motivation and retention of citizen science volunteers. Our empirical research is based on five citizen science projects in the domain of surface and drinking water research in the Netherlands. This article finds that, using a targeted recruitment strategy, it is possible and worthwhile to recruit a diverse sample of citizen science volunteers.
