A dataset of citizen science practitioners’ experiences and practices
Michael O'Grady, Eleni Mangina

TL;DR
This paper introduces a dataset from a survey of 100 citizen scientists, capturing their experiences and practices to support future research and initiatives.
Contribution
The novelty lies in presenting a comprehensive and structured dataset from citizen science practitioners, enabling further empirical research.
Findings
The dataset includes 100 participants engaged in diverse citizen science projects.
The survey covers themes like demographics, data practices, and open research knowledge.
The questionnaire is available for use in future surveys at different geographic or project levels.
Abstract
There has been renewed interest in Citizen Science (CS) in recent years as it offers an intriguing vision of enabling a scientifically literate population engage in scientific investigations and policy formation. Nonetheless, citizen scientists remain an understudied population, possibly due to the voluntary and part-time nature of their endeavours. Here, a dataset of CS practitioners’ experiences collected using an online survey is presented. The survey sample comprises 100 adults (18+) active in diverse CS projects. The survey contains 47 questions designed for quantitative analysis. Questions cluster around several broad themes - participant demographics, project profiles, experience in citizen science, data collection practices, management, dissemination, knowledge of open research principles, and training received. The dataset offers the potential for further empirical research or…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpecies Distribution and Climate Change · Animal and Plant Science Education · Insect and Arachnid Ecology and Behavior
Specifications TableSubjectEnvironmental Science, Management, Monitoring, Policy and LawSpecific subject areaCitizen scienceType of dataRaw data in a table (Excel spreadsheet)Data collectionData was collected using an online survey implemented through Google Forms. Participants could only contribute once they confirmed they were over 18 years of age, understood the motivations behind the project, and gave permission to share the data. No identifiable data was requested. Data was stored at the authors’ institution only after participants pressed the submit widget on the final page of the survey.Data source locationInstitution: University College DublinCity/Town/Region: DublinCountry: IrelandData accessibilityRepository name: ZenodoData identification number: 10.5281/zenodo.12548824Direct URL to data: https://zenodo.org/doi/10.5281/zenodo.12548824Related research articleM.J. O'Grady, E. Mangina, Citizen scientists - practices, observations, and experience, Humanit. Soc. Sci. Commun 11 (2024) 1-9. https://doi.org/10.1057/s41599-024-02966-x. [1]
Value of the Data
1
- •Open datasets that report on studies of the participatory science community are rare.
- •This dataset provides an in-depth snapshot of CS participants, their projects and activities, understanding of data practices, knowledge of open science, and training.
- •Primary beneficiaries of this dataset are those who plan to launch and operate a participatory science initiative.
- •Insights into how the crucial issue of data quality is perceived and practised may be gleaned.
- •The dataset provides a baseline for longitudinal studies monitoring the practices of the CS community.
Background
2
Participatory science [2] involves the public contributing to scientific activities ranging from data collection to analysis to interpretation. Diverse models of participatory science exist, of which Citizen Science (CS) is one of the most common. Here, public members contribute to empirical science initiatives, often through data collection. More recently, the potential of the CS community to confront the perceived democratic deficit [3,4] has been considered [5].
CS participants have been studied through online questionnaires [6] and open-ended interviews [7]. However, little is known about their understanding of fundamental concepts such as open science, Responsible Research and Innovation (RRI) [8], and good data practices. In the latter instance, data is a cornerstone of CS and data quality is an omnipresent concern in conventional CS [9,10]. For communities seeking to influence policy definition [11], it is vital that they can provide data of adequate quality to form a credible, transparent, and robust evidence base. Typically, a professional scientist acts as a de facto quality controller. However, such expertise may be lacking for community-driven initiatives. Thus, to maximise the potential of CS, the training needs of CS participants, as well as their understanding, experiences, and perceptions, merit further investigation.
Data Description
3
This dataset is presented as a Microsoft Excel worksheet. The data is in raw format. Each column relates to one question, and that question's text forms the header. For clarity purposes, the questionnaire is also included in the supplementary material. Most questions are multiple-choice or checkboxes where more than one option can be selected. Some grid-style questions are also harnessed. All mandatory questions had to be completed to proceed from one form to another. One question was optional – about the country that the participant was from. Conditional branching occurred once in the questionnaire. When participants responded positively, they were asked to complete four more questions. All identifiable and personal data has been removed. With these exceptions, the dataset is complete.
This dataset contains data provided by 100 participants. Of these, 53% identified as female, 45% as male, and 2% preferred not to say. Table 1 illustrates the age profile, with the 35–44 profile being the largest sub-group. 95% of participants reported as being from Europe. Seventeen countries were represented, but 14% of participants did not declare their nationality. Ireland provided the most participants (36%), most of whom were involved in biodiversity recording.Table 1. Participants’ age distribution.Table 1. Age%18–242%25–3413%35–4431%45–5422%55–6419%65+13%
88% of participants were involved in data collection, an activity synonymous with CS. In terms of roles, 50% defined themselves as an “Active Citizen Scientist”, while 40% categorised themselves as managers of a CS project. Overall, 63% contributed to decision-making in their projects.
A diverse spectrum of scientific domains was presented, with environmental science being the most popular (Table 2).Table 2. Citizen Science domains where participants contribute**.**Table 2. Domain%Biodiversity22%Earth Science24%Environmental Science33%Life Sciences4%Social Science8%Space Science3%Other6%
Experimental Design, Materials and Methods
4
The design of the survey questionnaire was influenced by project objectives and findings from related research in this domain [6,12]. The dataset was collected through an online survey. This survey was developed using Google Forms. It was subsequently translated into several languages - namely French, German, Greek, Italian, Polish, Portuguese, Spanish, and Turkish. Native speakers completed translations. However, the authors translated the responses as needed.
Adopting a convenience sampling approach, links to the survey were circulated widely using a combination of social media, online fora, and mailing lists. All channels were selected based on the expectation that their readership was predominantly CS participants. Potential survey participants were directed to the project website, where additional background and contextual information were available.
Before undertaking the survey, participants were requested to read some background material about the project's motivations. They were also informed that the dataset would be made openly available and that no personal or identifiable data would be requested. Participants could start the survey only when they agreed and confirmed that they consented to the conditions. As the survey was deemed low-risk, ethical exemption was granted. Participants were warned that completing the study would take at least 20 minutes. The survey was undertaken in Spring 2023, and 120 participants volunteered to contribute.
On analysing the data from a quality perspective, 20 contributions were found to lack consistency and exhibited internal contradictions. These were excluded, rendering the final dataset of 100 participants.
Limitations
This survey is archetypal of online surveys. Thus, it encompasses many disadvantages synonymous with such surveys, including self-selection bias and an overall lack of control [13]. Measures were undertaken to ensure the integrity of the dataset, and the anonymity of participants may have contributed to more honest answers. Ultimately, it must be assumed that participants considered themselves citizen scientists, irrespective of their role in a particular project at the time of the survey. However, the risk of false or randomly selected answers is omnipresent.
The dataset is limited. One hundred participants is considered small for online surveys, reducing the potential for generalisation. While the decision to go deep rather than broad was a pragmatic design decision, the results must be interpreted more as indicative than general.
The participants are overwhelmingly from Europe. Thus, the dataset broadly reflects the experiences of those within a limited geographic area. The degree to which the findings may apply to the other regions of the global north, especially North America, where CS is well-established, is open to question. Moreover, a global south perspective is lacking, even though CS has a long tradition in South America.
From an inclusion perspective, participants were limited to those who were computer-literate.
Ethics Statement
All research was performed according to relevant guidelines at University College Dublin, including, but not limited to, informed consent and data protection. The research was carried out in accordance with the Declaration of Helsinki as far as applicable to this type of study. As this survey was assessed as low risk, an exemption was obtained from the Office of Research Ethics at University College Dublin (LS-E-21-235-OGrady).
Informed consent—All participants provided informed consent. After reading the background, motivations, and data management policies, participants could either agree and proceed to the survey or disagree, in which case they could not. Participants further confirmed their agreement that the resultant dataset would be published as an academic publication.
CRediT authorship contribution statement
Michael O'Grady: Conceptualization, Methodology, Formal analysis, Visualization, Writing – original draft. Eleni Mangina: Investigation, Formal analysis, Writing – review & editing.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1O'Grady M.J.Mangina E.Citizen scientists - practices, observations, and experience Humanit. Soc. Sci. Commun 1120241910.1057/s 41599-024-02966-x · doi ↗
- 2Haklay M.Participatory citizen science. Citizen science: Innovation in open science Soc. Policy 20185262 http://www.jstor.org/stable/j.ctv 550cf 2.11
- 3Mavrouli R.Van Waeyenberge A.EU Responses to the democratic deficit and the rule of law crisis: is it time for a (New) European exceptionalism?Hague J. Rule Law 153202340543910.1007/s 40803-023-00198-w · doi ↗
- 4Liargovas P.Papageorgiou C.The perception of “Democratic Deficit” in the European Union The European Integration 22024 Institutions and Policies 15918410.1007/978-3-031-47176-6_4 · doi ↗
- 5Roszczyńska-Kurasińska M.Beyond data: recognizing the democratic potential of 154 citizen science IEEE Technol. Soc. Mag.4242023475610.1109/MTS.2023.3344904155 · doi ↗
- 6Schade S.Tsinaraki C.Survey Report: Data Management in Citizen Science Projects 2016 Publication Office of the European Union Luxembourg 10.2788/539115 · doi ↗
- 7Jones G.M.Childers G.Andre T.Corin E.N.Hite R.Citizen scientists and non-citizen scientist hobbyists: motivation, benefits, and influences Int. J. Sci. Educ. B 84201828730610.1080/21548455.2018.1475780 · doi ↗
- 8Zwart H.Barbosa Mendes A.Blok V.Epistemic inclusion: a key challenge for global RRIJ. Responsible Innov.111202410.1080/23299460.2024.2326721 · doi ↗
