Data Quality Studies

This page lists 29 peer-reviewed papers tagged with Data Quality in the Prolific Citations Library, a curated collection of research powered by high-quality human data from Prolific.

Papers (20 of 29)

Generative AI meets open-ended survey responses: Research participant use of ai and homogenization

Authors: S Zhang, J Xu, AJ Alvero

Year: 2025

Published in: Sociological Methods & Research, 2025 - journals.sagepub.com

Institution: University of Maryland, Indiana University, University of Minnesota Duluth

Research Area: Sociological Methods, Generative AI, Survey Methodology

Discipline: Sociology, Social Science

The study finds that 34% of research participants use generative AI tools like large language models (LLMs) to assist with open-ended survey responses, leading to more homogeneity and positivity in their answers, which could impact data validity by masking social variations.

Methods: The study conducted an original survey on a popular online platform and simulated comparisons between human-written responses from pre-ChatGPT studies and LLM-generated responses.

Key Findings: Use of LLMs by survey participants, differences in text homogeneity, positivity, and masking of social variation in open-ended survey responses.

Citations: 26
Can intelligent agents improve data quality in online questiosnnaires? A pilot study

Authors: A Söderström, A Shatte

Year: 2025

Published in: Behavior Research ..., 2021 - Springer

Institution: University of Helsinki

Research Area: Intelligent Agents, Health Research Methodology, Behavioral Research Methods

Discipline: Research Methodology, Behavioral Science

The study found that chatbot-assisted surveys modestly improve data quality, with most users finding the chatbots helpful and widely using them.

Methods: Randomized participants into chatbot-supported and unassisted survey conditions; assessed chatbot use, user satisfaction, and data quality via validated and deliberately confusing challenge items.

Key Findings: Effects of chatbot assistance on data quality, user satisfaction, and usage patterns in online questionnaires.

DOI: https://doi.org/10.3758/s13428-021-01574-w

Citations: 7

Sample Size: 300
Reflection-Philosophy Order Effects and Correlations Across Samples

Authors: N Byrd

Year: 2025

Published in: Byrd, N. (2025). Reflection-Philosophy Order Effects and Correlations Across Samples. Analysis. DOI: 10.1093/analys/anaf015. https://osf.io/preprints/psyarxiv/y8sdm

Institution: Stevens Institute of Technology

Research Area: Behavioral Research Methods, Experimental Psychology, Crowdsourcing Platforms

Discipline: Psychology

Reflective reasoning correlates with certain philosophical decisions, and the study suggests bidirectional causal paths between reflection and philosophy, with test order effects influencing reflection test outcomes but not philosophical decisions.

Methods: Participants from four sources (Amazon Mechanical Turk, CloudResearch, Prolific, and a university) were tested on reflective reasoning and their decisions on 10 philosophical thought experiments.

Key Findings: Impact of reflective reasoning on philosophical decisions and the effect of test order on reflection and philosophy outcomes.

Citations: 4
Unlocking Insights into Prolific: Research Implications, Participant Behavior and Motivations

Authors: B Aksoy, S Nevo

Year: 2025

Published in: Participant Behavior and Motivations (March 21 ..., 2025 - papers.ssrn.com

Institution: Rensselaer Polytechnic Institute

Research Area: Crowdsourcing, Participant Behavior

Discipline: Computational Social Science

Research on Prolific reveals that participant compensation significantly impacts sample selection, potentially introducing biases, and offers insights into participant motivations and behavior to improve study reliability and design.

Methods: A carefully designed experiment was performed to analyze correlations between participants' reservation wages, socioeconomic attributes, and study compensations; sensitivity analyses were conducted for further guidance.

Key Findings: Participant reservation wages, socioeconomic attributes, perceptions of general behavior and motivations, and implications of study design decisions.

Citations: 3
Data quality in crowdsourcing and spamming behavior detection

Authors: Y Ba, MV Mancenido, EK Chiou, R Pan

Year: 2025

Published in: Behavior Research Methods, 2025 - Springer

Institution: University of Delaware, National Taiwan University, University of British Columbia, Monash University

Research Area: Crowdsourcing, Data Quality, Spamming Behavior Detection, LLM Applications in Behavioral Research

Discipline: Computer Science, Artificial Intelligence, Large Language Models

The paper introduces a systematic method to evaluate crowdsourced data quality and detect spam behaviors through variance decomposition, proposing a spammer index and credibility metrics to improve consistency and reliability in labeling tasks.

Methods: Variance decomposition, Markov chain models, and generalized random effects models were used to assess annotator consistency and credibility; metrics were applied to both simulated and real-world data from two crowdsourcing platforms.

Key Findings: Quality of crowdsourced data, spammer behaviors, annotators’ consistency, and credibility.

Citations: 2
Evaluating mobile-based data collection for crowdsourcing behavioral research

Authors: DT Esch, N Mylonopoulos, V Theoharakis

Year: 2025

Published in: Behavior Research Methods, 2025 - Springer

Institution: University of Cologne, University of Piraeus, Aristotle University of Thessaloniki

Research Area: Crowdsourcing Behavioral Research, Mobile Data Collection

Discipline: Behavioral Research Methods

Mobile-based responses via platforms like Pollfish are comparable in quality to computer-based ones from MTurk and Prolific, though attentiveness varies significantly across platforms and is influenced by factors like incentives, distractions, and system 1 thinking.

Methods: Conducted two studies distributing the same survey across MTurk, Prolific, Pollfish, and Qualtrics panels to compare data quality and analyze attentiveness scores.

Key Findings: Attentiveness, device usage (mobile vs. computer), and factors influencing data quality such as incentives, respondent activity, distractions, and survey familiarity.

Citations: 1
Influencing Humans to Conform to Preference Models for RLHF

Authors: S Hatgis-Kessell, WB Knox, S Booth, S Niekum

Year: 2025

Published in: arXiv preprint arXiv ..., 2025 - arxiv.org

Institution: Stanford University, UMass Amherst, Carnegie Mellon University

Research Area: Reinforcement Learning from Human Feedback (RLHF)

Discipline: Artificial Intelligence, Human-Computer Interaction

The paper investigates whether human preferences can be influenced to align more closely with assumed preference models in RLHF algorithms through interventions such as showing model-derived quantities, training on preference models, and modifying elicitation questions.

Methods: Three human studies were conducted where interventions were tested, including revealing model-derived quantities, training participants on a preference model, and altering how preference questions were framed.

Key Findings: Evaluated the impact of interventions on humans' expression of preferences to align better with the assumed preference models of RLHF algorithms.

DOI: https://doi.org/10.48550/arXiv.2501.06416

Citations: 1
Why you shouldn’t trust data collected on MTurk

Authors: CS Kay

Year: 2025

Published in: Behavior Research Methods, 2025•Springer

Institution: Stanford University

Research Area: Behavioral Research Methods

Discipline: Behavioral Science, Behavioral Research Methods

Data collected on Amazon's Mechanical Turk (MTurk) shows substantial quality issues, with semantic antonym pairs being positively correlated instead of negatively, even after implementing data screening and using high-reputation participants.

Methods: 27 semantic antonym pairs were administered to participants from Connect (N=100), Prolific (N=100), and MTurk (N=400, N=600) to examine response quality and correlation patterns.

Key Findings: The correlation of responses to semantic antonym pairs as an indicator of data quality across different survey platforms.

Citations: 1

Sample Size: 1200
Improving annotation quality: empirical insights into bias, human-AI collaboration, and workflow design

Authors: J Beck

Year: 2025

Published in: 2025 - edoc.ub.uni-muenchen.de

Institution: Ludwig-Maximilians-Universität München, University of Bayreuth

Research Area: Annotation Quality, Human-AI Collaboration, Behavioral Science, Human-Computer Interaction

Discipline: Human-Computer Interaction

The study empirically evaluates annotation bias, proposes strategies to reduce its impact, and explores the use of large language models in automated and hybrid annotation workflows.

Methods: Empirical assessments and experimental evaluations involving annotation workflows and large language models.

Key Findings: Annotation bias, annotation quality, and the effectiveness of hybrid workflows integrating human input and AI models.
Detecting the corruption of online questionnaires by artificial intelligence

Authors: B Lebrun, S Temtsin, A Vonasch

Year: 2024

Published in: Frontiers in Robotics and ..., 2024 - frontiersin.org

Institution: University of Lausanne, University of California Berkeley, University of Massachusetts Amherst, Arizona State University

Research Area: AI in Social Science Research, Survey Methodology, Data Quality

Discipline: Artificial Intelligence

The study examines the integrity of online questionnaire responses and concludes that humans can identify AI-generated text with 76% accuracy, but current AI detection systems are ineffective, raising concerns about data quality in online surveys.

Methods: Human participants and automatic AI detection systems were tested on their ability to differentiate AI-generated text from human-generated text in the context of online questionnaires.

Key Findings: The study measured the ability of humans and AI detection tools to correctly identify whether text was generated by a human or an AI system for online questionnaire responses.

DOI: https://doi.org/10.3389/frobt.2023.1277635

Citations: 26
The ethical, societal, and global implications of crowdsourcing research

Authors: S Du, MT Babalola, P D'cruz, E Dóci

Year: 2024

Published in: Journal of Business ..., 2024 - Springer

Institution: Nottingham University Business School, University of Reading, Oxford Brookes University, University of Portsmouth

Research Area: Crowdsourcing Ethics, Social Science, Organizational Behavior

Discipline: Social Science

The paper explores the ethical, societal, and global implications of using crowdsourcing platforms for research, emphasizing the need for fair compensation, transparency, and consideration of global disparities between the Global North and South.

Methods: The paper provides a conceptual analysis and critique of crowdsourcing research practices, focusing on ethical and societal considerations.

Key Findings: Ethical, societal, and global implications of crowdsourcing research practices, including data quality, reporting transparency, fair remuneration, and the role of global disparities.

Citations: 24
Comparing data quality from an online and in-person lab sample on dynamic theory of mind tasks

Authors: AC Krendl, K Hugenberg, DP Kennedy

Year: 2024

Published in: Behavior research methods, 2024 - Springer

Institution: Indiana University

Research Area: Psychological Research Methods, Data Quality in Online Experiments, Theory of Mind Assessment

Discipline: Research Methodology, Cognitive Psychology

This study found that online samples can reliably complete dynamic, complex theory of mind tasks, though familiarity with task content can influence performance.

Methods: Compared in-lab and online participants' performance on two dynamic theory of mind tasks, using one familiar and one relatively novel TV-based paradigm and counterbalancing task order.

Key Findings: Performance on theory of mind tasks, including inferring beliefs, understanding motivations, detecting deception, identifying faux pas, and understanding emotions.

Citations: 13

Sample Size: 668
Crowdsourcing Academic Online Research on Prolific

Authors: Eyal Peer

Year: 2024

Published in: CAMBRIDGE

Institution: Hebrew University, University of Cambridge

Research Area: Crowdsourcing, Research Methodology in Behavioral and Social Sciences

Discipline: Social, Behavioral Science

Citations: 7
Motivation Matters: Challenges and Pit-Falls of Crowdsourced Online Studies

Authors: E Becks, V Matkovic, T Weis

Year: 2024

Published in: 2025 IEEE International Conference on ..., 2025 - computer.org

Institution: University of Stuttgart, University of Applied Sciences Offenburg, University of Hohenheim

Research Area: Crowdsourced Online Studies, Human-Computer Interaction (HCI) in AI Systems, Behavioral Research Methodology

Discipline: Human-Computer Interaction
Data quality in online human-subjects research: Comparisons between MTurk, Prolific, CloudResearch, Qualtrics, and SONA

Authors: BD Douglas, PJ Ewell, M Brauer

Year: 2023

Published in: Plos one, 2023 - journals.plos.org

Institution: University of Alabama, University of Wisconsin-Madison, Florida Atlantic University

Research Area: Social Science Research Methods, Behavioral Research Methods, Data Quality in Crowdsourcing

Discipline: Social Science Research Methods

Citations: 1598
From lab-testing to web-testing in cognitive research: Who you test is more important than how you test

Authors: K Uittenhove, S Jeanneret, E Vergauwe

Year: 2023

Published in: Journal of Cognition, 2023 - pmc.ncbi.nlm.nih.gov

Institution: University of Lausanne, University of Geneva, EPFL, University of Neuchâtel, NiH

Research Area: Cognitive Psychology, Research Methodology, Behavioral Research Methods, Web-based Behavioral Research

Discipline: Cognitive Research, Psychology

Citations: 83
Data quality of platforms and panels for online behavioral research

Authors: E Peer, D Rothschild, A Gordon, E Damer

Year: 2022

Published in: Behavior Research Methods, 2022 - Springer

Institution: The Hebrew University of Jerusalem, Microsoft Research, Prolific

Research Area: Online Behavioral Research, Data Quality, Research Methodology

Discipline: Computational Social Science, Behavioral Research Methods

Citations: 2112
Replication: How well do my results generalize now? The external validity of online privacy and security surveys

Authors: J Tang, E Birrell, A Lerner

Year: 2022

Published in: ... symposium on usable privacy and security ..., 2022 - usenix.org

Institution: University of California Berkeley, George Washington University, Stanford University

Research Area: Online privacy and security surveys, External validity, Replication studies, Human-Computer Interaction (HCI) in security

Discipline: Computer Science, Behavioral Science

Citations: 164
Conducting linguistic experiments online with OpenSesame and OSWeb

Authors: S Mathôt, J March

Year: 2022

Published in: Language Learning, 2022 - Wiley Online Library

Institution: Ghent University, KU Leuven

Research Area: Online experimental methodology, Psycholinguistics, Behavioral research tools

Discipline: Psycholinguistics

Citations: 76
Online experimentation and sampling in cognitive aging research.

Authors: NR Greene, M Naveh-Benjamin

Year: 2022

Published in: Psychology and Aging, 2022 - psycnet.apa.org

Institution: University of Kansas, University of Arkansas, University of California Riverside

Research Area: Cognitive Aging Research, Online Experimentation, Sampling Methods in Psychology

Discipline: Psychology, Cognitive Aging Research

Citations: 56