Data Imbalances in Coincidence Analysis: A Simulation Study

Saved in:
Bibliographic Details
Title: Data Imbalances in Coincidence Analysis: A Simulation Study
Language: English
Authors: Martyna Daria Swiatczak (ORCID 0000-0002-7537-1813), Michael Baumgartner (ORCID 0000-0003-1536-2816)
Source: Sociological Methods & Research. 2025 54(2):739-771.
Availability: SAGE Publications. 2455 Teller Road, Thousand Oaks, CA 91320. Tel: 800-818-7243; Tel: 805-499-9774; Fax: 800-583-2665; e-mail: journals@sagepub.com; Web site: https://sagepub.com
Peer Reviewed: Y
Page Count: 33
Publication Date: 2025
Document Type: Journal Articles
Reports - Research
Descriptors: Causal Models, Comparative Analysis, Data Analysis, Statistical Distributions, Statistical Data
DOI: 10.1177/00491241241227039
ISSN: 0049-1241
1552-8294
Abstract: In this paper, we investigate the conditions under which data imbalances, a common data characteristic that occurs when factor values are unevenly distributed, are problematic for the performance of Coincidence Analysis (CNA). We further examine how such imbalances relate to fragmentation and noise in data. We show that even extreme data imbalances, when not combined with fragmentation or noise, do not negatively affect CNA's performance. However, an extended series of simulation experiments on fuzzy-set data reveals that, when mixed with fragmentation or noise, data imbalances may substantially impair CNA's performance. Furthermore, we find that the performance impairment is higher when endogenous factors are imbalanced than when exogenous factors are concerned. Our results allow us to quantify these impacts and demarcate degrees at which data imbalances should be considered as problematic. Thus, applied researchers can use our demarcation guidelines to enhance the validity of their studies.
Abstractor: As Provided
Entry Date: 2025
Accession Number: EJ1473620
Database: ERIC
Full text is not displayed to guests.
Be the first to leave a comment!
You must be logged in first