Concordance between self-reported SARS-CoV-2 positivity and laboratory-confirmed positivity
Collin James Catalfamo, Elizabeth T. Jacobs, Laura P. Falk, Priscilla Lauro, Kacey C. Ernst, Leslie V. Farland, Kelly M. Heslin, Kristen Pogreba-Brown, Pamela C. Garcia-Filion

TL;DR
This study finds that most people accurately report their SARS-CoV-2 test dates, but accuracy decreases with age and time since testing.
Contribution
The study quantifies the accuracy of self-reported SARS-CoV-2 test dates compared to laboratory records and identifies factors affecting reliability.
Findings
82.5% of participants reported test dates within 7 days of laboratory confirmation.
Older age and longer time since testing were linked to larger self-reporting errors.
Self-reported dates were deemed sufficient for tracking Long COVID when lab data is unavailable.
Abstract
As the use and availability of at-home antigen tests for SARS-CoV-2 infection have increased, the number of individuals with SARS-CoV-2 infections that are reported to state COVID-19 surveillance systems have decreased. Self-reported infection dates are critical to accurately track incidence and outbreaks of COVID-19 and for continued research on illness progression; however, the reliability of self-reported infection dates is unknown to date. To assess accuracy of self-reported test dates, we utilized self-reported SARS-CoV-2 testing data from the Arizona CoVHORT Study (CoVHORT) and laboratory-confirmed testing data collected by the Arizona Department of Health Services (ADHS) and calculated the difference in days between dates to examine their percent agreement. We used logistic regression to assess if any participant characteristics were associated with self-reporting a test date >7…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4- —Arizona Biomedical Research Centre
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLong-Term Effects of COVID-19 · COVID-19 Clinical Research Studies · SARS-CoV-2 detection and testing
Introduction
The SARS-CoV-2 pandemic had immense global impacts on morbidity and mortality and has led to extensive societal changes since the first scientific report was published in February 2020 [1]. While early response and research efforts by necessity focused on the impact of acute illness, a collection of longer-term sequelae were identified early in the pandemic and have been further refined over time. These symptoms include cardiovascular abnormalities, gastrointestinal disorders, fatigue, shortness of breath, and “brain fog” or cognitive difficulties [1–14]. Further characterization of long-term symptoms was achieved when the Centers for Disease Control and Prevention defined new or ongoing post-infectious symptoms occurring 4 weeks or more following acute infection as post-COVID condition, or PCC [15,16]. These conditions have also been referred to as post-acute sequelae of COVID-19 (PASC) or, colloquially, as Long COVID [1–14].
To date, numerous studies have been conducted to assess the risk factors for and symptoms of PCC using various epidemiological methods ranging from retrospective medical record reviews to formal prospective studies [5,7–10,14], with many of these studies employing time since self-reported SARS-CoV-2 positivity. This research strategy arose for several reasons, including reducing the burden on study participants, protecting patient privacy, and the increased adoption of home-testing kits. The first of these kits were granted emergency use approval in the United States by the Food and Drug Administration in November 2020 [17]. Uptake of home testing became widespread throughout 2021 and 2022, with the provision of free tests by the United States government under the Biden Administration beginning in January 2022 [18]. Because many studies of acute COVID-19, as well as PCC, rely on accuracy of the positive test date or the date the illness began, it is of key importance to understand the accuracy of self-reported infection dates by comparing them to laboratory-confirmed dates.
Therefore, the primary objective of the present study was to compare self-reported data on testing positive for SARS-CoV-2 with laboratory-confirmed data obtained through COVID-19 surveillance and case investigations conducted by the Arizona Department of Health Services (ADHS). Our secondary objective was to assess whether the accuracy of self-reported positive tests varied by participant characteristics.
Methods
Study design and setting
This prospective cohort study analyzed data from participants enrolled in the Arizona CoVHORT Study (CoVHORT) [19] between May 28, 2020, and April 1, 2022. The study was conducted in Arizona using a population-based approach that recruited participants statewide through academic health department partnerships, phased mailing campaigns, and recruitment materials at vaccine distribution sites [20]. Data collection occurred through electronic surveys administered via Research Electronic Data Capture (REDCap) [21], with participants prospectively providing self-reported SARS-CoV-2 testing information at enrollment and follow-up assessments. Self-reported data were subsequently linked to laboratory-confirmed test results from the Arizona Department of Health Services (ADHS) surveillance system (MEDSIS). The analysis period covered SARS-CoV-2 positive test results occurring between January 1, 2020, and April 1, 2022.
Study participants
Participant information for the present study was collected from two primary sources: CoVHORT and the Arizona Department of Health Services (ADHS) surveillance dataset (MEDSIS), which captures all laboratory-reported positive SARS-CoV-2 test results in the state as part of routine public health surveillance. All labs and hospitals report positive PCR and antigen tests to ADHS which includes the date of testing. We included CoVHORT participants who were 18 years of age or older at enrollment, had self-reported a positive PCR or antigen test date for SARS-CoV-2 infection at baseline or during follow-up, and were enrolled in CoVHORT between May 28, 2020, and April 1, 2022. Laboratory-confirmed test data from MEDSIS included SARS-CoV-2 positive test results that occurred prior to April 2, 2022. For participants with multiple SARS-CoV-2 infections reported to the state surveillance system, only information pertaining to the first reported infection was included in the analysis. We excluded all participants who were missing a self-reported SARS-CoV-2 test or a matched laboratory-reported test.
Ethical approval
Participants completed all of their surveys electronically via Research Electronic Data Capture (REDCap) [21] and all participants provide written informed consent upon joining the study. The University of Arizona Institutional Review Board provided ethical approval for the completion of the study (Protocol #2003521636).
Available data
CoVHORT collected information about participant demographics at baseline and assessed SARS-CoV-2 infection status at baseline and during follow-up. Demographic information collected at baseline included age (18−24 years, 25−34 years, 35−44 years, 45−54 years, 55−64 years, or ≥65 years), gender (female or male), race (White, American Indian/Alaska Native, Asian, Black or African American, Native Hawaiian/Pacific Islander, Mixed race, or prefer not to answer), ethnicity (Non-Hispanic, Hispanic, or prefer not to answer), highest level of education completed (High School graduate or less, some college, or college graduate or greater), annual household income (<25,000 to 50,000 to 75,000), self-reported employment status (full-/part-time employed, unemployed, or retired), and student status (full-time student or not full-time student). At baseline and during follow-up, participants were asked if they have had a COVID-like illness and/or whether they had tested positive for a SARS-CoV-2 infection, including the type of test used (laboratory conducted polymerase chain reaction (PCR) or antigen tests, or at-home antigen tests) and the date of their test prior to joining the study.
Case definition
For this study, a “case” was defined as a CoVHORT participant who self-reported a positive SARS-CoV-2 test (PCR or antigen) and had a corresponding laboratory-confirmed positive test recorded in the ADHS surveillance system (MEDSIS). Cases were identified through a data linkage process that matched self-reported positive test dates from CoVHORT participants to their laboratory-reported positive test dates in MEDSIS. The most-likely match between self-reported and laboratory-reported test dates was determined by identifying the pair with the smallest difference in number of days among all potential matches for each participant.
Outcomes
The primary outcome of this study was to assess the accuracy of self-reported SARS-CoV-2 test dates by comparing participant self-reported positive test dates to laboratory-confirmed PCR or antigen test dates with independent records in the MEDSIS surveillance system. Accuracy was measured by calculating the concordance between self-reported and laboratory-reported test dates, expressed as percent agreement. The difference in days between participant self-reported dates and laboratory-reported dates was used to categorize participants into accuracy groups, with accurate self-reporting defined as test dates differing by seven days or fewer from the laboratory-reported date, and inaccurate self-reporting defined as test dates differing by more than 7 days from the laboratory-reported date.
Statistical analyses
We first examined concordance between positive test dates reported to MEDSIS and participant self-reported test dates using percent agreement. We calculated the difference, in number of days, between the most-likely self-reported test date match to participants’ laboratory-reported SARS-CoV-2 diagnostic test date. The difference in days between participant self-reported dates and laboratory reported dates were used to categorize participants, and the number and of participants in each group was used to calculate the percent agreement.
Second, we used logistic regression to evaluate factors associated with inaccurate self-reporting of test dates (defined as reporting a test date greater than 7 days different than the laboratory reported date). Independent variables included demographic characteristics (age, gender, race, ethnicity, highest level of education completed, annual household income, self-reported employment status, and student status), symptomatic status at time of diagnostic test for SARS-CoV-2 infection (asymptomatic or symptomatic), and the number of weeks between the date of the diagnostic test and when participants reported the positive test to CoVHORT. We performed univariate logistic regressions to examine the association between self-reporting a test date greater than 7 days different than the laboratory reported test date with each of the independent variables specified. In multivariable models, we included participant age at baseline, with adjustment for number of weeks between the date of the laboratory reported SARS-CoV-2 diagnostic test and the date participants reported their positive test to CoVHORT. Statistical significance was defined as a p-value of 0.05. All analyses were conducted using STATA 18 [22].
Results
A total of 1900 participants recruited into the CoVHORT study between May 28, 2020, through April 1, 2022, who had a positive laboratory test for SARS-COV-2 reported to MEDSIS between January 1, 2020, and April 1, 2022 (Table 1). The median time between participants’ SARS-CoV-2 laboratory test and the date they reported their test to CoVHORT was 5 weeks (IQR: 1.3 weeks – 16.1 weeks).
Table 1: Demographic characteristics of 1,900 Arizona CoVHORT participants aged 18 years or older that enrolled in the study between May 28, 2020, and April 2, 2022, and reported a positive result from laboratory conducted SARS-CoV-2 diagnostic test prior to April 2, 2022.
Overall, 82.5% (n = 1568) self-reported a positive test date within 7 days of the laboratory test date (Table 2). Participant self-reported positive test dates agreed exactly with the laboratory test dates for 28.3% (n = 538) of the cohort. Among participants who did not self-report a positive test date that agreed exactly with the laboratory reported date, most self-reported a positive test date that was after the laboratory test date (46.63% vs 12.53%).
Table 2: Percent agreement and number of days between self-reported SARS-CoV-2 diagnostic test dates and laboratory-reported (MEDSIS) test dates among 1,900 Arizona CoVHORT participants aged 18 years or older, who reported a SARS-CoV-2 infection prior to April 2, 2022.
Table 3 presents the likelihood of disagreement by participant characteristics. For every ten years of age, the odds of disagreement (>7 days difference) increased by 15% (OR 1.15). Individuals who self-reported their employment status as being retired were observed to have a 50% increase in the odds of disagreement (OR 1.50). Those aged 55–64 years (OR 2.12) and 65 + years (OR 1.96) had approximately two-fold odds of disagreement compared to 18–24-year-olds. These associations of age persisted after adjusting for number of weeks since testing positive to reporting illness to CoVHORT (Table 4).
Table 3: Unadjusted odds ratios (95% confidence intervals) for self-reporting a SARS-CoV-2 diagnostic test date to the Arizona CoVHORT greater than 7 days different than the matched laboratory test date reported to ADHS.
Table 4: Adjusted odds ratios (95% confidence intervals) for self-reporting a SARS-CoV-2 diagnostic test date to the Arizona CoVHORT greater than 7 days different than the matched laboratory test date reported to ADHS.
Disagreement became more likely with increasing time intervals (number of weeks) between infection status and reporting illness to CoVHORT (OR 1.02)(Table 4); this association remained unchanged after adjusting for age. Testing positive four weeks prior to reporting the illness to CoVHORT was associated with a 9.4% increased likelihood of an inaccurate self-report of test dates, and an interval of three months was associated with a 30.8% increased likelihood.
Discussion
The results of this study show that the vast majority of participants in a population-based cohort study can accurately self-report their laboratory test date. Approximately 82% of participants reported a test date that was within 7 days of the laboratory-confirmed date collected by the state health department, 89% were within 14 days, and 92% were within 28 days. Those aged 55 years or above or reported being retired were significantly more likely to self-report dates that differed by 7 days or more from laboratory-confirmed testing than younger groups and those who are currently employed at work or as a student. Finally, a longer length of time between testing positive and completing the survey was significantly and inversely associated with the accuracy of test date reporting.
Participant responses were less accurate among participants aged 55 years and above as compared to younger respondents. This finding has been demonstrated in prior studies of the impact of age on survey responses [23]. The accuracy of self-reports requires several aspects of cognitive processes, including understanding survey items, recall ability, and information processing [23]. In addition, individuals with frequent healthcare utilization may face additional challenges in distinguishing between multiple similar medical encounters, potentially increasing variability in recall of specific test dates and medical events [24–26]. Despite the variation in report accuracy by age, a stronger relationship was observed for the length of time between the laboratory-confirmed test date and the date of self-report, which supports prior work. For example, it was demonstrated that monthly self-reports of healthcare utilization effects greater accuracy regarding healthcare utilization than do yearly surveys [24], indicating that reports which are more proximal to the events being queried are more reliable. This phenomenon was first described in detail by Ebbinghaus in 1885, has been supported by a wealth of studies related to memory capacity, and arises due to numerous cognitive processes [27]. Hence, regular administration of surveys regarding COVID-19 test dates, as with the majority of other self-reported data, yield benefits in regard to accuracy of reporting.
Overall, we observed moderately high accuracy between the participant self-reported diagnostic test dates and those that were reported to ADHS by the laboratories that processed participant samples, with 82.5% of participants reporting a date within 7 days of the laboratory reported date. Although, slightly less accurate, our results are similar to other studies examining the reliability and accuracy of self-reported COVID vaccination status and other conditions. Among patients diagnosed with a SARS-CoV-2 infection who visited Canadian emergency departments, 96% of patients correctly reported their vaccination status at telephone follow-up compared to their vaccination record as in the Quebec Vaccination Registry [28]. A separate study conducted among adolescent girls further supports the reliability of self-reported diagnoses, which found a high overall agreement between self-report, medical record, and state surveillance record for past chlamydia and gonorrhea diagnoses [29].
Much about the acute-illness and long-term sequelae remain unknown despite the numerous studies of COVID-19 that have been conducted since the first appearance of SARS-CoV-2. Accurate infection dates, symptom start dates and dates of diagnosis, are crucial to ongoing research of illness progression, including PCC. However, with the widespread availability and utilization of at-home antigen tests, an increasing proportion of COVID-19 cases are likely not reported to health department surveillance systems and therefore are not associated with any laboratory confirmed testing information. The reliability of self-reported infection dates observed in this study support their utilization in future research, including those associated with at-home tests that have not been previously reported to any health-department based surveillance system.
Strengths of this study include the ability to collaborate with the Arizona Department of Health Services to obtain accurate data for laboratory testing dates, a large study population, and well-characterized participant data that permitted the evaluation of characteristics that were related to accuracy of testing self-reports. Limitations include a largely white and highly educated study population, affecting the generalizability of these findings.
In conclusion, these findings demonstrate that self-reported COVID-19 test dates exhibit high concordance with laboratory-based test dates reported to the state health department in Arizona. These findings should provide confidence in using these data to assess the presence or absence of PCC among individuals who have tested positive for SARS-CoV-2.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Zhu N, Zhang D, Wang W, Li X, Yang B, Song J, et al. A Novel Coronavirus from Patients with Pneumonia in China, 2019. N Engl J Med. 2020;382(8):727–33. doi: 10.1056/NEJ Moa 2001017 31978945 PMC 7092803 · doi ↗ · pubmed ↗
- 2Bell ML, Catalfamo CJ, Farland LV, Ernst KC, Jacobs ET, Klimentidis YC, et al. Post-acute sequelae of COVID-19 in a non-hospitalized cohort: Results from the Arizona Co VHORT. P Lo S One. 2021;16(8):e 0254347. doi: 10.1371/journal.pone.0254347 34347785 PMC 8336814 · doi ↗ · pubmed ↗
- 3Fernández-de-Las-Peñas C, Palacios-Ceña D, Gómez-Mayordomo V, Florencio LL, Cuadrado ML, Plaza-Manzano G, et al. Prevalence of post-COVID-19 symptoms in hospitalized and non-hospitalized COVID-19 survivors: A systematic review and meta-analysis. Eur J Intern Med. 2021;92:55–70. doi: 10.1016/j.ejim.2021.06.009 34167876 PMC 8206636 · doi ↗ · pubmed ↗
- 4Michelen M, Manoharan L, Elkheir N, Cheng V, Dagens A, Hastie C, et al. Characterising long COVID: a living systematic review. BMJ Glob Health. 2021;6(9):e 005427. doi: 10.1136/bmjgh-2021-005427 34580069 PMC 8478580 · doi ↗ · pubmed ↗
- 5Asadi-Pooya AA, Akbari A, Emami A, Lotfi M, Rostamihosseinkhani M, Nemati H, et al. Long COVID syndrome-associated brain fog. J Med Virol. 2022;94(3):979–84. doi: 10.1002/jmv.27404 34672377 PMC 8662118 · doi ↗ · pubmed ↗
- 6Hossain MA, Hossain KMA, Saunders K, Uddin Z, Walton LM, Raigangar V, et al. Prevalence of Long COVID symptoms in Bangladesh: a prospective Inception Cohort Study of COVID-19 survivors. BMJ Glob Health. 2021;6(12):e 006838. doi: 10.1136/bmjgh-2021-006838 34906986 PMC 8671853 · doi ↗ · pubmed ↗
- 7Kayaaslan B, Eser F, Kalem AK, Kaya G, Kaplan B, Kacar D, et al. Post-COVID syndrome: A single-center questionnaire study on 1007 participants recovered from COVID-19. J Med Virol. 2021;93(12):6566–74. doi: 10.1002/jmv.27198 34255355 PMC 8426910 · doi ↗ · pubmed ↗
- 8Fumagalli C, Zocchi C, Tassetti L, Silverii MV, Amato C, Livi L, et al. Factors associated with persistence of symptoms 1 year after COVID-19: A longitudinal, prospective phone-based interview follow-up cohort study. Eur J Intern Med. 2022;97:36–41. doi: 10.1016/j.ejim.2021.11.018 34903448 PMC 8648678 · doi ↗ · pubmed ↗
