Development and validation of a person-centered antenatal care scale for low- and middle-income countries
Patience A. Afulani, Hawa Malechi, Daniel Enos Sekwo, Jaffer Okiring, Moro Ali, Osamuedeme J. Odiase, Beryl Ogolla, Joyceline Kinyua, Linnet Ongeri, Raymond A. Aborigo

TL;DR
This paper introduces a new tool to measure respectful and responsive antenatal care in low- and middle-income countries.
Contribution
A validated, person-centered antenatal care scale tailored for low- and middle-income countries is developed and tested.
Findings
The PCANC scale has high reliability (Cronbach’s alpha of 0.90) and validity in Ghana and Kenya.
The scale includes three subscales: dignity and respect, communication and autonomy, and responsive and supportive care.
The tool correlates with global measures of care quality and satisfaction, indicating strong criterion validity.
Abstract
Person-centered antenatal care (PCANC)—antenatal care that is respectful and responsive to people’s needs, values, and preferences—is essential to achieving optimal pregnancy outcomes. Yet, no validated tools exist to comprehensively measure PCANC in low- and middle-income countries (LMICs). We aim to develop and validate a tool to comprehensively measure PCANC relevant to women’s experiences in LMICs. We followed standard procedures for scale development. This included a literature review to adapt items from a prior scale developed in the United States and to generate new items relevant to LMICs; expert reviews with maternal health experts, health care providers, and women with lived experiences (currently pregnant or previously pregnant) to assess content validity; and cognitive interviews and pretesting with pregnant and postpartum women to evaluate clarity, appropriateness, and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2- —Eunice Kennedy Shriver National Institute of Child Health and Human Development
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGlobal Maternal and Child Health · Maternal and Perinatal Health Interventions · Maternal and Neonatal Healthcare
Background
High-quality antenatal care (ANC) is critical for preventing, identifying, and managing pregnancy complications to ensure the health of mothers and their babies[1]. The proportion of women receiving ANC has increased globally, including in low- and middle-income countries (LMICs), with over 80% of women in sub-Saharan Africa (SSA) having at least one ANC contact during their pregnancy[2]. Although current World Health Organization (WHO) guidelines recommend eight antenatal contacts[1], less than two-thirds of women in SSA achieve the previously recommended four visits [2]. Additionally, significant gaps in quality persist, both in service provision and experience of ANC, which contributes further to the low coverage[3–5].
Person-centered care during the pregnancy and childbirth continuum is critical to maternal health from both a quality of care and human rights perspective[6, 7]. We define person-centered antenatal care (PCANC) as ANC that is respectful, compassionate, and responsive to the needs, preferences, and values of pregnant women and gender-diverse people (women used subsequently for brevity), adapted from the definition of person-centered care by the Institute of Medicine[8]. PCANC has both direct and indirect effects on maternal and neonatal outcomes through its effect on patient care, engagement, and health-seeking behavior, including skilled attendance at birth[9, 10]. The WHO recommendations for ANC highlight the importance of a positive experience[1]. Operationalizing these recommendations requires measuring women’s experiences to identify gaps and inform interventions. However, efforts to measure women’s ANC experiences have lagged, with most efforts focused on childbirth[11].
Studies on ANC have primarily focused on frequency and timing[12–17]. Among the few that address quality of care, most focus on the content of ANC service provision, with relatively fewer on the person-centeredness of care [4, 5, 18–22]. This is due to the lack of validated measures for PCANC. A recent scoping review of measures that align with the WHO recommendations for ANC found that existing measures prioritize clinical content of ANC—with measures for women’s experiences of care notably lacking[23]. The few studies that include some measures of ANC experience have, however, highlighted gaps in PCANC. For example, a study in Kenya found that about one-third of women did not often understand the purposes of tests and medicines received and did not feel able to ask questions to the healthcare provider, only half reported being consistently asked if they had any questions, and about a third reported that they could never discuss issues in private[4]. While studies in LMICs have measured various aspects of PCANC, there is still no validated PCANC scale for LMICs.
Similar to ANC, there were few validated quantitative measures of person-centered care during childbirth until recently[24–26]. However, increased interest in women’s experiences during childbirth led to the development of several quantitative tools for this in the last decade[27]. Among the comprehensive validated tools for measuring women’s experiences during childbirth is the Person-centered maternity care (PCMC) scale, initially developed and validated in Kenya and India[26, 28–30]. The PCMC scale, which has domains for communication and autonomy, dignity and respect, and supportive care, has had widespread uptake, with subsequent validation and use in many countries in Africa, Asia, and America[31–36]. The scale has enabled us to extend research on predictors and consequences of PCMC [37–39] and has been used to evaluate interventions to improve PCMC[40, 41]. Furthermore, the PCMC scale informed the development of person-centered family planning and abortion scales[42–44], and notably, a person-centered prenatal (antenatal) care scale, focusing on the experiences of people of color in the United States (the PCPC-US scale)[45].
The PCPC-US scale, which has 34 items with three domains for communication and autonomy, dignity and respect, and responsive and supportive care, was developed in the United States (US) based on standard procedures for scale development but with a focus on the experiences of Black women and other people of color in the US. The psychometric analysis in studies shows that the scale has high validity and reliability in a sample of predominantly Black women [45] and predominantly Latina women in the US [46]. The PCPC-US has, however, not been validated in an LMIC. We seek to address this gap. Our aim in this study is to adapt the PCPC-US scale to make it relevant to LMIC contexts, with an initial validation in Ghana and Kenya. We refer to this scale for LMICs as a PCANC scale (as against PCPC) because the term ‘antenatal care’ is more commonly understood in LMICs than ‘prenatal care,’ although both refer to the same service during pregnancy.
Methods
We followed standard procedures for scale development as was used in the development of the PCMC and PCPC-US scales to ensure both conceptual and psychometric adequacy[47, 48].
Setting
The initial adaptation and validation activities took place in Ghana in the Upper East Region (UER). The final confirmation sample is the baseline data for an ongoing trial in northern Ghana’s Upper East and North East Regions and Migori and Homa Bay Counties in western Kenya. These sites have all been previously described[49, 50]. The Upper East and North East Regions are neighboring regions located in the northeastern corner of Ghana, both sharing borders with Togo to the east. The Upper East Region shares boundaries with Burkina Faso to the north and the North East Region to the south. The Upper East region is divided into 15 districts, with 11 district hospitals, 67 health centers, about 419 Community-Based Health Planning and Services (CHPS) compounds, and one regional hospital that serves as a referral center for the district hospitals[51, 52]. The North East is divided into six districts, with five district hospitals, 21 health centers, and 154 CHPS compounds [53].
Migori and Homabay are also neighboring counties along Lake Victoria in western Kenya. They each have eight sub-counties, each with a sub-county hospital and one county referral hospital. There are about 155 and 263 health facilities in Migori and Homa Bay, respectively, including county and sub-county hospitals, health centers, and faith-based and private health facilities[54].
Procedures to ensure conceptual adequacy
The first step was a literature reviewon women’s experiences during ANC in LMICs to update the items in the PCPC-US scale. Although the PCPC-US scale was developed following a comprehensive literature review to define the construct and domains, the focus for the items was on the experiences of people of color in the US. Thus, we reviewed the literature on women’s experiences during ANC in LMICs to identify issues that are most important during ANC to pregnant people in LMICs. This included reviewing the systematic review of what matters to women during ANC, the WHO guidelines on ANC for a positive pregnancy experience, and previous publications on PCANC[1, 55, 4]. Following the literature review, we assessed the PCPC-US scale to identify which items may apply to ANC in LMICs. We included additional relevant items not already captured based on the literature review.
The next phase involved expert reviews to assess content validity by asking experts to judge the comprehensiveness of items for measuring a construct and their relevance to the target population[56, 57]. Experts were purposively recruited to include maternal health experts, healthcare workers, and women with lived experiences with pregnancy (currently pregnant or had previously been pregnant). Eleven external experts—three researchers based in Ghana, three healthcare providers, three pregnant women, and two women who had recently given birth (recruited from health facilities in the Upper East region of Ghana)—in addition to the research team (four maternal health researchers, two of whom have clinical experience), participated in the expert reviews. This number exceeded the recommended sample size of at least six expert reviewers[58]. Each reviewer was first sent the initial list of questions to review individually. For each question, they were asked to think about its relevance to measuring PCANC in general (as defined for them) and to the ANC experiences of pregnant women in Ghana to rank it on a scale of 0 to 3 (“0” not relevant, “1” somewhat relevant, “2” quite relevant, and “3 highly relevant”). Experts were also asked to assess the comprehensiveness of the items for measuring PCANC on a scale of 0 to 3 (0. Not comprehensive; 1. Slightly comprehensive; 2. Comprehensive; 3. Very comprehensive) and whether all relevant aspects of PCANC were covered by these items on a scale of 0 to 3 (0. No; 1. Yes, somewhat; 2. Yes, definitely). In addition, they were asked to make recommendations on what to add, remove, or change. Following the individual reviews, experts were brought together as a group for further discussion and consensus-building in July 2022. This meeting was conducted in person at NHRC and lasted about four hours.
We revised the item list based on the expert reviews. Then, we conducted cognitive interviews with potential respondents to assess their understanding of the questions, evaluate problems with the wording of questions, and determine whether they were relevant and context-appropriate[59]. Trained research assistants recruited participants who had recently received ANC (pregnant or within 6 months of giving birth) from health facilities and conducted one-on-one interviews with them using the revised questions. Interviews were conducted at a convenient time and location for the participants in English and two local languages (Gurune and Kasem) following a training that included appropriate translations for the questions. Participants were told that their help was needed to develop the tool and that they should be comfortable recommending changes to how the questions were asked. They were then asked to respond to each question, followed by probes to understand why they answered the way they did, concerns with the wording of the questions, the relevance of the questions to their care experience, and any suggestions they had for improving the questions. We used probes instead of “thinking out loud” because these have worked better in previous cognitive interviews in similar settings. Interviews were recorded, and research assistants were engaged to debrief and make needed changes after six interviews (two in each language). Six additional interviews were conducted, during which only minor changes were suggested. Twelve cognitive interviews were conducted in total in October 2022, meeting the recommended sample size of at least 10 for cognitive interviews[60, 61].
Following revisions from the cognitive interviews, the scale items were integrated with a questionnaire including demographic and other questions, and the full questionnaire was pretested to identify any final issues with the scale items and the full questionnaire. We first pretested the revised tool with 12 pregnant and postpartum women, made relevant changes, and then pretested with another 12 women, which meets the recommended sample size of 15 to 30 for pretesting[62]. No significant changes were identified during the second set of pretests.
Survey
The final questionnaire was administered in a cross-sectional survey to obtain quantitative data to assess the psychometric adequacy of the scale. Eligible participants were women in the third trimester of pregnancy or who had given birth within the last six months and who received ANC during their pregnancy. The surveys were conducted with 600 women (300 prenatal and 300 postpartum) in August and September 2023. Sample size estimation was based on the rule of thumb of about 5–10 subjects per item on the scale, with 300–500 considered good and 500 and above considered very good, regardless of the number of items[47, 63]. Following an analysis of data from this first survey, the questions were revised and administered as part of the baseline surveys in the Caring for Providers to Improve Patient Experience (CPIPE) trial [50] to 2000 postpartum women in Ghana and Kenya (1000 per country and ~ 500 per region or county) between March and October 2024. Eligible participants were postpartum women who had given birth within the 12 weeks preceding the survey in the 40 study facilities. For both surveys, trained research assistants recruited and interviewed women in study facilities and surrounding communities. Using the delivery registers from the study facilities, the research assistants identified eligible women and contacted them for interviews. Additionally, women who had just delivered and were still in the facility or those who were attending postnatal care (PNC) appointments were recruited. Participants were told about the study, and those who consented to participate were invited to participate in one-on-one interviews at a time and location that was convenient to them. We used a convenient sampling approach where all eligible women identified were interviewed until the required sample size was achieved. The surveys were programmed into REDCap, and data was entered using a tablet.
All research participants at each stage provided written informed consent and received a small token of appreciation (two cakes of soap in Ghana and Ksh 400 (~$3) in Kenya). Ethical approval was obtained from the Institutional Review Boards of the University of California, San Francisco (UCSF) and the Navrongo Health Research Center (NHRC) for the validation study and from UCSF, NHRC, and the Kenya Medical Research Institute for the CPIPE trial.
Psychometric analysis
We assessed the scale’s construct and criterion validity and internal consistency reliability[47, 64]. We used inter-item correlations and factor analysis for item reduction and to assess construct validity—the degree to which the items represent the underlying conceptual structure. We first examined distributions of the items to note items with very low or high occurrences and to recode as needed. Negatively worded items were recoded to ensure higher numbers represent more person-centered care. Further, as in prior analyses, to ensure that all response options ranged from 0 to 3 for scoring, we took the conservative approach that assumes a positive, but not perfect, for not applicable responses. All items that had a “not applicable” or another fourth response were thus recoded to the upper middle category (2: most of the time, for positively worded items and 1: a few times, for negatively worded items)[45]. Next, we examined the correlations between individual items to identify items with very low and high correlations and the average inter-item correlation, aiming for an ideal average inter-item correlation between 0.20 and 0.40[65].
We then conducted iterative exploratory factor analysis (EFA) with the initial Ghana sample, using principal factoring with oblique rotations, which allow for correlation between the rotated factors, given that person-centered care domains are theoretically related[66]. We used the Kaiser-Meyer-Olkin (KMO) measure of sampling adequacy to assess the suitability of the variables for factor analysis, aiming for values between 0.8 and 1; Kaiser’s rule of retaining only factors with eigenvalues exceeding one and the “break” in the scree plot to determine the number of factors; and factor loadings and uniqueness to assess the performance of individual items, using a cut-off of < 0.3 for low loadings and > 9.0 for high uniqueness to mark items for potential removal (unless there was a strong conceptual rationale for their inclusion)[48, 66, 67]. Internal consistency reliability was assessed using Cronbach’s alpha, aiming for values ≥ 0.7[47, 64].
Following exploratory factor analysis, we conducted confirmatory factor analysis (CFA) using the CPIPE baseline surveys in Kenya and Ghana to assess if the three-factor structure could be uniformly applied across settings. In model building, we allowed error terms to correlate between items within the same proposed domain and used modification indices to systematically include the covariances of the errors. We assessed the goodness-of-fit of the full and reduced scales and subscales by estimating the root mean square error of approximation (RMSEA), p of close fit (pclose), comparative fit index (CFI), Tucker-Lewis index (TLI), and the Standardized root mean squared residual (SRMR) [68–70].
The responses for the final set of items are added to create summative scores, with scores standardized by dividing the mean score by the maximum possible scores (e.g., for a 36-item scale, the maximum score is 108 [36*3] and multiplying by 100). This creates a standardized score ranging from 0–100, where 0 is the worst and 100 is the best. Since there is no gold standard measure of PCANC, we assessed criterion validity—whether the measure is related to other measures in theoretically predictable ways—by examining how scores on the scale are associated with satisfaction and perceived quality of ANC and intent to use the same ANC facility in the future using cross-tabulations and linear regression. We assessed the criterion validity of the short scale by examining its correlation with the full scale and these measures.
Results
Conceptual adequacy
During the Initial review of the items by the study team, we retained all 34 items from the PCPC-US scale on assessment of face validity—at face value, they all appeared relevant to the experiences of women everywhere. We also added nine items we identified as relevant to ANC in LMICs from the literature, bringing the total number of items to 46. These 46 items were then sent out for expert reviews. During the expert review meeting, all items were deemed relevant and recommended to be maintained except for three questions. For the question, “Did providers knock on your room’s door and wait for a response before entering,” It was agreed by all that it was not relevant because in the setting, the provider is usually seated in the consulting room, and the patient walks in. This question was thus excluded at this stage. Several people also suggested removing the items on “preferred clinic” and “preferred provider” because they were generally not feasible in the setting and thus not considered highly relevant. However, a few recommended that they should be maintained. These two items were, therefore, marked for potential deletion later depending on their performance in subsequent phases. All respondents rated the tool as comprehensive or very comprehensive, which, together with the individual item ratings, indicated high content validity. There were, however, various suggestions to reword some questions and recommendations to include questions on physical and financial accessibility of services and counseling on specific issues. This increased the number of items to over 60, although some questions recommended for inclusion were not framed in a way that could facilitate scoring as part of a scale (e.g., questions on how much it costs to pay for services).
During cognitive interviews, most participants rated all items as important, with recommendations made to improve the wording of some items and to reorder some questions for better flow in the first round of interviews. No substantive changes were recommended during the second set of cognitive interviews. At the end of cognitive interviews, there were 68 questions, including 47 questions related to the three conceptual domains from the PCPC-US scale: four questions on wait time for various services, six questions on communication regarding specific ANC services, seven questions on counseling on specific issues, and four questions on perceived physical and financial accessibility. These items were all included in the questionnaire for pretesting and administered in the survey.
Psychometric adequacy
Initial validation sample
The analytic sample for the initial psychometric analysis using data from only Ghana was 595 because of incomplete data from five respondents. The demographics of respondents in the analytic sample are shown in Table 1. The average age was 27 years, with 83% between 20 and 34 years old. Most respondents were married or partnered (93%) and had one to three children (73%). Most had less than a college education (86%) and earned or received less than 300 GHS (<$30 at the time) (76%) a month. Distributions of other variables are shown in Table 1.
Distributions for individual PCANC items for the initial sample are shown in Appendix 1. In general, most questions had responses across the four response options. However, most of the negatively worded items (neglect, verbal abuse, discrimination, physical abuse, bribe) had a very low frequency of occurrence, with 94% or more responding “no never” to a negative occurrence. A few questions were also not applicable (NA) to all respondents, with > 30% NA for wait time to retrieve folder and family treated with respect and >15% NA on wait time for drugs and labs, support when needed help, support to deal with emotional wellbeing, and preferred ANC provider and clinic. The correlations between the full set of items varied widely from negative to 0.8. Of note was that the wait time and negatively worded variables negatively correlated with most other items.
The KMO values ranged from 0.6 to 0.9, with an overall KMO of 85, indicating suitability for factor analysis. Initial exploratory factor analysis with all 68 items yielded nine factors with eigenvalues of >1, accounting for 81% of the cumulative variance, although the scree plot suggested 3–4 factors. Most items from the original PCPC-US scale loaded on the first two factors, and the items on financial and physical accessibility and counseling and communication on specific tests and procedures loaded on the other factors. Therefore, most of these items were removed at this point as the analysis confirmed they were conceptually different from the rest of the items. The six questions about specific examinations and tests were recoded to combine into two variables after being told the results. All the perceived wait time questions also loaded poorly with the rest of the items. However, because of the importance of timeliness to responsiveness, we decided to retain the item on wait time to see the clinical provider and wait time for getting labs done and medications from the pharmacy (because the team thought these were important sources of delay in the facility). We dropped the question on wait time for folders (following clarifications that while some facilities keep folders, in many cases, the antenatal record that mothers keep is used; hence, mothers do not wait for their folders, explaining the high proportion of NA on that question). The two variables on wait time for labs and drugs were, however, recoded to one item on wait time to get labs or drugs because of the large proportion of NA responses for each. We also decided to combine the questions on whether they were asked about their emotional and mental wellbeing because they were interpreted similarly, had similar distributions, and were strongly correlated (r=0.76). Other items from the initial PCPC-US scale loaded poorly were retained at this stage, yielding 50 items for subsequent analysis.
EFA of the 50 items yielded five factors with eigenvalues greater than 1, accounting for 78% of the cumulative variance. However, most items were loaded on three factors, and the scree plot supported a single dominant factor. Most items had a loading of >0.3 on one of the three factors and on the single factor when constrained to this factor structure (appendix 2). The negatively worded items (discrimination, physical abuse, bribe, and neglect), which had low frequencies in the sample, as noted above, continued to load poorly (<2) and had high uniqueness (>0.95), so they were dropped at this stage. In iterative EFAs, more items were dropped based on different rationales to produce a shorter scale. This included removing items based on expert reviews because of low relevance (preferred clinic and provider) and those that had loadings of <0.3 and uniqueness >0.90 (pressured). Further, for items that were strongly correlated or captured a similar concept, only one was retained, with the one that was conceptually broader or had more favorable loading and uniqueness maintained (e.g., for “explained medicines” and “understood medicines,” we kept “explained medicines;” for “explained examinations and procedures” and “understood examinations and procedures”; we kept “explained examinations and procedures”; for asked “if they had questions “and “encouraged to ask questions,” we kept “encouraged to ask questions”; for “could ask any questions” and “held back from asking questions,” we kept “could ask any question”; for “language they could understand” and “language level they could understand,” we kept “language level they could understand”; and for “showed they cared” and “best care” we kept “best care”). “Family respected” was also removed because of the high NA responses (47%), as most respondents did not have a family with them. Some items with moderate loading (<3 but >2: privacy, verbal abuse, cleanliness, enough staff, time with the provider, asking about their birth plan, asked physical wellbeing) were, however, retained in this phase to assess potential removal in subsequent phases. The wait time items were also retained despite poor loading because of the conceptual relevance of timeliness. This yielded 36 items.
Factor analysis of the 36 items (Table 2) yields factors with eigenvalues of >1, accounting for 81% of the cumulative variance, but with one dominant factor (Figure 1) with an eigenvalue of 7.5, accounting for 61% of the cumulative variance. All items have loadings ≥3 on the three factors except for the two wait time variables with loadings less than two and six items with loadings between 2 and 3 (privacy, verbal abuse, language level, time with provider, cleanliness, and enough staff). The uniqueness for most items was <0.9 except for five of these items, with a uniqueness of 0.95 for the wait time variables on the three-factor structure (Table 3). When the analysis was restricted to the single factor structure, all items had loadings of >0.3 except 10 items (the two wait time variables have loadings less than two; and eight items—privacy, verbal abuse, time with provider, cleanliness, enough, birth plan, told exams, told results—have loadings between 2 and 3). The best fit in this sample was thus 28 items for a multidimensional scale and 26 for a unidimensional scale. Sequential exclusion of all 10 items with low loading in the 3 or single-factor structure did not significantly change the Cronbach’s alpha, and all versions had correlations >0.96. Thus, we proceeded with the 36 items for testing in subsequent samples.
As in prior validation studies, because the loading of the items does not group into clean conceptual categories, we created sub-scales based on conceptual groupings adapted from the World Health Organization’s experience of care domains [71]. This gives three subscales for “dignity and respect,” “communication and autonomy,” and “responsive and supportive care.” Factor analysis of items in each subscale yielded a single factor, with all items loading at >0.4 on the single factor and with uniqueness <0.8, except for 10 items noted above (Table 3).
The full 36-item scale has Cronbach’s alpha of 0.89, and all subscales have Cronbach’s alphas >0.7 (Table 3). This does not change with the exclusion of the wait time and time with provider items. The average interitem correlation is 0.18 for the full 36-item scale and 0.27, 0.20, and 0.15 for the “dignity and respect,” “communication and autonomy,” and “responsive and supportive care” sub-scales, respectively. These increased to 0.20 and 0.22 for the full scale and “responsive and supportive care” subscale, respectively, when the wait time and time with provider variables are excluded (i.e., 33-item scale).
The average standardized score for the 36-item PCANC scale is 77.6 out of 100, with scores of 86.4, 73.5, and 76.8. respectively for the “dignity and respect,” “communication and autonomy,” and “responsive and supportive care” sub-scales for the Ghana test sample (Table 4). Cross-tabulating and regressing the full scale on patients’ ratings of satisfaction with ANC services, general perceived quality of ANC, and whether they would receive ANC in the same facility if they were to be pregnant again showed increasing PCANC scores with higher ratings of satisfaction, perceived quality of care, and intent to receive ANC in the same facility (Table 4)
Confirmation sample:
Among the CPIPE baseline participants, 1,992 received ANC, with incomplete PCANC data from four additional respondents, which yielded an analytic sample of 1,988 for the confirmation sample. The demographics for the validation sample are shown in Table 5 by country. The average age was 26.9 (SD=5.8) and 25.4 (SD=5.6) years for Ghana and Kenya respectively. Most respondents were married or partnered (91.5% and 79.8% for Ghana and Kenya, respectively) and had 1 to 2 children (55.4% and 51.7% for Ghana and Kenya, respectively). Only about 10% of respondents had a college education (9.5 and 11% for Ghana and Kenya, respectively), and most earned or received <$100 a month (1000 or less GHS for 61.3% in Ghana and 10000 or less KES for 68.7% in Kenya).
Forty-six PCANC questions were included in the CPIPE trial baseline surveys, including the 36 items from the psychometric analysis, nine other items that the team thought were important to reassess in a new sample, and a new question on the accessibility of washrooms because some respondents noted there was no washroom in the facility in response to the question on cleanliness in the previous survey. The distribution of these items in this sample is shown in Appendix 3. An initial exploratory factor analysis with this sample confirmed that the nine items included that performed poorly in the initial Ghana sample continued to perform poorly in the validation sample (appendix 4), so these were finally dropped. In addition, the item on time with a provider continued to perform poorly in this sample, with a uniqueness of 1 on the single-factor structure. So, this was also dropped at this point. The two wait time questions, however, loaded adequately with the three-factor structure, although they were the only two items loading on the third factor and loaded poorly in the single-factor structure. The new question on the accessibility of washrooms also performed well. A decision was thus made to retain these three items, which maintained a 36-item scale.
The average KMO for the 36 items is 0.94 (range of 0.56 to 0.97), with three factors >1, which accounts for 87% of the cumulative variance, and one dominant factor (Figure 2), accounting for 66% of the cumulative variance. All 36 remaining items were loaded at >0.3 on the three factors except for verbal abuse and auditory privacy, which had loadings between 2 and 3 (appendix 6) but were maintained because of their strong conceptual relevance. As in the test sample studies, we created the sub-scales based on conceptual groupings for “dignity and respect,” “communication and autonomy,” and “responsive and supportive care.” Factor analysis of items in each subscale yielded a single factor, with all items loading at >0.4 on the single factor and with uniqueness <0.8, except for verbal abuse, language, and the time variables (appendix 5). Given the very poor loading of the two wait time variables, we also tested a 34-item scale that excludes those two variables. Further, to provide a smaller subset of items for instances where all 36 items might not be feasible to administer, we used an iterative approach where all items with factor loading <.3 were dropped, and additional items dropped based on our assessment of the strength of their relevance and similarity to other retained items, leading to a 20-item short scale (items shown in table 6).
These 36-item and 20-item scales with the three sub-scales were then evaluated in the Confirmatory Factor Analysis (CFA) for the combined sample and each country. The standardized factor loadings from the CFA for the observed items mainly ranged from 0.08 to 0.76, with only one item (wait time for labs or drugs) having a negative factor loading. All factor loadings significantly differed from 0, suggesting that most observed items positively correlated with the latent variables (Table 6). The model in Ghana alone had factor loadings that ranged from −0.84 to 0.84, and in Kenya, the factor loadings ranged from 0.08 to 0.71, with all the factor loadings significantly different from 0. The Goodness-of-Fit Indicators of the three-factor confirmatory model indicated the best-fitted model was the model with covariances of the error terms of 10 or more for the two countries combined indicated by RMSEA<0.08, p-close >0.05, CFI and TLI>.9, and SRMR <0.05 (Table 7 and Appendix 6).
The full 36-item scale has Cronbach’s alpha of 0.92, increasing to 0.93 when the two wait time variables are dropped, and that of the 20-item scale is 0.89. All subscales have Cronbach’s alphas >0.75 (Table 8). The average standardized score for the 36-item PCANC scale for the validation sample is 71.5 out of 100, with a similar score of 71.1 for the 34-item version and 70.6 for the 20-item version. The scores for the “dignity and respect,” “communication and autonomy,” and “responsive and supportive care” sub-scales are 81.2, 65.3, and 72.8, respectively, for the full sample, and that by country is shown in Table 8.
Cross-tabulating and regressing the full scale on patients’ ratings of satisfaction with ANC services, general perceived quality of ANC, and whether they would receive ANC in the same facility if they were to be pregnant again showed increasing PCANC scores with higher ratings of satisfaction, perceived quality of care, and intent to receive ANC in the same facility (Table 9). Scores on the 36, 34, and 20-item scales are strongly correlated (r=1.0 between the 34 and 36-item versions and 0.97 and 0.98 with the 20-item scale), suggesting good criterion validity for the shorter versions.
Table 10 shows all the PCANC items and the reasons for their inclusion/exclusion in the scale’s final long and short forms.
Discussion
We developed a PCANC scale with good content, construct, criterion, and known-group validity that is relevant to LMICs using a rigorous scale development process. The initial phases of scale development, including the literature reviews, expert reviews, and cognitive interviews, ensured content validity, the most important form of validity for any measurement tool [56, 57]. The psychometric analysis using a sample of pregnant and postpartum women in Ghana and Kenya yielded a 36-item comprehensive scale with adequate structural validity, three subscales for dignity and respect, communication and autonomy, and responsive and supportive care. We also developed a shorter 20-item version for instances where the 36 items might be too much. The full scales and subscales have good internal consistency reliability, with Cronbach’s alpha > 0.8 for the full scale and > 0.7 for the subscales. Higher scores on the scale are associated with higher satisfaction, perceived quality of care, and intent to use the facility in the future, as hypothesized, indicating criterion validity.
In general, the PCANC scale developed for LMICs performed similarly to the original PCPC-US scale[45]. The PCPC-US scale had 34 items on the full scale compared to 36 items on this scale. Most (26) items in the PCPC-US scale were retained in this scale, with nine items (knocking on the door, information showing they cared, family respected, physical abuse, discrimination, hold back on asking questions, coerced, preferred clinic, and provider) excluded. These are replaced with 11 items in the current version (time to get labs and drugs, auditory privacy, attention when needed help, greeted, friendly care, received best care, communication on tests and procedures, assessment of physical wellbeing, competence, availability of washrooms, cleanliness, and enough staff) not included in the PCPC-US scale. These additional items represent issues that are relevant in LMICs. The same three sub-scales are used in both versions as they represent relevant constructs globally.
Although wait time, considered very relevant during expert reviews, performed poorly in the psychometric analyses, it is maintained in the current version of the scale. Timeliness is an important aspect of person-centered care, particularly related to responsiveness, and should be measured in any work to improve person-centered care. Including timeliness thus increases the content validity of the scale. Timeliness can, however, be considered a separate construct reflected in the loading of the two wait time variables. This is also reflected in their being presented as separate domains of healthcare quality (Safe, Effective, Patient-centered, Timely, Efficient, and Equitable)[8]. Thus, we could have a 34-item scale with the two items on timeliness. In addition to safety, we have maintained them in one scale because they are unlikely to be measured if excluded or if each domain has a different scale. The current scale thus includes Safety, Patient-centeredness, and Timeliness, which, when used as an outcome, can be used to assess the effectiveness and efficiency of interventions and, when used with equity stratifies, can be used to assess equitable healthcare quality.
Many of the negatively worded items (physical abuse, neglect, discrimination, and neglect) were considered relevant; however, they did not perform even moderately well, likely because of their low frequency of occurrence in our sample and were thus excluded. These items may perform better when overt mistreatment during ANC is high. Thus, we recommend assessing all 46 relevant PCANC items in future validation studies to develop the best set of items that perform well across settings. Further, many questions excluded from the scale, including accessibility, service provision, counseling, and overt mistreatment, are still relevant to high-quality PCANC. Thus, although not included in scoring in the PCANC scale, these can be examined separately to get a full picture of the quality of ANC.
Like prior scales, the PCANC scale includes a mix of subjective (e.g., felt treated with respect) as well as more objective questions (e.g., procedures explained)[26, 45]. Both types of questions are important in assessing person-centered care in a way that accounts for people’s subjective experiences and reflects what happens during the encounter independent of people’s expectations[72]. The PCANC scale is, therefore, an actionable patient-reported experience measure that can inform quality improvement activities at various levels of the health system. The full scale captures all the key domains of PCANC and can thus be used for comprehensive assessments. The sub-scales can be used independently where some domains are deemed more relevant. The response format ensures that people’s responses are captured on a continuum, increasing the tool’s responsiveness to detect change. The PCANC scores can thus be tracked to examine intervention effects or to assess change over time and across settings.
Strengths and Limitations
The PCANC scale is based on a strong theoretical and empirical foundation, given that it is built on prior work, including the PCMC and PCPC-US scales. The rigorous adaptation process based on standard scale development steps ensured that the final scale had high content validity and was relevant to the target population. A potential limitation is that it may not capture issues of relevance to other LMICs, given that the initial adaptation process was only in Ghana and the final validation samples are only from Ghana and Kenya. However, based on our literature review, we believe the items will apply in most other LMIC settings as they were directly applicable in Kenya without a need for additional adaptation. What might be missing are items that are unique to different settings. For example, the overt mistreatment items fell off because of the low frequency of occurrence of these behaviors in the study samples. They will, however, be important where overt mistreatment during ANC exists. Future testing in diverse settings is therefore needed. The most important but challenging part of tool development is ensuring content validity. Yet most validation efforts focus on just psychometric adequacy. The pool of items we developed will thus provide a starting point for future psychometric assessments in different settings. Given the rapid uptake of the PCMC scale and validation in other settings after the initial validation, we believe this validation study in two LMIC countries will spur further validation in other LMIC settings. Other drawbacks include the length of the scale and associated participant burden. We address this by proposing a shorter version.
Conclusions
We present a valid and reliable comprehensive tool to measure the key domains of PCANC in LMIC contexts. Like prior person-centered tools, this tool will be valuable for needs assessment to identify gaps in PCANC to inform interventions and for evaluation to assess the effect of interventions on PCANC. In addition, it can be used for research to examine predictors to identify inequities in PCANC and assess its impact on maternal and neonatal outcomes to improve advocacy. It could also be used for routine monitoring for quality improvement and accountability. PCANC is critical to achieving the vision for maternal and newborn health. However, monitoring progress toward this vision will be difficult without measurement tools. This study thus addresses a critical gap in efforts to measure and provide high-quality ANC in LIMICs.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1WHO. WHO recommendations on antenatal care for a positive pregnancy experience. WHO. 2016. http://www.who.int/reproductivehealth/publications/maternal_perinatal_health/anc-positive-pregnancy-experience/en/. Accessed 12 Oct 2020.28079998 · pubmed ↗
- 2Antenatal care. UNICEF DATA. 2024. https://data.unicef.org/topic/maternal-health/antenatal-care/. Accessed 9 Aug 2024.
- 3Diamond-Smith N, Sudhinaraset M, Montagu D. Clinical and perceived quality of care for maternal, neonatal and antenatal care in Kenya and Namibia: the service provision assessment. Reproductive Health. 2016;13:92.27515487 10.1186/s 12978-016-0208-y PMC 4981972 · doi ↗ · pubmed ↗
- 4Afulani PA, Buback L, Essandoh F, Kinyua J, Kirumbi L, Cohen CR. Quality of antenatal care and associated factors in a rural county in Kenya: an assessment of service provision and experience dimensions. BMC Health Serv Res. 2019;19:1–16.31590662 10.1186/s 12913-019-4476-4PMC 6781384 · doi ↗ · pubmed ↗
- 5Amponsah-Tabi S, Dassah ET, Asubonteng GO, Ankobea F, Annan JJK, Senu E, An assessment of the quality of antenatal care and pregnancy outcomes in a tertiary hospital in Ghana. PLOS ONE. 2022;17:e 0275933.36223426 10.1371/journal.pone.0275933 PMC 9555636 · doi ↗ · pubmed ↗
- 6White Ribbon Alliance. Respectful Maternity Care: The Universal Rights of Childbearing Women (Full charter). 2011. http://www.healthpolicyproject.com/index.cfm?ID=publications&get=pub ID&pub ID=46. Accessed 28 Mar 2017.
- 7WHO. Prevention and elimination of disrespect and abuse during childbirth. WHO. 2015. http://www.who.int/reproductivehealth/topics/maternal_perinatal/statement-childbirth/en/. Accessed 1 Apr 2019.
- 8Institute of Medicine. Crossing the Quality Chasm: A New Health System for the 21st Century. 2001.25057539 · pubmed ↗
