Norwegian Translation and Validation of the Pelvic Organ Prolapse/Incontinence Sexual Questionnaire-IUGA Revised (PISQ-IR)

Tone Prøsch-Bilden; Signe Nilssen Stafne; Silje Kristine Sveen Ulven; Susan Saga

PMC · DOI:10.1007/s00192-025-06106-0·March 19, 2025

Norwegian Translation and Validation of the Pelvic Organ Prolapse/Incontinence Sexual Questionnaire-IUGA Revised (PISQ-IR)

Tone Prøsch-Bilden, Signe Nilssen Stafne, Silje Kristine Sveen Ulven, Susan Saga

PDF

Open Access

TL;DR

This study translated and validated a questionnaire to assess sexual function in Norwegian women with pelvic floor dysfunctions, finding it reliable for sexually active women.

Contribution

The paper provides a culturally adapted and validated version of the PISQ-IR questionnaire for Norwegian-speaking populations.

Findings

01

The Norwegian PISQ-IR showed good face/content validity, internal consistency, and construct validity for sexually active women.

02

Exploratory factor analysis supported the original structure for sexually active women but showed inconsistencies for non-sexually active women.

03

16 out of 19 predefined hypotheses were confirmed, indicating strong construct validity.

Abstract

The Pelvic Organ Prolapse/Incontinence Sexual Questionnaire-IUGA revised (PISQ-IR) measures sexual function in women with pelvic floor dysfunctions (PFD). The aim of this study was to translate the PISQ-IR to Norwegian and to assess its psychometric properties. The instrument was translated and reviewed through cognitive interviews with women from the target group and multidisciplinary clinical experts to establish face/content validity and cultural equivalence. Thereafter, a cross-sectional study of women with PFD from two Norwegian University hospitals was conducted. Floor and ceiling effects and internal consistency were calculated for all subscales. Construct validity was assessed through exploratory factor analysis (EFA) and by testing 19 theoretically derived hypotheses. Of 157 respondents, 111 (71%) women considered themselves sexually active (SA) and 46 (29%) non-sexually…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases2

Pelvic Organ Prolapse PFD

Figures1

Click any figure to enlarge with its caption.

Flowchart of four steps from translation to psychometric testing

Funding2

—The Joint Research Committee between St Olavs Hospital and the Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology
—NTNU Norwegian University of Science and Technology (incl St. Olavs Hospital - Trondheim University Hospital)

Keywords

IncontinencePelvic floor dysfunctionPelvic organ prolapsePISQ-IRPsychometric validationSexual function

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPelvic floor disorders treatments · Anorectal Disease Treatments and Outcomes · Urinary Tract Infections Management

Full text

Introduction

Pelvic floor dysfunction (PFD), including urinary incontinence (UI), anal incontinence (AI), and pelvic organ prolapse (POP) is common in adult women. The estimated prevalence is 23–25% of women in general, increasing with age up to 53% among 80-year-olds and older [1, 2]. PFD may have a profound impact on women’s psychological, social, physical, and sexual well-being [3, 4]. In a review of PFD, the prevalence of sexual dysfunction was reported to be 30–50% in the general female population, increasing to 50–83% among women with PFD [5].

The importance of patient-reported outcome measures (PROMs) is widely acknowledged and an essential component in the assessment of outcome after interventions, and its impact on patients’ clinical condition and well-being [6]. The Female Sexual Function Index (FSFI) and Pelvic Organ Prolapse/Urinary Incontinence Sexual Function Questionnaire 12 (PISQ-12) are two common questionnaires previously used to assess the association between PFD and sexual health. Although FSFI has demonstrated good reliability and validity across different populations [7], it is not specific for PFD. Furthermore, PISQ-12 is not validated in a population of women with AI, nor developed within a framework that intentionally included women without a partner, or those who do not consider themselves to be sexually active [8]. To address these limitations, the International Urogynecological Association (IUGA) proposed a revised version of PISQ-12, the Pelvic Organ Prolapse/Incontinence Sexual Questionnaire, IUGA-revised (PISQ-IR) in 2013 [8].

The PISQ-IR was originally validated in > 550 women seeking treatment for UI, AI, and/or POP across the USA and the UK. The questionnaire has demonstrated good evidence for content validity, internal consistency, and some good evidence of test–retest reliability and responsiveness to change [8, 9]. PISQ-IR has been validated in several languages, supporting the internal structure, internal consistency, and criterion validity of the questionnaire [10–16].

In Norway, there are currently no validated questionnaires on sexual function among women with PFD. Although both PISQ-12 and FSFI have been translated into Norwegian, neither of them have been subject to robust psychometric testing, which is important to establish validated measurement instruments. In this study, we therefore aimed to translate the PISQ-IR to Norwegian, and to assess its psychometric properties among women with PFD in the tertiary healthcare system with particular focus on face/content validity, construct validity and internal consistency.

Materials and Methods

This study required a triangulation of methods involving cognitive interviews of women from the target group, feed-back from clinical experts, and a quantitative survey. The Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) guidelines [17] and the PISQ-IR Translation Protocol (IUGA) has guided the work.

Translation, Pilot-Testing, and Cultural Adaptation

The PISQ-IR was translated and cross-culturally adapted to Norwegian through several steps according to the PISQ-IR Translation Protocol (IUGA). First, a translation of the questionnaire was provided by a bilingual translator from English to Norwegian. Second, a pilot-test for comprehensibility, readability, and equivalence was made in three subsequent phases: (1) cognitive interviews with 10 women with PFDs, (2) assessment by multidisciplinary clinical PFD experts, and (3) additional cognitive interviews with another group of 10 women with PFDs following the two preceding steps. Third, another independent bilingual translator back translated the questionnaire from Norwegian to English. Finally, the IUGA Translation Working Group reviewed and approved this version of the questionnaire. Discrepancies were identified and amended between the steps (Fig. 1).Fig. 1. Flowchart of four steps from translation to psychometric testing

Study Population and Data Collection

In our study, three groups were recruited. The first group was a sample of 20 women with UI, AI, or POP included in the pilot testing of the questionnaire. This group was recruited from a physiotherapy pelvic floor training class at St. Olav’s Hospital, where they also received information about the study, and volunteered to participate. They were recruited in two rounds with 10 women in each, who underwent cognitive interviews during the pilot testing. They were not enrolled in the subsequent field-testing of the questionnaire. The second group included six clinical experts, comprising a gynecologist, two pelvic floor physiotherapists, two stoma care nurses, and one sexologist, who reviewed and gave feedback on the translated instrument. The correspondence was via email. The cognitive interviews with afflicted women and the evaluation by clinical experts were both part of the pilot-testing of the translated Norwegian version of the questionnaire, and served to establish the foundation for the face/content validity and cultural equivalence between the Norwegian and English versions of the PISQ-IR.

The third group comprised female patients recruited from the Department of urogynecology and gastrointestinal surgery at St. Olavs hospital (Trondheim) and the University hospital of North Norway (Tromsø), from August 2020 to October 2021. This last group was included in the field-testing of the translated PISQ-IR. For inclusion, patients had to be referred to the outpatient clinics diagnosed with UI, AI and/or POP, or any combination of the three, aged 18 years or above, have the ability to understand Norwegian, and have the ability to complete a patient-reported outcome questionnaire. Exclusion criteria included pregnancy or concomitant vulvodynia, painful bladder syndrome or chronic pelvic pain. A predetermined sample of 120 sexually active (SA) and 100 non-sexually active (NSA) women was targeted for inclusion, based on the power calculation derived from the PISQ-IR Translation Protocol. Specifically, performing a factor analysis requires 4–10 respondents per item as a rule of thumb, with 100 respondents as a preferred minimum [18]. In this study, this would require 84–210 SA women (21 items) and 48–120 NSA women (12 items).

In the field-testing, health secretaries identified patients who met the inclusion criteria based on the information given in referrals from General Practitioners. Eligible participants received an information letter and the survey packet together with the invitation to the patients first consultation at the hospital. On the first page of the survey packet, the patients were instructed to tick “yes” or “no” for the exclusion criteria: pregnancy, concomitant vulvodynia, painful bladder syndrome, or chronic pelvic pain (defined as pelvic pain for greater than 6 months). They were further instructed not to fill in the questionnaire or return it if they could tick “yes” for any of these conditions. Respondents anonymously completed the paper-based questionnaire and returned the forms via mail using prepaid envelopes. Responses with a nonresponse rate of > 50% in the PISQ-IR instrument would be excluded after data collection [19].

Measurements

PISQ-IR consists of two sections, one for non-sexually active (NSA) women and one for sexually active (SA) women, comprising 12 and 21 items respectively (A1). The first core-branching item leads the women to the correct part of the questionnaire, depending on how they consider themselves: NSA or SA, with or without a partner. The two sections are analyzed separately. The NSA section consists of four subscales: condition-specific reasons for sexual inactivity (NSA-CS; Q2c-Q2d-Q2e), partner-related reasons (NSA-PR; Q2a-Q2b), global rating for sexual quality (NSA-GQ; Q4a-Q4b-Q5a-Q6), and condition impact (NSA-CI; Q3-Q5b-Q5c). The SA section consists of six subscales: sexual arousal and orgasm (SA-AO; Q7-Q8a-Q10-Q11), partner-related reasons (SA-PR; Q13-Q14a-Q14b), condition-specific matters (SA-CS; Q8b-Q8c-Q9), global quality (SA-GQ; Q19a-Q19b-Q19c-Q20a), condition impact (SA-CI; Q18-Q20b-Q20c-Q20d), and sexual desire (SA-D; Q15-Q16-Q17). The SA-PR (partner-related) dimension is calculated only if response is given for item 12 (confirmed having a partner). All subscales are analyzed separately and calculation of a summary score is only recommended for SA women [20]. For NSA women, a higher score indicates greater negative impact on sexual function, while for SA women, a higher score indicates less impact and better sexual function.

For hypothesis-testing where PISQ-IR subscales were compared with other existing questionnaires, four related and previously validated self-reported measures were administrated:

The Female Sexual Function Index (FSFI) is a 19-item questionnaire and contains the following six subscales: desire, arousal, lubrication, orgasm, satisfaction, and pain. Scores range from 2–36, with higher score indicating greater sexual function [7]. The St. Mark’s Incontinence score is a 7-item condition-specific instrument to assess severity of anal incontinence (AI), with a score ranging from 0–24 [21], whereas the International Consultation on Incontinence Questionnaire Urinary Incontinence Short form (ICIQ-UI SF) is a 4-item condition-specific instrument to assess severity of UI, with a score ranging from 0–21 [22]. For both the St. Mark’s Incontinence score and the ICIQ-UI SF, higher scores indicate greater severity. Pelvic Floor Distress Inventory-20 (PFDI-20) assesses the distress of POP, anorectal and urinary symptoms in three subscales: Pelvic Organ Prolapse Distress Inventory (POPDI-6); Colorectal-Anal Distress Inventory (CRADI-8); and Urinary Distress Inventory (UDI-6) [23]. Each of the three scales is scored from 0 (least distress) to 100 (greatest distress). The sum of the scores of these three scales serves as the overall summary score of the PFDI-20, and ranges from 0 to 300. PFDI-20 and ICIQ-UI SF have been validated in a Norwegian population [24, 25]. In addition, demographic data and past medical history was collected.

Analysis and Statistics

Statistical analyses were conducted using SPSS version 29.0 (IMB Corp., Armonk, NY). Clinical characteristics between NSA and SA women were assessed using Student’s t-test/Mann–Whitney U or chi-square tests. Statistical significance was assumed at p < 0.05.

Subscales were scored by mean calculation. According to the recommendation, missing data was handled using listwise deletion, and imputation of missing values was not performed [19]. An item was considered acceptable if the proportion of missing values was less than 15% [18]. Floor and ceiling effects were considered as problematic if more than 15% of respondents achieved the highest- or lowest-possible scores [17]. Internal consistency was examined using Cronbach’s alpha coefficient, where a score > 0.7 was considered reliable [18]. The item-total correlation was examined and items with values below 0.3 were excluded [18].

Construct validity was assessed in two ways: (1) Explorative factor analysis (EFA) assessing PISQ-IR dimensionality, and (2) by hypothesis-testing where PISQ-IR subscales were compared with other existing questionnaires. The dimensionality of PISQ-IR was assessed using explorative factor analysis (EFA). In the EFA, each PISQ-IR subscale was tested individually as unidimensional subscales. In addition, the PISQ-IR multidimensionality was tested by entering all items simultaneously. The suitability of data was confirmed by a significant Bartlett’s test of sphericity, the Kaiser–Meyer–Olkin measure of sampling adequacy exceeding 0.6, and inspection of the correlation matrix identifying correlation coefficients of > 0.3 [26]. Principal component analysis (PCA) with oblique (Promax) rotation was applied to identify item grouping. Kaisers criterion with retention of eigenvalue > 1 and inspection of Cattel’s scree plot guided the number of factors to be extracted [26]. Construct validity was also assessed by 19 theoretically derived hypotheses expressed in terms of expected direction and magnitude of effect (Table 1). For discriminant validity, NSA and SA women were divided into two groups based upon the knowledge that a high burden of symptoms, i.e., the groups defined as St. Marks ≥ 9 [27], ICIQ UI SF ≥ 13 [22], and POPDI-6 > 17 [28], was associated with reduced quality of life and sexual function [5, 29]. Further, in investigation of convergent and discriminant validity, PISQ-IR subscales were compared with subscales measuring conceptually similar constructs, in line with previous publications [10–16]. Nonparametric Spearman rank-sum correlation was used where coefficients were considered low (< 0.3), moderate (0.3–0.59), or high (≥ 0.6 high) [30]. Table 1. Construct validity. Confirmation or rejection of hypothesesHypothesis testedValueConfirmedDiscriminant validity (between known groups)p1. The mean score on the subscale “condition impact” is significantly higher for the NSA group with heavy burden of pelvic floor symptoms than for the NSA group with low symptom burden0.769No2. The mean score on the subscale “condition-specific” is significantly higher for the SA patient group with no to moderate incontinence than for the SA group with severe symptoms of incontinence < 0.001YesConvergent and discriminant validityRhoCorrelation expectedBetweenHigh positive^a^3. PISQ-IR SA-AO and FSFI-Arousal0.719Yes4. PISQ-IR SA-AO and FSFI-Orgasm0.592No5. PISQ-IR SA-D and FSFI-Desire0.765Yes6. PISQ-IR SA-SUM and FSFI total scores0.765Yes7. PISQ-IR SA-GQ and FSFI-Satisfaction0.671YesModerate positive^b^8. PISQ-IR SA-PR and FSFI-Satisfaction0.476Yes9. PISQ-IR NSA-CI and PFDI-200.360Yes10. PISQ-IR NSA-CS and PFDI-200.336YesModerate negative11. PISQ-IR SA-CI and PFDI-20−0.396Yes12. PISQ-IR NSA-GQ and FSFI-Satisfaction−0.583YesLow positive^c^13. PISQ-IR NSA-CS and ICIQ-UI SF0.281Yes14. PISQ-IR NSA-CS and St. Marks score0.137Yes15. PISQ-IR NSA-CS and PFDI-20 POPDI-60.275YesLow negative16. PISQ-IR SA-CS and PFDI-20 POPDI-6−0.164Yes17. PISQ-IR SA-CS and ICIQ-UI SF−0.405No18. PISQ-IR SA-CS and St. Marks score−0.186YesLow to no existing19. PISQ-IR NSA-GQ and FSFI total−0.073Yes^a^ Measure the same construct^b^ Appear to measure similar but not equivalent constructs^c^ Do not measure similar construct although greater burden of PFD is associated with deterioration of sexual function

Face and content validity were assessed by a review of cognitive interviews with patients in the target population, and comments/feedback received from clinical experts on a question-by-question basis [18].

Ethics Approval

Ethical approval was obtained from The Regional Ethical Committees for Medicine and Health (REC) (reference no. 95426/REK sør-øst), the institutional review board at the Department of Urogynecology and Gastrointestinal Surgery at St. Olavs Hospital in Trondheim and the data protection official at both NTNU and the University Hospital of North Norway in Tromsø. Permission to validate PISQ-IR was also granted by the developer of the instrument, IUGA. Written consent was obtained from the women who participated in the cognitive interviews.

Results

A total of 625 women were approached for the field-test of the study, of whom 200 (32%) returned the survey packets. It was not possible to calculate an exact response rate because we could not determine how many of the 425 non-returned responses were due to self-assessment of the exclusion criteria listed at the beginning of the survey packet. After 34 of the 200 returned questionnaires were excluded due to reporting concomitant pain-conditions, in addition to 9 questionnaires excluded due to incomplete data sets (< 50% of PISQ-IR completed), this resulted in 157 responses available for analysis (Fig. 1). Among these, 46 (29%) women were identified as NSA, and 111 (71%) as SA. UI (82%) was the most common complaint, followed by POP (52%) and AI (52%). The mean age of NSA women was 60 years with 78% being postmenopausal, whereas the mean age of SA women was 51 years with 57% being postmenopausal (p < 0.001). The clinical characteristics of NSA and SA women are shown in Table 2. Table 2. Clinical characteristics of the study populationNon-sexually activen = 46Sexually activen = 111Totaln = 157n (mean ± SD) [Q1; Q3] or frequency (%)n (mean ± SD) [Q1; Q3] or frequency (%)n (mean ± SD) [Q1; Q3] or frequency (%)pAge (years)4560.2 ± 13.311151.3 ± 12.111153.9 ± 13.0 < 0.001Body mass index (BMI)4427.8 ± 5.410826.0 ± 4.115226.5 ± 4.60.056Postmenopausal4678.3%10947.7%15556.8% < 0.001Parity462.1 ± 1.01112.3 ± 1.11112.2 (1.1)0.324PFD symptom debut461091550.012 < 12 months31%9%15% 1–5 years41%43%43% 6–10 years15%20%19% > 10 years13%28%23%Duration of deteriorated sexual function2259810.325 < 12 months23%22%22% 1–5 years32%34%33% 6–10 years32%15%20% > 10 years14%29%25%Other diseases^a^4536%10928%15431%0.383Previous surgery for PFD0.071 Hysterectomy and/or BSO4628%10817%15420%0.125 Prior prolapse surgery4129%10210%14315%0.009 Prior incontinence surgery4323%10620%14921%0.660Previous/ongoing conservative treatment for PFD4372%10683%14980%0.175Previously sought professional help for sexual dysfunction260%1092%1351.5%NAICIQ-UI SF severity levels451111560.510 Urinary continent16%19%18% Slight UI (113%17%16% Moderate UI38%42%41% Sever UI29%16%20% Very sever UI4%5%5%ICIQ-UI SF score^b^3810.6 ± 4.6909.9 ± 4.710.8 ± 4.70.434St. Marks severity levels431081510.233 Anal continent37.252%48% Moderate AI (1–8)30.220%23% Sever AI (9–24)32.628%29%St. Marks score^c^338.1 ± 5.0887.1 ± 6.17.4 ± 5.80.149PFDI-20^b^ Question 3 (vaginal bulge)4466%11146%15552%0.025 POPDI-64637.8 ± 26.011127.5 ± 21.515530.5 ± 23.30.011 PFDI-20 total score44115.5 ± 57.111196 ± 49.8155101.5 ± 52.50.037FSFI^d^ Desire451.2 [1.2; 2.4]1083.0 [2.4; 3.6]1532.4 [1.2; 3.6] < 0.001 Arousal440.0 [0.0; 0.5]1104.1 [2.7; 5.1]1543.0 [0.6; 4.5] < 0.001 Lubrication440.0 [0;0]1105.0 [3.5; 5.7]1543.9 [0.0; 5.4] < 0.001 Orgasm460.0 [0;0]1114.4 [3.2; 5.6]2573.6 [0.0; 5.2] < 0.001 Satisfaction272.4 [0.8; 3.2]1054.4 [2.8; 5.2]1324.0 [2.4; 4.8] < 0.001 Pain360.0 [0;0]1105.4 [2.8; 6.0]1464.2 [0.0; 6.0] < 0.001AI anal incontinence, BSO bilateralt salpingo-ophorectomy, FSFI Female Sexual Function Index, ICIQ-UI SF International Consultation on Incontinence Questionnaire-Urinary Incontinence Short Form, PFDI-20 Pelvic Floor Dysfunction Index, POP Pelvic organ prolapse, POPDI Pelvic organ prolapse distress, St. Marks St. Marks incontinence score, UI urinary incontinencep value is calculated with Student t-test/Mann–Whitney U or chi-square^a^ Diseases that can lead to UI/AI/POP such as diabetes, neurological disease, depression, use of antidepressants, and previous radiation therapy of the pelvis^b^ Higher scores indicate greater severity^c^ Higher scores indicate more bother of incontinence. Only score ≥1 are included ^d^ Higher score indicare better sexual functioning

Face and Content Validity

A first round of cognitive interviews with ten patients with incontinence and/or POP led to moderate amendments in the wording of the Norwegian translation, although the participants experienced the questionnaire as relevant and comprehensive. An important finding was that many participants did not find it easy to choose between the two responses “Not sexually active at all” and “Sexually active with or without a partner” in the core-branch item 1 of the questionnaire: “Which of the following best describes you”. Despite the instruction that being sexually active could be with or without a partner, the interviews demonstrated that many participants were unsure which response to choose. Neither the item nor the responses were changed, since the instructions were considered to be sufficient. In the next round, a multidisciplinary group of clinical experts also considered the content of PISQ-IR as relevant and comprehensive to measure sexual function among women with incontinence and/or POP. However, they suggested moderate amendments in the wording of the questionnaire. This necessitated a second round of cognitive interviews with another ten women from the target group, which however revealed no further need for improvement of the Norwegian translation. By following these steps, a Norwegian version of PISQ-IR with good face and content validity in terms of relevance, comprehensiveness, readability, and equivalence was established.

Item Nonresponse and Floor and Ceiling Effect

Among the 46 NSA women, 24 (52%) completed all 12 items designated for this group. The nonresponse rates for NSA items varied from 4% (question 6) to 37% (question 2d and 2e). The nonresponse rates for NSA subscales varied from 11% for the global quality rating (NSA-GQ) to 35% for the condition-specific matters (NSA-CS). For the SA women, 81 of the 111 (73%) respondents completed all 21 items. Item nonresponse rates varied from 0% (question 7, 9, 11, 14a-b, 16, 17, 20a) to 11% (question 8b-c). The only SA subscale that had a nonresponse rate > 1% was condition-specific matters (SA-CS) with 10% (Table 3). Table 3. Scale parametersScaleNo. of itemsMissing n (%)Median [Q1; Q3])Cronbach’s α (n)FloorCeilingNon-sexually active (n = 46) Condition-specific (NSA-CS)316 (35%)2.3 [1.7; 3.4]0.68 (25)20%10% Partner-related (NSA-PR)28 (17%)3.5 [2.4; 4.0]n.a.^a^42%5% Global quality (NSA-GQ)45 (11%)3.0 [2.0; 3.9]0.87 (32)7%17% Condition impact (NSA-CI)37 (15%)2.7 [1.0; 3.7]0.87 (36)15%31%Sexually active (n = 111) Arousal/orgasm (SA-AO)303.8 [3.3; 4.0]0.72 (91)1%2% Partner-related (SA-PR)^b^303.3 [3.0; 3.7]0.81 (98)1%21% Condition-specific (SA-CS)311 (10%)4.3 [3.7; 5.0]0.78 (98)2%29% Global quality (SA-GQ)41 (1%)3.0 [2.3; 3.6]0.87 (103)6%9% Condition impact (SA-CI41 (1%)3.3 [2.3; 4.0]0.92 (109)8%25% Desire (SA-D)303.0 [2.3; 3.3]0.73 (110)1%1%^a^n.a. not assessed (only two items in the scale)^b^12 women excluded owing to marking off for no sexual partner

Among NSA women, floor effects were seen for the NSA-PR and NSA-CS subscales, with 42% and 20% reporting the worst possible score, respectively. Ceiling effects were seen in NSA-GQ and NSA-CI with 17% and 31% reporting the best possible score. Notably, among NSA women who responded to item 2a (no partner as reason for not being sexually active), 63% reported the worst possible score. Ceiling effects were also seen in the subscales SA-CS, SA-CI, and SA-PR, where 29%, 25%, and 21% reported the best possible score, respectively (Table 3).

Construct Validity

Twenty-four of the 46 (52%) NSA women completed the 12 items designated for this group, forming the sample for the EFA. The Kaiser–Meyer–Olkin measure of sampling adequacy was 0.68, and Bartlett’s test of sphericity reached statistical significance, supporting the factorability of the correlation matrix. PCA with Promax rotation revealed the presence of four components with eigenvalues exceeding 1, explaining 43%, 14%, 11%, and 9% of the variance, respectively (Table 4). The four factors explained 77% of the variance, revealing differences from the original PISQ-IR structure. Item 5a and 6, originally from the NSA-GQ, were loading on the same factor as the items for NSA-CI. Furthermore, item 2a, originally belonging to NSA-PR, loaded strongest on NSA-GQ with a factor loading of −0.837, although there were cross loadings on both NSA-GQ and NSA-PR (value < 0.4). However, as all items were poorly distributed, interpretating these results is challenging. Table 4. Pattern and structure matrix of four-factor solution for NSA women after PCA with Promax rotation, n = 24Pattern coefficientsStructure coefficientFactorFactorItem12341234Q5a: Frustration1.055−0.3000.909Q6: Bothersome0.911−0.3410.7640.325Q5b: Inferior0.8800.9110.3800.427Q5c: Angry0.7950.8680.5200.321Q3: Fear0.7410.3010.8510.3260.4070.483Q2e: Pain0.8630.799Q2d: Other health reasons0.7410.750Q2c: Condition impact0.3820.5330.6190.6720.3440.325Q2a: No partner0.314−0.8370.376−0.6700.342Q4b: Adequacy0.8040.5460.892Q4a: Satisfaction0.4630.5860.6750.773Q2b: No interest0.8990.819Eigenvalue^a^5.161.741.351.05Variance explained43%14.5%11.3%8.7%Total variance explained57.4%68.7%77.4%^a^ Eigenvalue refer to the total variance explained by each factor

Of the 111 SA women, 81 (73%) completed all 21 items designated for this group, forming the sample for the EFA. The Kaiser–Meyer–Olkin value was 0.78 and Bartlett’s test of sphericity reached statistical significance. PCA with Promax rotation identified the presence of five components with eigenvalues exceeding 1, explaining 35%, 12%, 11%, 7%, and 6% of the variance, respectively (Table 5). This factor solution explained a total of 69% of the variance. A scree plot revealed no distinct elbow. Promax rotation revealed the presence of a mixed structure compared to the original factor structure [8]. The items from the two factors SA condition impact (SA-CI) and SA condition-specific matter (SA-CS), which conceptually have some similarities, both loaded on one factor (component 1). Component 2 and 3 consisted of the same items as the original SA global quality (SA-GQ) and SA partner related (SA-PR) subscales, respectively. Component 4 consisted of two items from the original SA arousal orgasm (SA-AO) subscale, together with item 16 (frequency desire), which in the original factor structure belongs to the SA desire (SA-D) subscale. Component 5 consisted of two items from the original SA-D subscale, together with item 11 (pain) which originally belongs to the SA-AO subscale. Two items did not load on any component (item 8a) or had factor loadings < 0.4 (item 20a). All remaining variables demonstrated strong loadings (0.465–1.036), and they all load substantially on only one component. Table 5. Pattern and structure matrix of five-factor solution for SA women after PCA with oblique (Promax) rotation, n = 81Pattern coefficients/matrixStructure coefficient/matrixComponentComponentItem1234512345Q20d: Angry0.9600.9040.378Q20b: Inferior0.9040.8680.358Q18: CS fear restrict0.8500.8400.419Q8b: Shame0.7490.7700.3720.3500.384Q9: UI/FI with activity0.7450.626Q8c: Fear0.6630.7170.4750.389Q20c: Embarrassed0.6200.7600.5120.3710.458Q19a: Satisfaction1.0360.3280.9200.3970.369Q19b: Adequate1.0090.3320.9310.3150.4610.368Q19c: Confidence0.8730.4590.7830.358Q20a: Frustration0.3430.3010.5290.6910.5010.4540.493Q8a: Fulfilled0.5270.6780.3390.5660.577Q14b: Frequency0.9030.870Q14a: Desire0.8430.3700.868Q13: Lack of desire0.7570.386−0.3220.696Q10: Orgasm intensity0.7690.3110.740Q16: Frequency desire0.5790.5640.7420.555Q7: Sexually aroused0.3190.4770.3280.5200.4600.6430.404Q15: Wanting more0.9480.751Q11: Pain0.5840.4370.6040.642Q17: Rate desire0.4650.4260.3240.596Eigenvalue^a^7.302.612.271.301.1Variance explained34.7%12.4%10.8%6.2%5.2%Total variance explained47.2%58.0%64.2%69.4%^a^ Eigenvalues refer to the total variance explained by each factor

As recommended by the IUGA protocol, unidimensionality was assessed by adding only the proposed items for each subscale separately in the PCA. The one-factor-analysis confirmed the presence of a simple structure within all six subscales for SA women, with factor loadings ranging from 0.489 to 0.939, whereas included items showed strong loadings on only one component. Besides the subscale NSA-PR, which did not meet the criteria for EFA (with only two items included), one-factor analysis also confirmed the presence of a simple structure for NSA women, with strong factor loadings ranging from 0.731–0.931.

Hypothesis testing confirmed 16 out of 19 (84%) predefined hypotheses (Table 1). No significant difference was observed in mean scores on the subscale NSA-CI between the NSA group with high burden of PFD, compared with those with a low symptom burden. Apart from the moderate positive correlation (0.592) between SA arousal and orgasm (SA-AO) and FSFI-orgasm subscales, all other measures of the same construct between PISQ-IR and FSFI provided high positive correlations. Scales measuring similar but not equivalent structures showed moderate correlation in the anticipated direction. Scales measuring unrelated constructs showed low correlation, except from SA condition-specific impact (SA-CS) and ICIQ-UI SF.

Internal Consistency

Cronbach’s alpha coefficients varied from 0.68 to 0.92, with NSA-CS falling just below the acceptable level of 0.7 (Table 3). Except from the NSA partner related (NSA-PR) subscale, the item-total correlations indicated that all items correlated well with each subscale, with values ranging from 0.44 to 0.84 and 0.37 to 0.88 for the NSA and SA subscales, respectively.

Discussion

The present study describes the translation and validation of the Norwegian version of the PISQ-IR among women suffering from PFD. In this study, face and content validity and cultural equivalence was established through a meticulous process consisting of reviews from two rounds of cognitive interviews with women from the target group and feedback from a multidisciplinary clinical expert group. Wordings were changed if ambiguity or vagueness was identified during this process. However, some of the participants were unsure whether to respond to “Not sexually active at all” or “Sexually active with or without a partner,” which would classify the respondent as either NSA or SA. The instruction that one could be sexually active both with or without a partner seemed sufficient and no changes were made. In retrospect, an additional instruction about self-stimulation as sexual activity could also have been highlighted, thereby making the distinction between the two easier. Item nonresponse levels were high in NSA subscales. This was also found in Rockwood et al. [19] and several other studies where PISQ-IR have been translated and validated [14–16], and underlines that assessing sexual function is challenging, particularly among women who do not consider themselves as sexually active. Ceiling effect (in terms of best possible score) was found for the subscales NSA-GQ, NSA-CI, SA-CS, SA-CI, and SA-PR, while floor effect (in terms of worst possible score) was found in the NSA-CS and NSA-PR subscales. Similar to our study, both the German and French translation and validation studies of PISQ-IR identified floor and ceiling effects in many subscales [14, 16]. These effects impact the instrument’s discriminative power, making it challenging to detect potential deterioration or improvements after any type of intervention [18]. Despite these limitations found especially in NSA subscales, the Norwegian PISQ-IR can be utilized in both research and clinical practice, but results should be interpreted with caution, keeping these constraints in mind.

The factor structure of PISQ-IR was extracted by Rogers et al. [8] and later confirmed in the Spanish [10] and Chinese versions [11], but most translation and validation studies of PISQ-IR have not assessed the structural validity of the instrument [12–16]. In the Norwegian version, EFA mainly supported the original structure for both SA and NSA subscales, although it was not completely consistent and had many cross loadings. Owing to the small sample size in our study, the results must be interpreted with severe caution. Factors obtained from small data sets do not generalize as well as those derived from lager samples [26]. Therefore, unidimensional factors were assessed and confirmed the presence of one factor within all subscales for SA women and three for NSA women (except NSA-PR).

Furthermore, construct validity was assessed through hypothesis testing of each construct separately, based on assumptions and previous findings of correlations [10–16]. We expected to find moderate to low correlations between PISQ-IR and PFDI-20, St. Marks incontinences score, ICIQ-UI and FSFI due to of the structure of the questionnaires. This is because a condition-specific measurement can describe the impact and bother of symptoms but does not account for sexual function, although greater burden of diseases is associated with lower sexual function [5].

The validity was considered acceptable since 84% (> 75%) of the predefined hypotheses were confirmed [18]. For discriminating validity, we divided NSA and SA women in two groups respectively, based on burden of symptoms. Opposite to what we predicted, the mean score of the subscale “condition impact” was not significantly higher for the NSA women with heavy burden of pelvic floor symptoms compared with NSA women with low symptom burden. This may indicate that the symptom burden does not significantly affect how the condition impacts sexual function in our population.

The single subscales were assessed as constructs, which validly measured the construct to be measured [18], despite the inconclusive nature of the multidimensional aspects of the PISQ-IR. On the basis of Cronbach’s alpha coefficients, all scales demonstrated a good level of internal consistency, in line with other translations of PISQ-IR [10–16]. This reflects that the items in the respective subscales correlated well with each other, thus measuring the same concept [18].

Strengths and Limitations

This study provides a Norwegian translation and validation of a condition-specific measurement tool for sexual function in women with PFD. A strength of this study is the recruitment of participants from two university hospitals and the inclusion of women across a wide age range which enhances the external validity. In contrast to previous publications [10–16], a notable strength of our study lies in the diversity of PFDs in the study population, particularly the substantial representation of women with AI (52%). This study’s strength also lies in the comprehensive methodology used to validate the PISQ-IR and assess the translation’s psychometric properties, including evaluation of construct validity through 19 predefined hypotheses. The confirmation of 84% of these hypotheses indicates robust construct validity, aligning with theoretical expectations. The findings support the PISQ-IR’s ability to measure sexual function effectively among sexually active women with PFD in a Norwegian context. However, the study has some limitations. The overall sample size in this study is small, particularly the number of women who identified as NSA. We lack information about the women who did not respond to the study, such as whether they were younger or older, or had less or more advanced symptoms. Sexual health and sexual function remain taboo subjects, and some participants may have felt uncomfortable answering the questions. The questionnaire itself was long, and fatigue was observed in answering the PISQ-IR (the last section in the questionnaire) among some of the patients. Scale responsiveness and test–retest reliability is previously established [8, 11, 13, 15, 16]. This was however not assessed, which is a limitation in this study.

Points for Consideration

By making adjustments that can influence floor and ceiling effect (e.g., highlight self-stimulation as sexual activity in core-branch item 1 and specify that all items need to be answered), the responsiveness of the instrument may be positively affected, making the subscale better equipped to detect change over time. For administration in clinical settings, pre-reversed values can furthermore be an advantage and may make the tool more applicable in daily practice.

Conclusion

The Norwegian PISQ-IR was found to have good face/content validity, internal consistency and construct validity for each subscale among SA and NSA women, respectively, except for the subscale NSA partner related. Owing to the small sample size, especially among NSA women, conclusions must be considered with caution, and further studies of psychometric properties of the Norwegian PISQ-IR for NSA women is recommended.

Supplementary Information

Below is the link to the electronic supplementary material.Supplementary file1 (PDF 724 KB)