Assessment of current patient reported outcome measures for three core outcome domains for single-sided deafness device intervention trials
Roulla Katiri, Deborah A. Hall, Derek J. Hoare, Sandra Smith, Bethany Adams, Kathryn Fackrell, Adele Horobin, Nicholas Hogan, Nóra Buggy, Pádraig T. Kitterick

TL;DR
This study evaluates existing patient-reported outcome measures for three key domains in SSD device trials and finds none fully meet current standards.
Contribution
The study identifies key concepts for SSD outcome domains and assesses existing PROMs for content validity in this context.
Findings
No existing PROMs fully met content validity standards for SSD outcome domains.
The Spatial Hearing Questionnaire and SSQ variants showed best performance for spatial orientation.
The CPHI questionnaire was most relevant for measuring impact on social situations.
Abstract
Outcome reporting in clinical trials of auditory interventions for adults with Single-Sided Deafness (SSD) is inconsistent. The Core Rehabilitation Outcome Set for Single-Sided Deafness (CROSSSD) initiative has recommended three outcome domains as a minimum standard in the design of SSD intervention clinical trials. These are, Spatial orientation, Group conversations in noisy social situations, and Impact on social situations. The study objectives were to (i) understand exactly what the outcome domains mean to SSD experts, and (ii) identify and assess candidate PROMs in terms of how well they measure the experts’ conceptualisation of those SSD outcome domains. Stakeholder representatives participated in two semi-structured online focus groups. Participants were four adults diagnosed with SSD with experience of auditory interventions, two healthcare professionals working in the field,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHearing Loss and Rehabilitation · Hearing Impairment and Communication · Noise Effects and Management
Background
Single-Sided Deafness (SSD), or unilateral hearing loss, is defined by hearing thresholds within normal limits in one ear, and a severe-to-profound sensorineural hearing loss in the other ear [1]. SSD can be congenital [2–7], or acquired due to conditions such as Ménière’s disease [8], viral infections such as labyrinthitis or COrona VIrus Disease (COVID) [9–11], autoimmune systemic diseases, or vestibular schwannoma [12–15]. In some cases the cause of SSD is unknown [16–20]. It is estimated that SSD affects 12 to 27 individuals in every 100,000 of the general population [21].
SSD poses the listener with significant challenges [22–24]. These include poor speech perception in complex listening environments [12, 25–30] and poor spatial awareness of sounds [22, 31–33]. Associated fatigue, psychological, and social consequences have also been documented [23, 34–41]. Various hearing aids (e.g., contralateral routing of signals), and auditory implants (e.g., bone conduction devices, cochlear implants) have been utilised since the 1960s to alleviate the functional effects of SSD [22, 42–51]. However, choice of outcomes and measurement instruments to assess the benefits and harms of interventions for SSD have been diverse and inconsistent [52, 53]. A systematic review including 96 studies evaluating the therapeutic benefits and harms of SSD interventions identified many ways to measure the same domains of interest. Notably, no single measurement instrument was used by all studies [54]. For example, speech-related outcome domains were measured using 73 different measurement instruments, and spatial-related domains were measured using 43 different measurement instruments. Another systematic review of studies evaluating the effectiveness of SSD interventions identified that outcome selection was biased towards assessing functional impairments for which measures are readily available and widely used, e.g., speech perception testing in noise, and localisation tests [53]. Outcomes that assess an individual’s well-being and overall health are also relevant to those receiving treatment for SSD but these are less often measured [55]. The diverse and inconsistent use of outcome measures hinders our ability to compare or synthesise evidence and make informed decisions about optimal treatment of SSD [56–59].
The ideal scenario is that experts in SSD (healthcare users, healthcare professionals, clinical researchers) are involved in prioritising what treatment-related outcomes are critical and important to measure, so that trialists can select and administer instruments which measure those outcomes and which have good psychometric properties. The Core Rehabilitation Outcome Set for Single-Sided Deafness (CROSSSD) initiative was established to progress towards this goal. To date, the CROSSSD team have employed good practice methodology following Core Outcome Measures in Effectiveness Trials (COMET) initiative recommendations [56] to complete the prioritisation phase. An international Delphi study involving 308 experts in SSD and a meeting with key stakeholder representatives took place in order to reach a consensus [60–64]. The consensus decision was that three outcome domains are critical and important to assess in every SSD clinical trial. These are (i) Spatial orientation, (ii) Group conversations in noisy social situations, and (iii) Impact on social situations. A plain language definition for each of these outcome domains were co-created by the CROSSSD team and two study research partners with lived experience of SSD (Table 1).Table 1. The core outcome domain set for single-sided deafness [68]Outcome domain namePlain language definitionSpatial orientationKnowing where you are in relation to the position of a sound sourceGroup conversations in noisy social situationsListening and following a conversation between a group of people, when others are talking in the backgroundImpact on social situationsYour hearing loss or device limiting your ability to fully participate in the social world; especially in challenging situations or where a lot of effort is needed to follow the conversation (for example; at a restaurant; at the park; in a bar or at a party)
The next phase of the CROSSSD plan is to determine whether any existing outcome instruments appropriately measure Spatial orientation, Group conversations in noisy social situations, or Impact on social situations. Good practice guidelines exist to guide researchers through the steps to determine what is ‘appropriate’. For example, COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) guidelines set out a taxonomy of measurement properties relevant for evaluating PROMs; namely validity, reliability, and responsiveness. The COSMIN initiative states that one of the most important psychometric properties of a measurement instrument is its content validity [60, 61]. Content validity refers to the degree to which the items within an instrument are an adequate reflection of the construct to be measured. The measure of content validity encompasses (i) relevance (all instrument items are pertinent to the overall concept), and (ii) comprehensiveness (the instrument samples all the key elements of the construct) [60, 62].
The objectives of the present study were therefore to (i) understand exactly what the outcome domains mean to SSD experts, and (ii) identify and assess candidate PROMs for their relevance and comprehensiveness in measuring the SSD outcome domains.
Methods
Our approach involved people who had been treated for SSD (patient and public involvement collaborators NH and NB) as CROSSSD study research partners thus ensuring that decisions were informed by lived experience [63]. Ethical approval was granted from the Proportionate Review Nottingham 2 Research Ethics Committee (REC reference 19/EM/0222, Integrated Research Application System (IRAS project ID 239,750) on 06 August 2019. Informed consent was taken prior to participation at the focus groups using an online consent form. Participants were reminded that they could withdraw from the study at any point without needing to give a reason. All participants were volunteers and no reimbursement was given for their contribution.
Objective 1. Domain conceptualisation
A thematic approach was taken using two expert focus groups to explore the personal patterns of experience and meaning of the three outcome domains. Focus groups were conducted online in October 2020, using Microsoft Teams software. One week before the focus group, participants received the plain language definition of each outcome domain (Table 1) and were asked to reflect on how the outcome domains related to their personal experiences. Informed consent using an online form was obtained prior to the focus group discussion.
Participants
Participants were Healthcare Users (HU) diagnosed and treated for SSD, and Healthcare Professionals (HP) working in the field. Inclusion criteria were: (i) adults ≥ 18 years of age that were healthcare users or professionals with experience in the field of SSD, and (ii) had participated in the CROSSSD study consensus meeting [64]. To maximise diversity in expertise and to ensure the focus groups were representative of the consensus meeting experts, purposive sampling targeted a wide range of demographics (gender, age, country, cause of SSD), and experience of different SSD interventions. In total, four HU and three HP were recruited (Table 2). One further HP had to withdraw before the focus group meeting.Table 2. Participant demographics. SSD expertise (for Healthcare Users (HU) this was the number of years since their diagnosis i.e., their lived experience; for Healthcare Professionals (HP) this was the number of years since they started working in the field i.e., their occupational experience); and SSD intervention experienceParticipantsGenderAge range (years)CountrySSD expertiseSSD intervention experienceFocus group 1Held on 22^nd^ of October 2020Healthcare User 1 (HU1)Male70–79England28 yearsCROS aidHealthcare User 2 (HU2)Male60–69England3 years,9 monthsCROS aidHealthcare Professional 1 (HP1)Male50–59England35 yearsCROS aidsFocus group 2Held on 27^th^ of October 2020Healthcare User 3 (HU3)Male18–29England1 year,1 monthCROS aid/BCD/CIHealthcare User 4 (HU4)Female30–39Spain3 yearsCROS aidHealthcare Professional 2 (HP2)Male40–49Germany25 yearsCROS aids/BCD/CIHealthcare Professional 3 (HP3)Male60–69Netherlands32 yearsCROS aids/BCD/CI(CROS: Contralateral routing of signals aid; BCD: Bone conduction device; CI: Cochlear Implant)
Data collection instruments and technologies
During the focus groups, facilitators briefly reminded the participants of (i) the meeting ground rules, (ii) the task in hand, in the context of core outcome set development for SSD interventions, and (iii) the plain language definition of each outcome domain. Each outcome domain was allocated 45 minutes for discussion, with a short rest break between each discussion. The Spatial orientation domain was discussed first, followed by Group conversations in noisy social situations, and Impact on social situations last. Each participant was given a clear turn to voice their opinions and time was given prior to closing the discussion to comment on other participants’ views, or add further comments. Participants were given a choice when to take their turn, depending on how prepared they felt to discuss the particular outcome domain.
A semi-structured interview schedule was prepared by RK, PTK, and DAH. The schedule encouraged free narrative responses for analysis using a thematic analysis method [65]. The schedule was reviewed for content, suitability and structure of the prompts, and clarity of the themes by the CROSSSD study research partners (NB, NH) and PPI engagement manager (AH). They ensured that concepts were described clearly, participant material was written in plain language and online communication aids (e.g., captions) were sufficient for participant needs. See Fig. 1 for the schedule used to discuss Group conversations in noisy social situations, and refer to Additional file 1 for discussion prompts prepared for the other two outcome domains. The schedule incorporated open-ended questions, designed to elicit detailed descriptions from participants, probed the experience of conversations, the nature of a group, the nature of the background noise, and what in their experience constituted ‘noisy’. The schedule was followed for each outcome domain, for both focus groups.Fig. 1. Example of the discussion prompts prepared for the Group conversations in noisy social situations (Listening and following a conversation between a group of people; when others are talking in the background) outcome domain to facilitate discussions during the focus groups. Please refer to Additional file 1 for discussion prompts prepared for the other two outcome domains
Facilitators took a position of appreciative inquiry, using active listening and being non-judgemental, and curious to explore and fully understand the participants’ experience. The focus groups were co-facilitated by the lead for the CROSSSD study (RK) who is also a clinical audiologist with 14 years’ clinical experience including working with patients who have SSD, and a researcher (PTK) who has previous qualitative research experience in the field of SSD [35]. The sessions were video-recorded and subsequently transcribed verbatim by RK.
Data analysis
Transcriptions were independently reviewed by RK and PTK, and key discussion concepts were identified. Initial codes highlighted all keywords and phrases used by participants to describe the different dimensions of each outcome domain. Codes were organised to create a taxonomy of findings, with each outcome domain comprising multiple functional domain topics (such as ‘Knowing where sounds are in relation to you’) which could be further broken down into key concepts (such as ‘Incorporates sound locations that are both in front and behind’ and ‘Considers sounds that are both within and outside the visual field’). Where further clarification was necessary, this coding process referred back to the original transcription. Some codes did not readily fit into the taxonomy and so are reported separately.
Two levels of peer review were conducted to enhance trustworthiness in the taxonomy. First, it was reviewed by the rest of the study team (DAH, DJH) and the CROSSSD study research partners (NH, NB). The study research partners confirmed that the taxonomy captured all important aspects of their lived experience. Second, it was reviewed at an international CROSSSD study steering group meeting, comprising three expert healthcare professionals and researchers based in Europe and the United States, as well as a PPI manager (AH). The individual members of the steering group pre-reviewed the frameworks independently prior to the meeting. Their critical review led to the addition of one key concept for the Spatial orientation domain (‘Incorporates sounds that are static or moving’), and participant support for this concept was found in the transcripts of the focus groups.
Objective 2. Content validity
The aim of the next stage was to evaluate whether any existing instruments would be appropriate for measuring the SSD core outcome domain set now that the key concepts within each domain were identified and labelled. Previous research had identified 127 potential instruments for assessing SSD outcomes (Fig. 2). These were two systematic reviews of outcome domains and measurement instruments used in the field of SSD [54], and of hearing instruments for unilateral severe-to-profound sensorineural hearing loss [53]. Screening for English language versions of these instruments, and supplementing the list with a hand search of more recently published instruments, resulted in a final set of 76 candidate instruments (Fig. 2). These comprised two diary records, 49 questionnaires, 17 rating scales, and eight other Patient Reported Outcome Measures (PROMs) (e.g., patient report with yes/no answers, or a single-item instrument). Some of the instruments were specific to SSD (e.g., Bern Benefit in Single-Sided Deafness questionnaire), others were not (e.g., Abbreviated Hearing Aid Benefit Profile). Additional file 2 provides further details about the 76 candidate PROMs.Fig. 2. Measurement instruments reported in studies investigating interventions that seek to restore hearing in adults with single-sided deafness. PROM(s): Patient Reported Outcome Measure(s), VAS: Visual analogue scale, CROSSSD: Core Rehabilitation Outcome Set for Single-Sided Deafness, SSD: Single-Sided Deafness
All 76 candidate PROMs were first assessed for content relevance. Three coders (RK, SS, BA) independently scored each item in every PROM against the key concepts listed in the taxonomy. Coders had expertise in hearing sciences research, had access to this taxonomy (i.e., plain language definition of the core domain, functional domain topics and key concepts), and used Excel spreadsheets to record their scores (one spreadsheet per PROM).
To score item relevance, coders were asked to consider the key concept, the target population of interest (i.e., healthcare users with SSD) and the context of use (i.e., patient reported outcome from an SSD intervention [60]. Coders were advised that an individual item could be relevant for one or more key concepts. Coders considered these three features of the item and gave a single score as follows: 1 = ‘item irrelevant to the key concept’, 2 = ‘item somewhat relevant/possibly relevant to the key concept’, or 3 = ‘item explicitly/clearly relevant to the key concept’. A scoring spreadsheet template was piloted using five PROMs and all coders were thoroughly trained by RK to ensure consistency in coding, and for quality assurance [62, 66].
Agreement that an item was relevant was achieved if at least two out of the three coders scored the item as 3. Agreement that an item was not relevant was achieved if none of the three coders scored the item as 3. For the remaining items, the three coders met up to discuss the rationale for their scores against the above criteria with the aim of achieving reconciliation on the final classification (i.e., relevant/not relevant). For each PROM, three relevance metrics were generated using the agreed classifications; one for the Spatial orientation domain, one for the Group conversations in noisy social situations domain, and one for the Impact on social situations domain. For example, if a PROM had 10 items and all 10 items were classified as relevant for measuring the outcome domain in question, then the relevance metric for the PROM would be 100%. If eight out of 10 items were classified as relevant to that domain, then the relevance metric for the PROM would be 80%, and so on.
Next, all 76 candidate PROMs were assessed for comprehensiveness. Using the agreed classifications for relevance, comprehensiveness was calculated for the two lowest level in the taxonomy (i.e., comprehensiveness in terms of whether the PROM items cover all key concepts). Relevant and/or not relevant classifications were scored as follows: if a core outcome domain contained eight key concepts and all eight key concepts were covered by at least one item in the PROM, then the comprehensiveness metric for the PROM would be 100%. If six out of the eight key concepts were covered by an item, then the comprehensiveness metric for the PROM would be 75%, and so on.
Protocol variations
There were two amendments to the published protocol [64]. The first was a change from a face-to-face, 7-hour focus group to two 3-hour web-based meetings. This change was necessary because of travel and physical distancing restrictions imposed by the COVID pandemic. The second deviation from protocol was that the ratio of healthcare users to professional experts was 1:1 rather than the 4:1 stated in the protocol. All participants had previous interactions and an established relationship with the facilitators and each other, having met in a previous stage of this work (consensus meeting that took place in July 2020); and a social coffee morning held shortly before the focus groups [67, 68].
Results
Objective 1. Domain conceptualisation
The domain-level taxonomy comprising multiple functional domain topics and key concepts are given in Table 3. Each of the three outcome domains, comprised three functional domain topics. For Spatial orientation, these were further broken down into eight key concepts, for Group conversations in noisy social situations there were nine key concepts, and for Impact on social situations there were also nine key concepts. Most functional domain topics and key concepts had supporting codes from both focus groups, indicating data saturation. For transparency, codes that did not readily fit into the taxonomy are reported in Table 3, while Additional file 3 summarises the main discussion points and supporting participant quotes.Table 3. The functional topics and key concepts for each of the three SSD outcome domains (Spatial orientation, Group conversations in noisy social situations, and Impact on social situations)Outcome domain and plain language definitionFunctional topics within the domainKey conceptsCodes that did not readily fit into the taxonomySpatial orientation: Knowing where you are in relation to the position of a sound sourceBeing aware that sounds are not in your visual fieldKnowing that the sound is not where you are currently lookingo Experiencing the world as 3-dimensionalo Sense of ease or comforto Sense of securityo Personal safety aspectFeelings of:o Inadequacyo Frustrationo Anxietyo StressFearConstant challengeBeing aware of threats or harms outside your visual fieldNot needing to rely on visual cuesKnowing where sounds are in relation to youIncorporates sound locations that are both in front and behindIncorporates sounds that are static, or movingConsiders sounds that are both within and outside the visual fieldAttending to sounds in one location and not at other locationsAbility to attend to sounds in the correct location without a time delay: an active processAbility to attend to sounds in the presence of noise or other distracting sounds: a dynamic processGroup conversations in noisy social situations: Listening and following a conversation between a group of people, when others are talking in the backgroundDynamic involvementKnowing when someone has started to talko Being aware of all conversations taking placeo Contributing appropriately at the right timeo Having to rely on visual cueso Having to rely on help or hints provided by a partnero Being able to sustain attention for long enoughFeelings of:o Being rudeo Embarrassmento Being always on ‘high alert’o Tirednesso Not being included LonelinessBeing able to tell when someone new starts to contribute to the group conversationKnowing who to listen to within the groupFollowing the thread of the conversation, when someone starts to contribute, and telling is it’s a new conversationListening in the background of other conversationsBeing able to know if the person talking is part of your conversation or another conversationBeing able to separate different streams of conversationsMaintain and sustain attention in the conversationConversations in other background noiseBeing able to understand what is being said in a noisy environmentBeing able to resist distracting soundsImpact on social situations: Your hearing loss or device limiting your ability to fully participate in the social world; especially in challenging situations or where a lot of effort is needed to follow the conversationContributing to social interactionsKnowing when to take your turno Demonstrate an understanding of what others are sayingo Impact on relationships, work, education, community, societyFeelings of:o Inability to contributeo ‘Over-participation’o LonelinessExhaustionKnowing what to do or say when it’s your turnBeing able to take turns without relying on visual cues or prompts from others‘Fitting in’ sociallyFeeling that you are contributing sociallyFeeling that you are part of the social groupNot having to avail of help from others to participateEase of participationBeing able to participate without always having to concentrate intenselyBeing able to sustain participation over timeNot having to avoid or withdraw from a situation
Objective 2. Content validity
Regarding the coding of content relevance, coders agreed on the item classification of relevant/not relevant for 43 out of the 76 (57%) candidate PROMs, without the need for any reconciliation conversation. Of the remainder, a classification was agreed through discussion among the three coders. From this agreed coding for the 76 candidate instruments, it was noted that 52 instruments contained no items that were relevant for any of the three core outcome domains. These are recorded in Additional file 2 (last column), and covered domains such as device preference, device satisfaction, coping, number of days missed at work, health status, health-related quality of life, service-related issues, mental health and tinnitus. Included in this list were a number of instruments designed to measure amplification-related outcomes (Glasgow Hearing Aid Benefit Profile [69], International Outcome Inventory for Hearing Aids [70], and Satisfaction with Amplification in Daily Life [71]).
For Spatial orientation, two variants of the SSQ (SSQ-12 [72] and SSQ-18-C [73]) scored 100% on the rating of comprehensiveness, indicating that items addressed all eight of the key concepts (Table 4). However, the SSQ-12 and SSQ-18-C scored lower on relevance (25% and 29% respectively) because the instruments also contain questions about speech and qualities of hearing. However, taking only the spatial subscale into account, then for the SSQ-12, the three spatial items were 100% relevant and for the SSQ-18-C, the 17 spatial items were 82% relevant to our Spatial orientation domain. We observed that comprehensiveness of the SSQ-12 and SSQ-18-C was compromised because measurement of multiple key concepts relied on a single question, such as ‘You are outside. A dog barks loudly. Can you tell immediately where it is, without having to look?’ [72, 73] or ‘You are standing on the footpath of a busy street. Can you hear right away which direction a bus or truck is coming from before you see it?’ [72, 73]. Therefore, another instrument worth considering for assessing the spatial domain is the Spatial Hearing Questionnaire (SHQ) [74]. This scored 88% on the rating of comprehensiveness (it excluded only one key spatial hearing concept, i.e., ability to attend to sounds in the presence of noise or other distracting sounds) and 50% on the rating of relevance (12 out of the 24 items were relevant to spatial hearing). Unlike the SSQ, key concepts were assessed across multiple items.Table 4. Ratings of relevance and comprehensiveness for each of the core outcome domains for SSD. Scoring for the individual key concepts across each candidate measurement instrument can be found in Additional file 4InstrumentRelevanceComprehensivenessSpatial orientationGroup conversations in noisy social situationsImpact on social situationsSpatial orientationGroup conversations in noisy social situationsImpact on social situations01. Abbreviated Hearing Aid Benefit Profile (APHAB)0130044002. Audio Processor Satisfaction Questionnaire (APSQ)0013003303. Bern Benefit in Single-Sided Deafness Questionnaire (BBSS)101001311004. BAHA satisfaction questionnaire (Ghossaini et al, 2010)07100222207. Client Orientated Scale of Improvement (COSI)19131338112208. Communication profile for hearing impaired (CPHI)18186310010011. Dizziness Handicap Inventory (DHI)004001115. Expected Consequences of Hearing aid Ownership (ECHO)006001117. Glasgow Benefit Inventory (GBI)006001118. Glasgow Health Status Inventory (GHSI)006001121. Hearing Handicap Inventory (HHIA)0012003322. Hearing Implant Sound Quality Index (HISQUI-NL)52602567024. Hyperacusis Questionnaire (Khalfa et al, 2002)070011026. Monaural auditory capacity assessment scale (MACAS)221707556027. Multi-item, multi-domain questionnaire Schafer et al., 2013) [76]214213891128. Nijmegen Cochlear Implant Questionnaire (NCIQ)5057502241. Questionnaire (Snapp et al, 2010)502501311047. Speech Spatial & Qualities 12 items (SSQ-12)25178100891149. Spatial Hearing Questionnaire (SHQ)5000880050. Speech, Spatial & Qualities 12 items Comparative (SSQ-12-C)2517863561151. Speech, Spatial & Qualities 12 Pre and Post (SSQ-12-B)2517863561152. Speech, Spatial & Qualities 18 items Comparative (SSQ-18-C)298010089053. Speech, Spatial & Qualities 5 items (SSQ-5)20402050441158. Tinnitus Handicap Inventory (THI)0040011
For Group conversations in noisy social situations, the CPHI questionnaire [75] scored 100% on the rating of comprehensiveness, indicating that items addressed all nine of the key concepts (Table 4). Again, the CPHI scored lower on relevance (8%). We judged that only 12 out of the 145 items were relevant to group conversations in noisy environments. This is not surprising given the CPHI is a multi-dimensional instrument comprising four broad subscales (communication performance, communication environment, communication strategies, and personal adjustment). The SSQ-12 and SSQ-18-C also performed well on comprehensiveness, both scoring 89% (Table 4). In each of these two questionnaires, only one key concept was not addressed; ‘Being able to know if the person talking is part of your conversation or another conversation’ (SSQ-12), and ‘Being able to separate different streams of conversations’ (SSQ-18-C). Relevance scores for Group conversations in noisy social situations was disadvantaged because the SSQ is a multi-dimensional instrument. Taking only the speech subscale into account, then for the SSQ-12, the five speech items were 40% relevant and for the SSQ-18-C, the 14 speech items were 29% relevant to our group conversations domain. The multi-item, multi-domain questionnaire developed by Schafer et al. (2013) [76] scored 89% on comprehensiveness. Overall, this instrument scored poorly on relevance (2%) because only six out of 43 items were about group conversations in noisy social situations. For all four of the instruments reported above (i.e., CPHI, SSQ-12, SSQ-18-C, and the questionnaire by Schafer et al. [76]), comprehensiveness was compromised because results were strongly reliant on single items which asked about multiple key concepts, such as ‘You’re at a dinner party with several other people. How often can you carry on a conversation or give and receive information without a great deal of effort?’ [75], ‘You are in a group of about five people in a busy restaurant. You can/cannot see everyone else in the group. Can you follow the conversation?’ [72, 73], and ‘How difficult is it to understand multiple talkers all around you in noise, at work or school?’ [76].
For Impact on social situations, the CPHI questionnaire [75] scored 100% on the rating of comprehensiveness, indicating that items addressed all nine of the key concepts (Table 4). Unlike the two other core domains, the comprehensiveness score was not reliant on single questions because there was more of a one-to-one mapping between one question and one key concept. Regarding relevance, 26 out of the 145 items were judged to be relevant to this core domain (18% relevance). This is perhaps not surprising given that the CPHI is a multi-dimensional instrument comprising four broad subscales (communication performance, communication environment, communication strategies, and personal adjustment). No other candidate instrument was deemed to be appropriate for assessing Impact on social situations as defined by our expert stakeholders.
Discussion
This study engaged with stakeholder representatives to gain an in-depth understanding of exactly what the outcome domains mean to SSD experts and identified and assessed candidate PROMs for their relevance and comprehensiveness in measuring the SSD outcome domains. Outcome domains were complex in that they comprised multiple conceptual components. For example, Spatial orientation included aspects of being aware that sounds are not in one’s visual field, knowing where sounds are in relation to oneself, and attending to sounds in one location and not at other locations. The key concepts identified by SSD experts in the present study complement previous themes explored through qualitative research with healthcare users with SSD. Notably, Lucas et al. [35] identified communication tactics for aiding group conversations in noisy situations and for enabling full participation in the social world. These include correct positioning in a social setting to favour access to signals, speech, or visual cues to aid the person’s ability to follow conversations and to fully participate in challenging listening environments.
The most parsimonious way of measuring outcomes is that one outcome domain is measured by one instrument (e.g. [77]). However, some outcome developers consider it to be simpler and less burdensome for healthcare users and health systems to utilise a single instrument rather than many individual PROMs (e.g. [78]). In hearing sciences, it has also been traditional to develop multi-dimensional instruments that assess a number of domains at the same time. Where the core outcome domains identified by our consensus methodology [68] do not match the outcome domains considered by the developers of the original instruments, the disadvantage of this approach is obvious. Several candidate instruments scored well in terms of their comprehensiveness because they captured all key concepts of the core domain of interest, but scored poorly in terms of their relevance because many items within the instrument asked about other aspects of hearing which were irrelevant to that core domain. In this regard, none of the 76 candidate instruments met the COSMIN standards for content validity in the context of assessing the three core outcome domains for SSD. Nevertheless, there are some instruments which are worthy of further consideration because they could be modified to enhance their psychometric properties. Some suggestions about future research directions are discussed for each outcome domain in turn.
For assessing Spatial orientation, on balance the SHQ [74] and the SSQ-18-C [73] have the greatest potential for modification in order to create acceptable instruments. Twelve items from the SHQ (Items 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, and 24) and 14 items from the SSQ-18-C (Items 15, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 29, 30, and 31) were found to be relevant to the key concepts for Spatial orientation. Modification would need to consider removing items that are not relevant to Spatial orientation, adding items (for the SHQ) asking about the ability to attend to sounds in the presence of noise or other distracting sounds, and rewording items (for the SSQ-18-C) so that items were more clearly relevant to specific key concepts. Considering the psychometric properties of any modified instrument, the last two points are particularly relevant for other important aspects of validity; namely construct and structural validity. Construct validity is the extent to which the PROM accurately measures the intended construct, while structural validity is the degree to which the PROM items and subscales reflect the underlying dimensionality of the construct being measured.
For Group conversations in noisy social situations, none of the existing instruments stood out as adequate candidates for measuring this outcome domain. Although instruments had good domain coverage, even speech-related subscales contained items that were not relevant to our domain of interest. Our findings highlight a gap in current practice which could be addressed through the creation of a new instrument that is more parsimonious for assessing Group conversations in noisy social situations. The CPHI, SSQ-12, SSQ-18-C and the questionnaire by Schafer et al. [76] all contain items that could be retained in a such new instrument. There are precedents for taking such an approach in the field of hearing sciences. For example, the developers of the Tinnitus Functional Index created this instrument by selecting the 25 best-functioning items from a pool of 175 items that had been harvested from nine widely used tinnitus questionnaires [79].
There is a growing body of evidence that hearing impairment significantly impacts on social well-being including engaging socially and maintaining inter-personal relationships [36, 80–83]. It is perhaps therefore surprising that for assessing Impact on social situations, the CPHI questionnaire [75] was the only identified PROM suitable for assessing this outcome domain. Twenty-six items from the SHQ (Items 5, 9, 12, 28, 31, 38, 41, 45, 52, 61, 65, 69, 72, 78, 80, 84, 92, 104, 109, 110, 116, 131, 135, 141, 144, and 145) were found to be relevant to the key concepts that were identified by our SSD experts. Our findings indicate that development of an instrument tailored to assessing social impacts is warranted. But any modification of the CPHI questionnaire to create a more parsimonious measure of social impact would again need to consider removing items that were not relevant to this domain.
Limitations
One of the study objectives was to identify and assess candidate PROMs for their relevance and comprehensiveness in measuring the three core outcome domains for SSD trials identified by the CROSSSD initiative [68]. These three outcome domains were identified through an international stakeholder involvement. However, for the domain conceptualisation phase of this project most participants were male, white Caucasian, and British. This geographical bias was in part due to the restriction on eligibility to those who had ‘core outcome set literacy’ [84, 85], i.e., who had already been engaged in the CROSSSD study [64]. It is unlikely that this geographical bias majorly impacted on the results because no material differences were noted in the functional domain topics and key concepts highlighted by focus group 1 (all British, male participants) and focus group 2 (other European, predominately male participants).
Concluding remarks
An ongoing challenge facing researchers, funders, healthcare professionals, and policy makers globally is adoption of translational research. For example, core outcome set uptake in trials contributes to reducing research waste by limiting selective reporting of outcomes and ensuring that results can be effectively compared and combined [86]. A recent review suggests that core outcome set uptake is low in most research areas [87], with most common barriers being not including all relevant stakeholder representatives in the core outcome set development process, which can reduce the generalisability and credibility of the outcome set [86]. Another identified barrier is not making recommendations on measurement instruments for the domains in the core outcome set [87]. The CROSSSD study group have endeavoured to minimise these barriers by using robust stakeholder engagement methods, including study team members with lived experience of SSD and making recommendations on measurement instruments. One suggestion going forward is to conduct a realist evaluation to understand how the research translation process contributes to health system sustainability and value-based healthcare [88]. Williamson et al. [87] suggest a ‘bottom up’ approach to research translation, which can yield positive outcomes across impact domains in a core outcome set, including advancing knowledge, collaboration and capacity building as well as contributing to changes in policy and practice. For example, an approach where core outcome set developers collaborate with key organisations and communities in a specific health area, to identify, tailor, and promote uptake strategies can be helpful. The review by Saldanha et al. [89] suggests that greater adoption of, and reference to, core outcome sets in regulatory guidance documents can encourage clinical researchers to measure and report consistent and agreed outcomes. Joining forces with different working groups (e.g., American Cochlear Implant Alliance Task Force [90]) could help to harmonise recommendations and would serve to complete the CROSSSD study roadmap [64].
Electronic supplementary material
Below is the link to the electronic supplementary material.
Supplementary Material 1 Supplementary Material 2 Supplementary Material 3 Supplementary Material 4
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Koumpa FS, Forde CT, Manjaly JG (2020) Sudden irreversible hearing loss post COVID-19. BMJ Case Rep 13(11). 10.1136/bcr-2020-23841910.1136/bcr-2020-238419 PMC 755450533051251 · doi ↗ · pubmed ↗
- 2Hall DA, Kitterick PT, Heffernan E, et al. How do we know that our patients have benefitted from our ENT/Audiological interventions? Presented at the annual meeting of ADANO 2016 in Berlin. Otol Neurotol 2019; 40: e 474–e 481.10.1097/MAO.000000000000193730870383 · doi ↗ · pubmed ↗
- 3Terwee CB, Prinsen CAC, Chiarotto A et al (2018) COSMIN methodology for assessing the content validity of PRO Ms user manual version 1.0 (12 June 2025). https://www.cosmin.nl/wp-content/uploads/COSMIN-methodology-for-content-validity-user-manual-v 1.pdf
- 4Katiri R (2023) A consensus on outcome measures for device-based interventions that seek to restore bilateral and binaural hearing in adults with single-sided deafness. Ph D thesis, University of Nottingham. https://eprints.nottingham.ac.uk/73071/
