Introduction
Generic preference-based measures (GPMs) of health-related quality of life (HRQL) play important roles in the allocation of resources in health care because they provide the utility component of cost–utility analyses. Such analyses inform prioritisation of treatments by third-party payers in many jurisdictions and are mandated in England when the National Institute for Health and Care Excellence (NICE) commissions the appraisal of a health technology [
1].
There is concern that some GPMs are insensitive to interventions which improve hearing [
2‐
5]. The issue is relevant because impaired hearing is prevalent. A clinically significant loss is experienced by 10% of all adults and by 30% of those older than 70 years [
6,
7]. Sufferers must compete for resources to obtain treatments to alleviate their disability. There would be a failure of equity if treatments were denied not because they were ineffective but because GPMs failed to attribute appropriate value to their benefits.
The development of GPMs begins with researchers identifying dimensions on which good function corresponds to good health. Discrete levels of function are defined on each dimension ranging from poor to good. A subset of the possible combinations of levels is valued by a representative sample of the public. These informants use methods for measuring preferences [
8] such that their valuations lie on a scale where 1 corresponds to perfect health, 0 to the state of being dead, and negative values to states considered worse than dead, if relevant. Statistical modelling is used to generate a valuation function which converts any combination of levels into a composite index whose value best aligns with the valuations of the informants. Finally, a questionnaire is compiled to elicit a respondent’s own level of function on each dimension. Then, the responses to the questionnaire and the valuation function are deployed to assign a value to the respondent’s HRQL that reflects public preferences.
We examined four widely used GPMs [
9‐
11]. The EuroQol Descriptive System [
9] includes dimensions relating to
Mobility, Self-care, Usual Activities, Pain/
Discomfort, and
Anxiety/
Depression. The version of the system until recently preferred by NICE [
12] defines three levels on each dimension (EQ-5D-3L). The Mark-2 version of the Health Utilities Index (HUI2) [
10,
13] includes dimensions relating to
Sensation, Mobility, Emotion, Cognition, Self-care, and
Pain, each with 4–6 levels. In the Mark-3 version (HUI3) [
14],
Sensation was decomposed into
Seeing, Hearing, and (being understood when)
Speaking. Self-care was re-worked as
Dexterity. The Short-Form-6D (SF-6D) [
11] includes dimensions relating to
Physical functioning, Role limitations, Social functioning, Pain, Mental health, and
Vitality, each with 4–6 levels. The dimensions are a subset of those in the Medical Outcomes Study Short-form Health Survey (SF-36) [
15], such that an SF-6D utility can be derived from responses to the SF-36 [
11].
Given differences in wording, dimensions, and methods of valuation, it is not surprising that the GPMs differ in their sensitivity to interventions [
16], including interventions which improve hearing [e.g.
2‐
4]. A systematic review of studies that measured HRQL in participants with impaired hearing using one or more of EQ-5D, HUI3, and SF-6D [
5] reached five conclusions: HUI3 was appropriate for estimating HRQL in studies involving impaired hearing; EQ-5D-3L was not responsive to modest changes in hearing; few studies had been designed to compare GPMs; only one study had included SF-6D; additional ‘head-to-head’ comparisons of GPMs were needed to understand why differences in sensitivity arose.
We made such a comparison by investigating the sensitivity of the four GPMs to cochlear implantation—a clinically effective intervention [
17] which restores auditory sensations to people with little or no functional hearing [
18,
19]. The primary goal of implantation is to improve the ability to understand speech. A secondary goal is to attenuate tinnitus [
20,
21]—“the conscious experience of a sound that originates in an involuntary manner in the head of its owner, or may appear to do so” [
22]. Up to 80% of candidates for implantation report some degree of tinnitus [
23].
The rationale for the study was that the categorical change from dysfunctional hearing to viable hearing that is brought about by implantation would allow scope for GPMs to show improvement if they were sensitive to hearing. The study adds to those in the systematic review [
5] of which only two were prospective evaluations of implantation where participants completed questionnaires themselves; both of those studies included only one GPM.
Against that background, we identified reasons why the four GPMs differed in their sensitivity to implantation. We distinguished their response to impaired speech reception from their response to tinnitus, and we considered the implications for the choice of GPM to use in studies of hearing.
Discussion
The data reported here were gathered in the late 1990s. Improvements in cochlear implants and changes in candidature might have rendered the results irrelevant to today’s outcomes. Two considerations counter those concerns. First, criteria of candidacy for unilateral implantation of adults in England [
17] remain the same as the entry criteria for the study, although they are currently under review [
33]. Second, accuracy of speech reception following implantation has plateaued since the mid-1990s [
34]. Thus, participants in the study, and patients receiving implants in England today, are likely to come from the same population and to demonstrate the same patterns of association among outcome measures.
With those caveats, the study provides evidence at three levels of detail to inform the choice of GPM to use in assessments of interventions for hearing disorders. First, at the level of composite indices, the results corroborate previous demonstrations [
2‐
5] that GPMs differ in their sensitivity to treatments which improve hearing (Table
2): HUI2 and HUI3 showed moderate or large responses to implantation; EQ-5D-3L showed a small response; and SF-6D showed no response. In consequence, economic analyses informed by HUI2 or HUI3 suggest that implantation is a cost-effective intervention, while analyses informed by EQ-5D-3L or SF-6D suggest that it is not (Online Resource 5). That result might argue that only HUI3 need be used in studies of hearing.
However, limitations in all of the GPMs emerge at the second level of detail where the changes associated with implantation on individual dimensions are considered (Table
3). Significant changes occurred on three groups of dimensions: positive changes resulting from improved auditory sensitivity; positive changes resulting from improved psychological well-being; and negative changes presumably resulting from pain caused by the surgical wound and from irritation induced by wearing the external parts of the implant system against the scalp. Limitations of the GPMs are shown by the fact that no GPM displayed all three types of change.
It is not surprising that the composite indices of HUI2 and HUI3 were sensitive to the improvement in speech reception, given that the questions which map respondents onto the levels of the
Sensation (HUI2) and
Hearing (HUI3) dimensions ask about the ability to “hear what is said” in conversation. More surprising may be that neither
Usual Activities (EQ-5D-3L) nor
Role Limitations or
Social Functioning (SF-6D) were similarly sensitive. However, participants had been severely profoundly hearing impaired for nearly 15 years on average (Table
1), so their “usual” activities would have adapted to constraints imposed by their deafness. Also, the questions that map respondents onto levels of the
Role Limitations and
Social Functioning dimensions emphasise limitations arising from health. The loss of hearing sensitivity that underpins impaired speech reception is not usually the consequence of disease [
35] nor is it generally perceived to be a manifestation of ill-health [
36].
In contrast, evidence from the third level of detail—individual differences in outcome measures before and after implantation—shows that annoyance due to tinnitus was perceived to be strongly related to perceptions of health (Table
4b). It was associated with poorer levels of function on dimensions related to physical activity (
Mobility and
Usual Activities in EQ-5D-3L,
Role Limitations and
Social Function in SF-6D), pain (
Pain/
Discomfort in EQ-5D-3L,
Pain in HUI2, HUI3, SF-6D), and mental health (
Anxiety/
Depression in EQ-5D-3L,
Emotion in HUI3,
Mental Health in SF-6D). This pattern (see also [
37]) can be rationalised by the ideas that the negative influence of tinnitus on the capacity to concentrate reduces productive activity [
38], that tinnitus is discomforting and akin to pain [
39,
40], and that tinnitus either elevates anxiety [
41] or anxiety exacerbates the experience of tinnitus [
42].
The conclusion that the GPMs are more attuned to tinnitus than impaired speech reception is reinforced by the finding that variation among participants in the change in self-reported health, EQ-VAS, was correlated with the change in
Tinnitus, not the change in
Speech (Table
4). The conclusion is supported by supplementary analyses in Online Resource 6 and is also compatible with analyses reported by Konerding et al. [
16] who examined patterns of association among dimensions of EQ-5D-3L, HUI2, and SF-6D. Correlated patterns across the three GPMs were found among dimensions related to mental health and, separately, physical functioning, and physical pain. The
Sensation dimension of HUI2, however, was not related to other dimensions, suggesting that sensory deficits are independent of other aspects of health. That conclusion anticipates the demonstration in the present paper and elsewhere [
2‐
5] that sensitivity to interventions which improve the ability to detect sounds is shown only by those GPMs that explicitly measure the benefits of improved auditory sensitivity.
Which GPM should be used in studies of hearing? Consider first that
Tinnitus was associated with aspects of health which are represented by dimensions in each GPM. All of the GPMs, therefore, have the potential to be sensitive to changes in health produced by interventions which reduce annoyance due to tinnitus. Although in the present study only EQ-5D-3L was sensitive to the small improvement in
Tinnitus (Table
4), in a clinical trial reported by Maes et al. [
43], both EQ-5D-3L and HUI3 distinguished patients whose tinnitus improved from patients whose tinnitus did not improve, with HUI3 showing slightly greater sensitivity. Thus, in jurisdictions where policy makers prefer HUI3, it should be used in studies of hearing, insofar as it has been shown to be sensitive to improvements in both tinnitus and auditory sensitivity.
In jurisdictions where policy makers prefer EQ-5D, it should be used to evaluate interventions intended to improve tinnitus, while HUI3 should be used to evaluate interventions intended to improve hearing sensitivity. That distinction aligns with the guidance given by NICE that alternatives to EQ-5D should be used to evaluate conditions for which EQ-5D lacks critical dimensions of health [
1]. Nonetheless, it is unsatisfactory to advocate the use of different GPMs to assess treatments for two aspects of dysfunctional hearing which can co-occur and which can both respond to the same treatment. The dilemma reinforces the aim of a consortium of researchers in the UK [
44] to develop a successor to the EuroQol Descriptive System which preserves sensitivity to the conventional dimensions of health while adding sensitivity to sensory disorders [
45,
46] among other changes.
Acknowledgements
This study was supported by the intra-mural programme of the Medical Research Council (UK) and Defeating Deafness—The Hearing Research Trust. At the time when the data were gathered, AQS and GRB were employed by the Medical Research Council. The UK Cochlear Implant Study Group involved clinicians in adult cochlear implant programmes in 14 NHS hospitals together with researchers at the MRC Institute of Hearing Research and the University of York. The lead surgeons and clinical coordinators of the programmes were Mr J. Toner FRCS and Mrs C. McAnallen (City Hospital, Belfast), Mr D. Proops FRCS and Mr H. Cooper (Queen Elizabeth Hospital, Birmingham), Mr C. Raine FRCS and Mr S. Khan (Royal Infirmary, Bradford), Mr R. Evans FRCS and Mr. J. Joseph (Princess of Wales Hospital, Bridgend), Mr R. Gray FRCS and Mrs I. Court (Addenbrooke’s Hospital, Cambridge), Mr A. Kerr FRCS and Dr. R. Barr-Hamilton (Royal Infirmary, Edinburgh), Mr J. Osborne FRCS and Ms M. Doran (Glan Clwyd Hospital, Bodelwyddan), Mr. R. Singh FRCS and Dr. A. Allen (Crosshouse Hospital, Kilmarnock), Mr J. Graham FRCS and Ms. W. Aleksy (Royal National Throat Nose and Ear Hospital, London), Mr A. Fitzgerald-O’Connor FRCS and Dr. T. Nunn (St. Thomas’ Hospital, London), Professor R. Ramsden FRCS and Mr. M. O’Driscoll (Royal Infirmary, Manchester), Mr G.M. O’Donoghue FRCS, Mr. K.P. Gibbin FRCS, and Mr. G. Armstrong-Bednall (University Hospital, Nottingham), Mr D. Chapman FRCS and Ms D. Gratton (Royal Hallamshire Hospital, Sheffield), and Mr P. Ashcroft FRCS, Mr M. Pringle FRCS, and Mrs J. Eyles (Institute of Sound and Vibration Research, Southampton). We thank Dr Pádraig Kitterick, Dr David Marshall, Mr John Foster, and Professor Karen Bloor for advice. We thank three anonymous reviewers whose critiques materially improved the paper.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.