|
|
||||||||
Education |
1. Birmingham Women's Hospital, Metchley Park Road, Edgbaston, Birmingham B15 2TG, UK Email: Pabedin{at}doctors.org.uk (corresponding author)
2. Birmingham Women's Hospital, UK
3. Birmingham Women's Hospital, UK
| Abstract |
|---|
|
|
|---|
Learning objectives:
Ethical issues:
Please cite this article as: Abedin P, Daniels JP, Khan KS. Health technology assessment in obstetrics and gynaecology. Part 1: an overview of the process. The Obstetrician & Gynaecologist 2007;9:109115.
Keywords bias / clinical trials / health technology assessment / meta-analysis / systematic reviews
| Introduction |
|---|
|
|
|---|
The evaluation of these technologies can be done through primary research, systematic reviews and model based syntheses of the scientific evidence. It is concerned with technologies and their costs in their widest sense and the judicious use of the information for policy and decision making, thereby showing the way forward for incorporating evidence into practice.
HTA considers the effectiveness, appropriateness and cost of technologies by asking four fundamental questions:
Two of the many national agencies involved in HTA are of particular interest: the National Institute for Health and Clinical Excellence (NICE) and the National Horizon Scanning programme. NICE is the independent organisation responsible for providing national guidance on the promotion of good health and the prevention and treatment of ill health in the UK. It does this by producing guidelines on public health, health technologies and clinical practice within the National Health Service (NHS) using the HTA process described in this article. NICE is an independent agency set up by the government that aims to promote clinical and cost effectiveness through the production of national guidance and audit. New and emerging technologies are assessed and all stakeholders, including industry, professional and patient groups are invited to participate in the decision making process before a health policy or guidance is decided upon. Once clinical guidance is published, health professionals are expected to take such recommendations into account when making decisions on the individual care of the patient.
National Horizon Scanning is an early warning system to identify new technologies that have the potential to impact on health services. It is designed to identify significant and urgent advances regardless of clinical specialty and includes regular scanning of primary, secondary and tertiary information sources. Advance notice is provided to the Department of Health about new and emerging technologies that need consideration of clinical and cost effectiveness, safety and efficacy and the cost impact about 2 to 3 years before their launch.
A quality HTA will use skills in systematic review techniques, health economics, statistics, modelling, clinical intuition and health expertise (Box 1). We provide an overview of some of these below.
|
| Systematic reviews and meta-analyses |
|---|
|
|
|---|
Steps in conducting a systematic review
Formulating the question
There are several key components to a well formulated question. A clearly defined question should specify the population under study, the intervention(s) or exposures and the types of outcomes that are of interest. In addition, the types of study designs that are relevant to answering the question should be specified.
Study designs are mainly divided into two broad categories:4
Randomised controlled trials are considered the most reliable source of data when addressing questions regarding therapeutic efficacy, whereas other study designs are appropriate for addressing other types of questions. Questions relating to aetiology or risk factors can be addressed by case control and cohort studies. Questions relating to accuracy of tests are best addressed by cross-sectional design.
Searching the literature
The precision and validity of the findings of reviews are directly related to the comprehensiveness of the literature identification process. All important databases of reviews should be searched, along with conference proceedings and foreign language journals. To ensure there is no publication bias, unpublished papers should be searched for by contacting authors or searching in databases such as the SIGLE (System for Information on Grey Literature in Europe). The correct search term, incorporating the population, interventions, outcomes and study designs relevant to the review should be used.
Assessing the quality of the literature
The quality of the studies included in a systematic review will ultimately determine the impact it has and how reliable are its conclusions. This is dependent on the degree to which the included studies minimise bias and error in their design, conduct and analysis. Bias either exaggerates or underestimates the true effect of an intervention or exposure. There are several possible types of bias, such as selection, performance, measurement and attrition.4
There are published quality assessment instruments or scales that can be used to weigh studies according to their methodological rigour. These are usually in the form of checklists.56 One such scale was developed by Jadad,7 who validated a quality rating scale to evaluate RCTs, comprising adequacy of randomisation, double-blinding and description of withdrawals and dropouts.8 Other scales are also available, although some experts are of the opinion that, instead of using summary scores, it is preferable to use individual components of methodological quality to explore their influence upon the overall summary measure. The essential components of a quality RCT are discussed in more detail later in this article.
Summarising the evidence
A descriptive summary of the findings of studies included in a review is done by presenting the information about the study characteristics, their design and quality and their effects. A decision to carry out a meta-analysis, which is a statistical combination of results from two or more separate studies, must then be made. This increases the power or the chance of detecting a real effect as statistically significant if it exists. It also improves precision8 and answers questions not posed by the individual studies. However, if studies are clinically diverse, then a meta-analysis can be meaningless.
Meta-analysis should only be carried out when a group of trials is sufficiently homogenous in terms of participants, interventions and outcomes to provide a meaningful summary, although techniques are available to accommodate a degree of heterogeneity into meta-analyses.
Interpreting the results
The results of the individual studies and of the meta-analysis should be tabulated. The standard graphical output used in most systematic reviews, including Cochrane reviews, is the forest plot,9 although other formats are available.
The forest plot displays effect estimates and confidence intervals for both individual studies and meta-analysis (Figure 1): the area of the block indicates the weight assigned to the study in the meta-analysis, while the horizontal line depicts the confidence interval (this is usually 95%). If the summary effect estimates lie to the side of the vertical line denoting the intervention arm, it means there is a benefit of the experimental intervention. If, however, they lie towards the side of the control arm, there is no benefit. If the 95% confidence interval crosses the vertical line, there is not a statistically significant overall benefit of one intervention over the other.
|
| Randomised controlled trials |
|---|
|
|
|---|
The main advantage of randomisation is that it ensures that the groups of participants are comparable at the beginning of the trial, so that any difference seen later can be attributable either to the effect of the intervention or pure chance. However, the integrity of randomisation can be broken down if the randomisation sequence can be predicted (for example, by alternative or repeating allocations) or subverted. Once committed to the trial, participants should not be withdrawn in the hope of receiving a more favourable allocation on a second attempt. An independent randomisation service avoids the temptation to cheat. Furthermore, once allocated a treatment, the participants should be followed up and analysed in that group, irrespective of whether they actually received or completed the treatment course. Failure to perform this intention to treat analysis negates all the gains of randomisation and can significantly bias results. Periodic checking of the baseline balance of prognostic factors between groups throughout the trial provides reassurance that strict randomisation is maintained, although performing statistical tests on multiple variables will produce significant differences through chance alone, so they should not be applied.
A clear definition of the eligibility criteria will establish the nature of the population under study, allowing clinicians to consider how applicable the evidence is to an individual patient situation. The calculated number of participants needed to demonstrate or refute an effect of an intervention should also be clearly stated to reassure the reader that the trial was large enough to detect a meaningful and realistic difference between the groups. Inevitably, not all participants will complete the trial, but dropouts and non-compliance should not be hidden. The CONSORT diagram is the favoured method of demonstrating the flow of participants through a trial.
A thorough discussion of the pitfalls of analysis is beyond the scope of this article: suffice to say there are many ways of distorting the results of the trial through inappropriate statistical analysis. Undue emphasis on particular subgroups should be regarded as suspicious, as such analyses will be underpowered and likely to produce significant results by chance. Ideally, results should be reported as an estimate of the likely treatment effect with a measure of variability, usually a 95% confidence interval, which allows the reader to gauge the clinical significance of the intervention.
| Health economic evaluation |
|---|
|
|
|---|
An economic evaluation of a health technology is the comparative analysis of alternative courses of action by identifying, measuring, valuing and comparing both their costs and consequences. The latter information is obtained from systematic reviews and primary research, including RCTs.
There are essentially four types of economic evaluation.13 They differ with respect to the nature of the consequences being examined, although the method of cost evaluation remains essentially the same.
Cost minimisation: it is assumed that the consequences of alternative treatment options are the same and the least costly alternative is then adopted. An example would be day surgery versus inpatient surgery for the same procedure. It is assumed that the result of the surgery would be the same but the cost would be different, depending on whether the patient is a day or inpatient.Cost effectiveness analysis: the cost is related to a single common effect, which can differ in magnitude between the alternative programmes. An example is a renal programme in which the cost effectiveness of renal transplantation is compared with that of dialysis. Here, the unit of comparison would be the cost per life-year saved.
Costbenefit analysis: the outcomes of interest can be multiple and they need to be considered together to do a full evaluation. With reference to the above example, if home dialysis is also compared with hospital dialysis and quality of life, and medical complications considered in addition to the years of life gained, to do a costbenefit analysis we would need to compute cost effectiveness ratios for the three effects.
Costutility analysis: utilities are employed as a measure of the value of health programmes. This is done is by calculating the quality-adjusted life-year (QALY), which takes into account both quality and quantity of life generated by healthcare interventions.14
A QALY places a weight on time in different health states. A year of perfect health is 1; however, a year of less than perfect health is less than 1. Death is considered to be 0; however, some health states can be considered worse than death and can have negative scores.
The costutility ratio is, thus, the additional cost required to generate a year of perfect health (one QALY). Comparisons can be made between interventions and priorities can be established based on those interventions that are relatively inexpensive (low cost per QALY) and those that are relatively expensive (high cost per QALY).
There are various instruments used to produce health utility values leading to a calculation of a QALY.15 These include the SF-36 (sickness impact profile/Nottingham health profile) and the EQ-5D (a health-related quality of life measure which includes social preferences) (Box 2).
|
The main types of models are decision tree analyses (Figure 2) and Markov models (Figure 3).17 In the former, the decision tree is structured around probabilities and outcomes, the latter being quantified in the form of cost, utilities, life-years and QALYs. Markov models on the other hand, represent disease processes that evolve over time. They assume a variety of health states that individuals can have at a given time and that they move from one state to the other with a given probability. They are used to estimate long-term costs and life-years or QALYs gained.
|
|
The theme is to be continued in the next issue, in which we will elaborate on health technology assessment as it is applied to obstetrics and gynaecology.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |