
- Vol 39, Issue 10
The Most Important Thing You Need to Know About Meta-Analyses
A busy clinician does not have time to read the vast number of randomized controlled trials (RCTs) published about different treatments. But good meta-analyses save us time by aggregating and summarizing data from relevant trials. Individual RCTs are sometimes small and suffer from methodological flaws. The results of RCTs are often influenced by sampling error. Good meta-analyses help us base decisions on larger numbers of subjects and reduce the effects of sampling error. To make the most of meta-analyses, we need to understand the most important information reported by a good meta-analysis. That information is the “prediction interval.” Although meta-analyses can focus on other topics, this article focuses on meta-analyses of treatments.
Using clear rules, a systematic review searches for studies on certain interventions for certain medical conditions. The authors of the study review the data from relevant studies to determine if helpful patterns can be found.
A meta-analysis is a systematic review that also includes a statistical synthesis of the data and calculation of overall, or “summary,”
The summary effect tells us if a treatment’s average effect is clinically meaningful. But different patients and different populations of patients respond differently. Some patients will experience a below-average effect, some about an average effect, and some an above-average effect. Further, different studies report different average effect sizes. We need to know to what degree the effects are consistent or inconsistent. In meta-analysis, the degree of consistency or inconsistency is referred to as “dispersion” or “heterogeneity.”
Dispersion
Dispersion exists for 2 reasons: dispersion of true effects sizes and dispersion secondary to sampling error. True dispersion exists because different RCTs differ in at least 3 ways. First, different RCTs may include different subsets of a population (eg, 1 study may provide a particular drug to patients aged 18 to 64 years, and another RCT may provide the same drug to patients over aged 65 years). Second, RCTs are conducted according to different protocols (eg, 1 RCT may provide a form of
Dispersion due to sampling error occurs simply by chance. The sample drawn may not accurately reflect the population. For example, in a certain city, the population of individuals with depression may include 30% of patients who are 65 years and older. However, just by random chance, an experimental sample of patients with
We want to statistically eliminate the effect of sampling error and to know the dispersion of true effects. The prediction interval (PI) is the statistic that does this.
Prediction Interval
To fully explain the PI, we would have to see the equations and discuss them extensively. We do not have space for that, and the math is complex and requires many layers of computation. Instead, let’s introduce the concept and look at it graphically.
The equation for the PI is based on the standard deviation of the mean effect sizes of the included studies. A PI is defined with a level of statistical significance, usually P = .05, in which case, we have a 95% PI. Assume we did 100 more studies. Every study would have its own mean effect size, m1, m2, m3, etc, all the way up to m100. Those 100 study-level mean effects sizes would vary from study to study. For a 95% PI, 95 of the studies would have a study-level mean effect size that fits somewhere between the low end and the high end of the PI. Five of the 100 studies would have a study-level mean effect size that is below or above the PI. If the study-level means were normally distributed, the distribution would look something like the graph in
Let’s continue to look at the PI graphically and see how we can use the PI in clinical practice. In
In contrast, the green curve is much narrower. The results of the included studies are much more consistent. Even the leftward aspect of the green curve is above the clinically meaningful effect size of 0.5. If we provide the green treatment, we can be more confident that our patient will improve.
In
Of course, we cannot just look at numbers about treatment effects. In making clinical decisions, we also have to consider other factors, such as how serious the underlying medical condition is, the severity and probability of potential adverse effects, economic costs, all of those same factors about alternative treatments, and how likely it is that alternative treatments would work.
Prediction Interval vs Confidence Interval
Unfortunately, many—if not most—meta-analyses fail to report the PI. This omission makes it really hard to apply the results to clinical decision-making. Many, if not most, meta-analyses report the confidence interval (CI) and claim that this statistic represents dispersion or heterogeneity of true effect sizes. However, this is incorrect.
Whereas the PI is calculated from the standard deviation, the CI derives from a different statistic: the standard error of the mean. The 95% PI = MC ±1.96SD, where MC is the mean effect of all the studies combined and SD is the standard deviation. The 95% CI = MC ±1.96SE where SE is the standard error. The CI reflects how confident we are in the estimation of the combined average effect size. We summarize the differences between the PI and the CI in the
Let’s look at this graphically as well. Imagine that our population of interest is everyone in a particular city with depression and that we treat every patient with depression in a placebo-controlled trial. Assume that the bell curve in
As the number of studies grows, the CI narrows and that means we are more confident in how well we have estimated the true mean effect of treating the entire population. But just being confident in the mean effect size does not tell us how dispersed or how consistent the effects are overall. To make clinical decisions, we want to know how broad the base of the bell curve is and where the bell curve fits compared with a clinically meaningful effect. The PI tells us this information.
Many, if not most, meta-analyses report the inconsistency statistic I2 (also called the heterogeneity statistic) and claim that this statistic represents dispersion or heterogeneity of true effect sizes. However, this is incorrect. I2 is not an absolute number. I2 is the following ratio:
An analogy might help us understand why we want to focus on an absolute amount and not a ratio. Imagine that you ask me, “What are your total financial assets?” I answer, “My house constitutes 25% of my total assets.” You asked about an absolute amount, but I answered in terms of a ratio. My house might be worth $100,000, and my total assets might be $400,000. Or my house might be worth $500,000, and my total assets might be $2,000,000. Either way, the ratio by itself does not answer your question about the absolute amount of my total assets.
I2 is reported as a percentage. According to different cutoff points, I2 is described as low (about 25%), medium (about 50%), or high (about 75%) heterogeneity. This practice is illegitimate and should be abandoned.
Concluding Thoughts
There is much more we could discuss about meta-analyses, but it is the PI that best informs clinical decision-making. Do not worry too much about the math. Use the PI intuitively. Determine what you believe is a clinically meaningful effect size, and then graph the PI. Then compare the curve of the PI to that of the meaningful effect size. Finally, estimate whether the treatment is likely to help your patient at issue.
For more information, see Borenstein et al, Introduction to Meta-Analysis, 2nd Edition.
Dr Moore is clinical associate professor of psychiatry at Texas A&M University College of Medicine and works at Baylor Scott & White Health Mental Health Clinic in Temple, Texas.
Articles in this issue
almost 3 years ago
Tackling Treatment Issues in Adult ADHDalmost 3 years ago
Important Lessons for Psychiatry and Beyondalmost 3 years ago
Mental Health’s Most Toxic Mythalmost 3 years ago
Exploring DTx for Clinical Carealmost 3 years ago
COVID-19 and Mental Health: Global Consequences and CAM Approachesalmost 3 years ago
Exploring Integrative Medicine and Nutrition for PTSDalmost 3 years ago
Neurobiological Mechanisms of Mind-Body Medicinealmost 3 years ago
Herbal Medicine: What Psychiatrists Need to Knowalmost 3 years ago
Contemplating RetirementNewsletter
Receive trusted psychiatric news, expert analysis, and clinical insights — subscribe today to support your practice and your patients.