|
| Volume
— Use
of Evidence:
Quality of Care — From Variation to Improvement
newest
06/10/09: Variations
in Care Costs and Care Quality: Evidence (Lack of), Quality
of Care and Physician Revenue: “The Cost Conundrum:
What a Texas town can teach us about health care”
by Atul Gawande
Contents
Go to
DelfiniClick™
for all volumes. |
|
The
Volume of Inappropriate Care in the US
Could things be
getting worse in the area of clinical quality? We have seen
estimates of 20%-30% inappropriate care in the past, but in
the May/June 2004 issue of Health Affairs (Vo 23;3: 247-255),
Kerr et al. report that in a study of approximately 7,000 people
in 12 metropolitan areas in the United States, on average, residents
received only 50% to 60% of recommended care (recommended care
was based on 439 quality indicators across thirty conditions
and preventive care recommendations developed by RAND and validated
by four multi-specialty expert panels using a modified Delphi
method). And this doesn't even address the other side of inappropriate
care — overuse. We continue to believe the answer lies
in organizations creating a system of evidence-based solutions
for care.
Full text is available
to subscribers and can be obtained for a fee for non-subscribers
at http://www.healthaffairs.org/
|
More
On Therapies Becoming “Routine Clinical Practice”
Without Evidence & Why We Are Doing This Work: Radiofrequency
for the Treatment of Gastro-esophageal Reflux Disease Critical
Appraisal Example
04/01/08
Several years ago,
we received a phone call from a medical director asking if we
would do a limited review of radiofrequency for the treatment
of gastro-esophageal reflux disease (GERD in the US; GORD in
UK). He had received a request for coverage of this procedure
by a physician and was concerned that there was not good scientific
evidence supporting use of this procedure. His health plan policies
required proof of efficacy.
We agreed. We contacted
the requesting physician and asked him to recommend his best
three studies for review. (He pleaded for more - we relented
and said, “Okay, four.”) Of the 4 studies he sent
us, only 1 was a randomized controlled trial (RCT), and so we
had to surmise that if he had been given permission to send
us even more, those additional studies, too, would have been
observational.
Observational studies
should not be used for addressing questions of efficacy of interventions
with rare exception.[1,2] The only exception that is thus far
agreed upon is when all-or-none results are achieved, and this
is extremely rare.
And one RCT generally
will not be considered sufficient to declare “proof.”
Even a meticulously designed, executed and reported RCT may
still result in findings merely due to chance, not to mention
that only through replicated studies might one discern the unfortunate
occasional fraud. And this RCT could not even come close to
being a candidate for definitive proof of anything. While thoughtfully
designed by investigators more knowledgeable of research design
than many we have seen, it was very small and had myriad problems
as our critique — reproduced below — demonstrates.
Even the authors
- very responsibly - themselves acknowledged that their study
should only be considered preliminary and to have proved nothing
(a point that the requesting physician seemed to have overlooked
in his desire for a solution for this problem that he, himself,
believed in).
And now, in The
Lancet, Mehran Anvari[3] of St Joseph’s Healthcare, Hamilton
ON, Canada, comments upon a recently published review which
considers clinical trials from the past three decades on the
use of endoluminal therapies for GERD/GORD.[4]
Anvari writes that
the systematic reviewers found that “…no study of
any techniques has provided adequate ‘scientific and clinical
data on their safety, efficacy and durability to support the
use of endoluminal therapies for GORD [gastro-oesophageal reflux
disease] in routine clinical practice.’ Thus we could
be left with a sense of pessimism about the future use of endoluminal
therapies for this common disease, at a time when similar endoluminal
approaches are being proposed for other common disorders, such
as morbid obesity.”
He tells us that
the reviewers, “…also highlight the surprising lack
of well-designed randomised trials, even though many of the
endoluminal therapies have been in clinical use for several
years and many of the trials of these techniques have come from
respected academic centres.” He goes on to say, “It
almost seems that the standards established for the
assessment of treatment efficacy in the control of GORD…
were lowered [emphasis ours] when it came to assessing
these approaches.”
He also presents
the sobering news that, “As a result, many patients have
undergone procedures that have since been abandoned and proven
ineffective, and an unfortunate minority has had substantial
side-effects or even died.”[5-9]
Indeed. And how
many patients may have been told that research studies have
shown benefits of this treatment? Isn’t it possible that
some patients would not have elected to undergo this procedure
if they had been told, what Anvari states — most correctly
— in his commentary, “…up to this point, there
is no clear evidence to support the use of any of the endoluminal
therapies…in routine clinical practice.”
This is what we
advocate:
- All health care
professionals who care for patients need to recognize that
the vast majority of published research in even the best medical
journals is so flawed in design, execution and/or reporting,
or is not clinically useful, that it should not be used to
inform health care decision-making. Patients’ quality
of care — and even their lives — depend upon this
understanding. Of the approximately 12,000 articles published
in the medical literature each week, it is our estimate, having
evaluated thousands of published studies, that fewer than
10 percent of the published studies in health care are both
valid and clinically useful, while others have estimated that
the number is less than 5 percent.[10]
- And most users
of the medical literature lack the skills to tell. The vast
majority of physicians, clinical pharmacists and nurses fail
our simple critical
appraisal pre-test. (Read more shocking facts here.)
- It is incumbent
upon every such professional to attain the skills to discern
valid and clinically useful research from that which is not.
- This means that
—
- Health care
researchers need these skills before subjecting patients
to medical research.
- Medical
editors, peer reviewers and health care writers need these
skills in order to avoid passing on the flood of misleading
information. It is one thing to publish work that may
legitimately raise hypotheses. We, personally, think it
is perilous to publish work (and even perform research)
that is clearly severely flawed.
- Publishers
also need to change their editorial policies which frequently
emphasize “discussion” over “reporting
of methods.” Users need to see methodological details
to determine whether they agree that the study is valid.
It is not sufficient to tell a reader that a study was
blinded. Readers need to evaluate the likelihood of success
of blinding — and only details will do. Transparency
is vital.
- Physicians,
clinical pharmacists, nurse practitioners and others directing
or influencing patient decision-making need to attain
the skills to evaluate the health care information put
before them.
- Leaders
need to attend to this urgent need and provide the appropriate
culture and work elements, such as structures, processes,
methods, staff roles, skills and tools for evidence-
and value-based practice.
- Professional
societies, advocacy groups, experts writing reviews, systematic
reviewers, guideline developers and others who create
secondary studies and secondary sources need to develop
these skills before producing “guidance.”
- Policy makers
and payers need to be aware of the huge problems in the
quality of health care information and attain a true understanding
of the evidence before dictating sweeping changes with
far reaching impact that may only contribute to the cost
and quality problems in health care.
- Manufacturers
need to gain an understanding of what makes for valid
and clinically useful science and how it needs to be reported.
- Legal professionals
need to become responsible and help raise the bar for
evidence that is valid and clinically useful.
- And much
of it all starts in the schools. Academicians must gain
these skills themselves. And they must strive to identify
ways to successfully inculcate evidence-based practice
knowledge and habits in their students.
While some would
say if there is no evidence of benefit, don’t do it, we,
personally, do not wish to tell others how to practice medicine.
That is not what we see our role to be. No evidence of benefit
is not the same thing as evidence of no benefit. And it is possible
that something for which there is no evidence works. (In fact,
some topics are so inherently difficult to study that we will
never have good evidence — and this doesn’t mean
that practitioners will stop trying to find solutions and offering
options ot patients.)
We do, of course,
advocate that a science-based approach be taken in all possible
instances. It is important to try and study — correctly
— cause and effect for interventions. It is important
to report findings responsibly. It is important that all users
of such information have skills to evaluate science and that
they do so.
Know and clearly
state the evidence first! Then decisions can be made taking
into account other factors.
However, it is
our strong belief that patients should be provided with honest
information to enable them to make choices that are in their
best interest in accordance with their values and preferences.
This requires that all working in health care decision-making
acquire the appropriate skills. There are many, many resources
available to help achieve this — including information
we make freely available on our site from our own work and the
work of others.
Our patients deserve
this — and indeed — their lives depend upon it.
And that is why we are doing this work.
|
Improvement
of gastroesophageal reflux symptoms after radiofrequency
energy: a randomized, sham-controlled trial. Corley DA,
Katz P, Wo JM, Stefan A, Patti M, Rothstein R, Edmundowicz
S, Kline M, Mason R, Wolfe MM. Gastroenterology. 2003
Sep;125(3):668-76. PMID 12949712
Review
& Critique by Sheri A. Strite & Michael E. Stuart
MD, Delfini Group
January 14, 2006
Commentaries in PubMed:
See editorial by Kahrilas (Kahrilas PJ. Radiofrequency
energy treatment of GERD. Gastroenterology. 2003 Sep;125(3):970-3.
No abstract available. PMID: 12949740). Editorial urges
caution until further studies become available.
Issues
with Study:
Number
of People in Study (N)
- Small
N – small N may be especially prone to non-representation
Inclusions
- It appears
that patients who may still have benefited from proton
pump inhibitors (PPI) and H2 blocker treatment (i.e.,
those who had not failed medical therapy) were allowed
to participate in the study.
Randomization
- Sequential
randomization method was used which is a weaker method
more prone to “gaming” especially when concealment
of allocation is not adequate.
- There
is no mention of concealment of allocation which would
prevent “reassigning” patients, such as
use of opaque envelopes or a 1-800 call-in center for
example
- Studies
(Egger, M, Juni P, Bartlett C, Holenstein, Sterne J.
How important are comprehensive literature searches
and the assessment of trial quality in systematic reviews?
Empirical study. Health Technology Assessment 2003;
Vol.7: No.1) have shown that inadequate or unclear concealment
of allocation show more beneficial effects than adequately
concealed trials.
- One patient
(active treatment group) was determined to be ineligible
post-randomization (see discussion in Intention-to-Treat
Analysis).
Baseline
Characteristics
- No p-values
for baseline characteristics, and authors state that
there was a slight difference in age and PPI use between
groups. However, upon visual comparison the groups appear
equal for demographics and disease dimensions. Plus,
this is suggestive that randomization was successfully
achieved.
Blinding
- Authors
state study is double-blinded, however, no details of
blinding were reported excepting to state that an envelope
was used (again, no mention of opacity) and to state
that staff blinded to the treatment assignments performed
subsequent outcome assessments.
- The endoscopist
opened the envelope to determine assignment during the
endoscopy.
- Of some
concern is that the authors state that that “subsequent
contact between the subject and the endoscopist was
minimized…” — See Intention-to-Treat
Analysis. The only outcome that remained statistically
significant through the toughest test was subjective
(health-related quality of life) which could have been
influenced if patients had any direct, indirect or even
subliminal idea they had received RF. More in ITT Analysis…
- Authors
themselves state that they “cannot exclude a possible
bias from subjects guessing their treatment assignments
because of difference in post-treatment symptoms between
the active and sham groups.”
- Maintaining
blinding is highly important even when measures are
objective. Chalmers TC et al. Bias in Treatment Assignment
in Controlled Clinical Trials. N Engl J Med 1983;309:1358-61.
has shown that results of non-blinded studies may be
biased in favor of the intervention even with objective
measures
Concealment
of Allocation
- No details
of concealment reported
Deviations
from Protocol
- 1 patient
determined ineligible post-randomization.
Statistical
Analysis
- No analysis
of statistics was performed by Delfini.
Loss of Data
Points
- Significant
loss of data points. However, authors performed sensitivity
analysis using ITT methods including an imputation method
that put the intervention through the toughest test.
In that test, health-related quality of life remained
statistically significant. However, there are no details
reported of the actual outcomes in this analysis —
further see comments that potential subsequent contact
with endoscopist could have affected subjective measurement
of patients.
Analysis
Methods Used
- Authors
did not use Intention-to-Treat (ITT) as their primary
analysis method. They performed a completer analysis
as the primary method — and because they had over
5% loss to follow-up, these results are not dependable.
- Anturane
review by FDA demonstrated that even a 4.4% differential
loss of data points can result in meaningful differences
as the outcome went from a p-value of 0.058 to 0.16
— which represents a difference of a chance of
~1 in 17 to ~1 in 6 that findings are due to chance.
- The authors,
however, also did a sensitivity analysis that included
a last-observation-carried forward (LOCF) method plus
an extreme case analysis (see below for ITT discussion).
ITT Analysis
& Imputation Methods
- Authors
state that they performed sensitivity analyses on “all
patients” using LOCF and extreme case analysis
— putting the intervention through the toughest
test. It would be preferable to see the numbers used
and outcomes obtained rather than summary statements;
however, overall authors have demonstrated sophistication
and a great deal of clarity in reporting this study
(including ITT methods) — indicating knowledge
with key methodologic principles.
- Eligibility
was determined after randomization, further contributing
to patient loss — this, however, amounted to one
patient, and was 3.5 percent. Because the actual number
of “all” patients was not reported, it is
not possible to tell if this patient was included in
the various ITT analyses. However, this is less than
5 percent and so probably would not significantly affect
study results. Delfini reviewers are most focused on
results of sensitivity analysis as these were ITT analyses.
LOCF seems reasonable for imputing missing values for
this condition as a sensitivity analysis, and extreme-case
scenario reporting is always appropriate as it presents
the most severe challenge to statistically significant
differences between groups. Authors report statistically
significant improvement in the primary outcomes of heart
burn symptoms (LOCF and extreme-case) and quality of
life (extreme-case). There were no differences in any
analysis in daily medication use or esophageal acid
exposure times — however, study was not be powered
to detect differences between groups in these outcomes.
- Delfini
reviewers are most focused on results of sensitivity
analysis as these were ITT. LOCF seems reasonable for
imputing missing values for this condition and extreme-case
scenario reporting is always welcome as it presents
the highest bar.
- Authors
report statistically significant improvement in the
primary outcomes of heart burn symptoms (LOCF and extreme-case)
and quality of life (extreme-case). There were no differences
in any analysis in daily medication use or esophageal
acid exposure times — however, study would not
be powered for these outcomes.
Mechanism
of Action
- Per editorial,
it is uncertain whether the results are due to reduced
gastroesophageal reflux (paper suggests not) or are
due to diminished esophageal sensitivity.
Clinical
Significance
- This study
does not provide sufficient evidence to conclude that
radiofrequency ablation is effective and safe. Because
of the above-mentioned threats to validity and the patient
population, conclusions regarding symptom relief and
improved quality of life should be drawn cautiously.
Further confirmatory studies that include patients who
have not responded to H2-blockers and PPIs would be
useful. Undetected long term harms remain a possibility.
Patients should be aware that the medical evidence is
sparse and that uncertainty regarding the appropriate
role of radiofrequency therapy in the treatment of gastroesophageal
reflux symptoms remains.
Safety
- There
were more ADEs reported in the active treatment group
than in the sham group — no p-values are reported.
Specific ADEs were not determined a priori which increases
likelihood that any reported outcomes are due to chance.
- One active
treatment patient with an esophageal ulcer had self-limited
bleeding and had inpatient observation overnight.
- A patient
in the sham group developed pneumonia and received inpatient
antibiotics; the pneumonia was thought to be due to
aspiration during the endoscopy.
- One active
treatment patient experienced increased bloating; a
nuclear medicine gastric emptying study showed worsening
of a pre-existing delayed gastric emptying disorder.
- Some active
and sham patients experienced temporary post-procedure
retrosternal discomfort that required oral analgesics.
These transient symptoms were not fully quantifiable
with the measurement instruments.
Other
- In many
respects, this is a very well done and very well reported
study. The authors clearly have understanding of research
excellence in design and execution.
- A
study such as this cannot be sufficient to “prove”
a therapy’s efficacy. Proof is not possible with
main issues such as small N in a single study, questions
about maintenance of blinding, plus a need for assurance
that the LOCF procedure was truly done as intention-to-treat
with all randomized patients especially given the subjective
measures of heartburn and HRQOL. Furthermore, even in
this research setting, the reported benefits are minimal
and extrapolating benefits to clinical practice should
be done with caution. Without confirmatory studies,
radiofrequency therapy for GERD symptoms should be considered
unproven. However, for a small study, this is a very
thoughtfully-designed, apparently well-executed study
as reported. It is suggestive that there may be benefits
from this procedure. Further research is clearly needed
to determine if this treatment can help patients avoid
more invasive and risky procedures.
|
References
1. Delfini Primer: Problems
with Case Series
2. Delfini Primer:
Problems
with the Use of Observational Studies to Draw Cause and Effect
Conclusions About Interventions
3. Anvari, M. Endoscopic
treatments for gastro-oesophageal reflux disease. [Comment]
The Lancet. Vol 371. March 22, 2008. 965-966
4. Fry LC, Mönkemüller
K, Malfertheiner P. Systematic review. Endoluminal therapy for
gastro-oesophageal refl ux disease: evidence from clinical trials.
Eur J Gastroenterol Hepatol 2007; 19: 1125–39.
5. Pleskow D, Rothstein
R, Lo S, et al. Endoscopic full-thickness plication for the
treatment of GERD: a multicenter trial. Gastrointest Endosc
2004; 59: 163–71.
6. Noh KW, Loeb
DS, Stockland A, Achem SR. Pneumomediastinum following enteryx
injection for the treatment of gastroesophageal refl ux disease.
Am J Gastroenterol 2005; 100: 723–26.
7. Wong RF, Davis
TV, Peterson KA. Complications involving the mediastinum after
injection of Enteryx for GERD. Gastrointest Endosc 2005; 61:
753–56.
8. Tintillier M,
Chaput A, Kirch L, Martinet JP, Pochet JM, Cuvelier C. Esophageal
abscess complicating endoscopic treatment of refractory gastroesophageal
refl ux disease by enteryx injection: a fi rst case report.
Am J Gastroenterol 2004; 99: 1856–58.
9. Tuebergen D,
Rijcken E, Senninger N. Esophageal perforation as a complication
of EndoCinch endoluminal gastroplication. Endoscopy 2004; 36:
663–65.
10. Ebell M. An
introduction to information mastery, July 15, 1988. http://www.poems.msu.edu/InfoMastery/default.htm.
Accessed December 21, 2007.
|
| Variations
in Experts' Recommendations
For more than two
decades, observers like Jack Wennberg have pointed out geographic
variations in health care practices and recommendations in the
United States. David Eddy has elegantly demonstrated extreme
variation in experts’ recommendations for health care
interventions. And yet, many clinicians rely on expert opinion
as their primary source of information. EBM emphasizes the need
to critically appraise information sources for validity and
clinical relevance as a way of dealing with such variation.
Recently Schaafsma
FG, Verbeek JH, Hulshof CT, van Dijk FJ have reported on how
expert opinion differs from information gathered from an evidence-based
approach (Caution required when relying on a colleague's advice;
a comparison between professional advice and evidence from the
literature. BMC Health Serv Res. 2005 Aug 31;5(1):59. PMID:
16131405)
The authors presented
12 occupational medicine cases to 14 occupational medicine specialists
from differing geographical areas in the Netherlands. The authors
also searched the literature and critically appraised the evidence
on the 12 problems. Questions regarding management were stated
so that specialists could answer with yes/no responses. [Example:
For a 36-year old caretaker at a secondary school with a lateral
ankle ligament rupture treated with tape for three weeks, is
it safe to resume work? Evidence-based answer: Yes]
The authors found
that 53% (95% confidence interval 42% to 65%) of all professional
advice was not in line with the research literature. In 18 cases
(24%), professional advice explicitly referred to up-to-date
research literature as their source of information. These cases
were substantially less incorrect (17%) than advice that had
not mentioned the literature as a source (65%). The authors
conclude that less than half of the given professional advice
by experts to a practical occupational health problem was in
line with evidence from the research literature.
This article points
out the need for caution in accepting information from “experts”
without evaluating the actual evidence from the published medical
literature.
Full text
of this article is available at --
http://www.biomedcentral.com/content/pdf/1472-6963-5-59.pdf
|
|
Variations
in Care Costs and Care Quality: Evidence (Lack of), Quality
of Care and Physician Revenue
06/01/09
Thanks to our friend,
Edward A. Walker, MD, MHA, Professor, Department of Psychiatry
and Behavioral Sciences, Cheryl M. Scott / Group Health Cooperative
Professor of Health Administration & Director, UW Healthcare
Leadership Development Alliance, who alerted us to an excellent
and thought-provoking article published in the New Yorker in
their Annals of Medicine column on June 1, 2009:
“The
Cost Conundrum: What a Texas town can teach us about health
care”
by Atul Gawande
Gawande takes a
journey throughout the US to ostensibly answer the question,
“Why is health care so expensive in McAllen County, Texas,
when the explanations do not seem to be population-dependent,
nor the answers better resulting care?” And at the end
of his investigation, he concludes that “…the damning
question we have to ask is whether the doctor is set up to meet
the needs of the patient, first and foremost, or to maximize
revenue.”
Some key quotes
from Gawande:
“…having
a system that does so much to misalign [needs of the patient
to physician revenues] has proved disastrous. As economists
have often pointed out, we pay doctors for quantity, not quality.
As they point out less often, we also pay them as individuals,
rather than as members of a team working together for their
patients. Both practices have made for serious problems…"
"Imagine
that, instead of paying a contractor to pull a team together
and keep them on track, you paid an electrician for every
outlet he recommends, a plumber for every faucet, and a carpenter
for every cabinet. Would you be surprised if you got a house
with a thousand outlets, faucets, and cabinets, at three times
the cost you expected, and the whole thing fell apart a couple
of years later? Getting the country’s best electrician
on the job (he trained at Harvard, somebody tells you) isn’t
going to solve this problem. Nor will changing the person
who writes him the check.”
Highly compelling
reading addressing one of the most vital problems we face. The
full text is available here: »
|
| Variations
in Clinicians' Estimates of Pretest Probability
Newer EBM texts,
when addressing the areas of diagnosis and screening, frequently
state that diagnostic tests that produce large changes from
pretest to post-test probabilities are very useful clinically.
The teaching goes
like this: Likelihood ratios can be utilized along with practitioners’
clinical estimate of the pretest probability of disease, incorporating
the results of the diagnostic tests and then deriving an individual
patient’s probability of having a disease or condition.
Phelps and Levitt
point out (Acad Emerg Med. 2004 Jun;11(6):692-4.) that this
approach to generating post-test disease probabilities has never
been validated and that the approach may result in very inconsistent
results between clinicians.
In a cross-sectional
cohort study of emergency and internal medicine residents and
faculty, the authors presented clinical vignettes and asked
clinicians to estimate the likelihood of common diseases based
on history and physical exam findings. No lab or imaging results
were provided. They reported that the mean pretest probability
estimates of disease ranged from 42% (95% confidence interval
[95% CI] = 36.6% to 47.4%) to 77% (95% CI = 72.9% to 81.1%).
The smallest difference in pretest probability magnitude for
a single vignette was 70% (range 30-100%; interquartile range
[IQR] 64-80%), whereas the largest was 95% (range 3-98%; IQR
30-60%).
Their conclusions
were that there is wide variability in clinicians’ estimates
of pretest probability of disease and that post-test disease
estimates are also likely to be inconsistent.
Because of such
wide variability, we prefer to concentrate on more traditional
statistics derived from the classical 2x2 table such as sensitivity,
specificity, positive predictive value and negative predictive
value and skip the hassle of estimating pretest probability
and using nomograms or tables to generate post-test probabilities.
The abstract of
this article can be found at:
Go to http://www.pubmed.gov
and enter or copy and paste 15175211 in the
search field. |
| Underuse
of Proven Interventions
One of the main reasons for using valid, relevant
evidence in health care is to more accurately predict outcomes
from various interventions and thus be equipped to make informed
choices. For example, it is very useful to know the evidence
for outcomes by risk category for cardiovascular disease. The
higher an individual’s risk for a cardiovascular event,
the greater the benefit from interventions such as lipid-lowering
with statins.
Because
of increased benefits, we would expect patients who are at the
highest baseline risk for CV events to receive the most aggressive
lipid-lowering treatment. Ko et al. (JAMA.2004;291:1864-1870),
however, have demonstrated that the prescription of statins
in Canadian patients older than 66 years of age with a history
of CV disease or diabetes mellitus decreased progressively as
baseline CV risk increased.
In their database
study:
- Only 19% of
the 75,617 patients received statins for secondary prevention;
- In patient
66-74 years of age, the adjusted probabilities for receiving
statin Rx was 37.7% (95% CI 37.3-38.2), 26.7% (26.4-27.1),
and 23.4% (22.8-23.9) in the categories of low, intermediate
and high baseline risk, respectively.
Why would such
a small percentage of elderly patients with CV disease or diabetes
be prescribed lipid-lowering drugs? Why would the likelihood
that physicians prescribe statin therapy be inversely correlated
with CV risk?
Comment: This is
yet one more study demonstrating the huge problem of underuse
of proven interventions in health care. However, this is apparently
the first article pointing out the treatment-risk paradox and
is worth reading. The authors suggest that the treatment risk
paradox in these elderly patients might be explained by:
- Clinicians
not understanding tradeoffs between benefits and harms of
statins—clinicians may believe that the results of RCTs
of lipids cannot be applied to patients with comorbidites
on the grounds that these patients may experience fewer benefits
and greater harms than the study subjects;
- Physicians
may prejudge patients with co-morbidities—they may believe
that such patients will not adhere to lipid-lowering treatment
regimens; and/or,
- Physicians
may be inattentive to preventive care measures in elderly
patients with chronic conditions.
|
| Class
Effect: Caution Urged
Pharmacy and Therapeutics
committees and clinicians are frequently faced with issues regarding
substitution for agents in a specific drug class. CD Furburg
and BM Psaty, using ace-inhibitors (ACEIs) as an example, make
a number of interesting points in an article entitled, “Should
evidence-based proof of drug efficacy be extrapolated to a “class
of agents”? (Circulation 2003;108:2608-2510). Below is
a summary of some key points made by the authors of the article
and also some points made by the editorialists, EM Antman and
JJ Ferguson (Circulation 2003;108:2604-2607).
Generic
substitution is the act of dispensing a different brand
or an unbranded drug product that is the same chemical entity
and meets US Food and Drug Administration criteria for bioequivalence
(e.g., generic lisinopril in place of brandname Prinivil [Merck
& Co.] or Zestril [Zeneca Pharmaceuticals]).
Therapeutic
substitution is the dispensing of an alternate chemical
entity for the original drug prescribed by the physician from
the same general therapeutic class (e.g., a physician orders
enoxaparin, but the hospital pharmacy dispenses dalteparin;
a physician writes a prescription for ramipril, and the pharmacy
dispenses moexipril).
The HOPE trial,
using ramipril 10mg, reported the following relative risk reductions
in patients with vascular disease or diabetes and one additional
risk factor:
- 16% reduction
over placebo in all cause mortality
- 20% reduction
in MI
- 32% reduction
in stroke
However, the evidence
is less robust for some of the other ACEIs:
- There was no
decrease in ischemic events in subjects who had undergone
angioplasty and took Quinapril 20mg when compared to placebo
over 3 years
- In the PROGRESS
trial, there was no difference in stroke or coronary events
in the Perindopril 4 mg and placebo groups. (However, in the
EUROPA trial using 8 mg, there was a reduction in coronary
mortality and morbidity of 20%.)
- 4 ACEIs have
not been shown to reduce mortality or morbidity—2 agents
have not been tested and 2 have not shown a reduction in clinical
events.
Does it make sense
to declare all of the ACEIs equivalent? What about differing
doses that have not been tested?
Furburg and Psaty
point out that all 7 drugs have FDA approval for treating hypertension,
heart failure and LV dysfunction. All 7 have been shown to lower
BP and improve hemodynamic measurements. However, only 4 of
these drugs—enalapril, captopril, ramipril and trandolopril—have
been shown to improve long term survival or clinical outcomes
compared to placebo in large, RCTs.
They point out
that:
- The actions
specific to an individual agent may add to, subtract from
or have a neutral effect on the safety and efficacy attributed
to the common class actions of a drug class.
- Concluding
that equipotency exists rests on surrogate measures of efficacy.
- Equipotency
is difficult to establish in the absence of relevant outcome
data.
- The term “class
effect” has never been defined from a regulatory perspective.
- The class effect
concept is often overinterpreted to mean that all agents in
a class are interchangeable.
- Me-too drugs
can be produced without the risk and expense of long term
morbidity and mortality trials.
- Regulatory
agencies do not accept extrapolations, and FDA regulations
prevent manufacturers from making claims for unapproved indications.
- Safety is an
issue to consider.
Given all of the
above, the authors state that efforts should be made to use
drugs with the best-documented outcomes and in full doses. Further,
the use of me-too drugs without adequate documentation should
be restricted. Comparability should be established in trials
where two agents in a class are compared.
The editorialists
list the following minimal criteria they would require for defining
a class effect of a drug:
- A clearly defined
biological target or pathway.
- Comparable
efficacy demonstrated for multiple agents within the class
(with multiple randomized, controlled clinical trials for
each agent).
- Absence of
convincing evidence that there is a member of the class that
does not have comparable clinical benefit to that of other
agents within the class.
They also recommend
considering the following before deciding whether drugs within
a class are interchangeable:
- The absolute
and relative degree of benefit for each agent;
- The clinical
circumstances in which benefit has been demonstrated;
- The extent and
the depth of the evidence in favor of a particular agent;
- The safety profile
and tolerability of agents within a class;
- The cost of
the alternatives, viewed not only in terms of cost per quality
life-year saved versus placebo or an established control,
but also in terms of cost per quality life-year lost in the
case of less expensive, but also less effective, alternatives;
- The specific
details of the inclusion/exclusion criteria and the exact
management protocols used in the individual supporting clinical
trials; and,
- The particular
subgroups that show benefit (or lack of benefit) for agents
within the class.
|
|
|