| Use
of Evidence:
Quality of Care — From Variation to Improvement
newest
04/01/08: More
On Therapies Becoming “Routine Clinical Practice” Without
Evidence & Why We Are Doing This Work »
Contents
- The
Volume of Inappropriate Care in the US »
- newest
More
On Therapies Becoming “Routine Clinical Practice”
Without Evidence & Why We Are Doing This Work: Radiofrequency
for the Treatment of Gastro-esophageal Reflux Disease Example
»
- Variations
in Experts' Recommendations
»
- Variations
in Clinicians' Estimates of Pretest Probabilities
»
- Underuse
of Proven Interventions
»
- Class
Effect: Caution Urged
»
|
| The
Volume of Inappropriate Care in the US
Could
things be getting worse in the area of clinical quality? We have
seen estimates of 20%-30% inappropriate care in the past, but in
the May/June 2004 issue of Health Affairs (Vo 23;3: 247-255), Kerr
et al. report that in a study of approximately 7,000 people in 12
metropolitan areas in the United States, on average, residents received
only 50% to 60% of recommended care (recommended care was based
on 439 quality indicators across thirty conditions and preventive
care recommendations developed by RAND and validated by four multi-specialty
expert panels using a modified Delphi method). And this doesn't
even address the other side of inappropriate care — overuse.
We continue to believe the answer lies in organizations creating
a system of evidence-based solutions for care.
Full
text is available to subscribers and can be obtained for a fee for
non-subscribers at http://www.healthaffairs.org/
|
More
On Therapies Becoming “Routine Clinical Practice” Without
Evidence & Why We Are Doing This Work: Radiofrequency for the
Treatment of Gastro-esophageal Reflux Disease Example
04/01/08
Several years ago, we received a phone call from
a medical director asking if we would do a limited review of radiofrequency
for the treatment of gastro-esophageal reflux disease (GERD in the
US; GORD in UK). He had received a request for coverage of this
procedure by a physician and was concerned that there was not good
scientific evidence supporting use of this procedure. His health
plan policies required proof of efficacy.
We agreed. We contacted the requesting physician
and asked him to recommend his best three studies for review. (He
pleaded for more - we relented and said, “Okay, four.”)
Of the 4 studies he sent us, only 1 was a randomized controlled
trial (RCT), and so we had to surmise that if he had been given
permission to send us even more, those additional studies, too,
would have been observational.
Observational studies should not be used for addressing
questions of efficacy of interventions with rare exception.[1,2]
The only exception that is thus far agreed upon is when all-or-none
results are achieved, and this is extremely rare.
And one RCT generally will not be considered sufficient
to declare “proof.” Even a meticulously designed, executed
and reported RCT may still result in findings merely due to chance,
not to mention that only through replicated studies might one discern
the unfortunate occasional fraud. And this RCT could not even come
close to being a candidate for definitive proof of anything. While
thoughtfully designed by investigators more knowledgeable of research
design than many we have seen, it was very small and had myriad
problems as our critique — reproduced below — demonstrates.
Even the authors - very responsibly - themselves
acknowledged that their study should only be considered preliminary
and to have proved nothing (a point that the requesting physician
seemed to have overlooked in his desire for a solution for this
problem that he, himself, believed in).
And now, in The Lancet, Mehran Anvari[3] of St Joseph’s
Healthcare, Hamilton ON, Canada, comments upon a recently published
review which considers clinical trials from the past three decades
on the use of endoluminal therapies for GERD/GORD.[4]
Anvari writes that the systematic reviewers found
that “…no study of any techniques has provided adequate
‘scientific and clinical data on their safety, efficacy and
durability to support the use of endoluminal therapies for GORD
[gastro-oesophageal reflux disease] in routine clinical practice.’
Thus we could be left with a sense of pessimism about the future
use of endoluminal therapies for this common disease, at a time
when similar endoluminal approaches are being proposed for other
common disorders, such as morbid obesity.”
He tells us that the reviewers, “…also
highlight the surprising lack of well-designed randomised trials,
even though many of the endoluminal therapies have been in clinical
use for several years and many of the trials of these techniques
have come from respected academic centres.” He goes on to
say, “It almost seems that the standards established
for the assessment of treatment efficacy in the control of GORD…
were lowered [emphasis ours] when it came to assessing
these approaches.”
He also presents the sobering news that, “As
a result, many patients have undergone procedures that have since
been abandoned and proven ineffective, and an unfortunate minority
has had substantial side-effects or even died.”[5-9]
Indeed. And how many patients may have been told
that research studies have shown benefits of this treatment? Isn’t
it possible that some patients would not have elected to undergo
this procedure if they had been told, what Anvari states —
most correctly — in his commentary, “…up to this
point, there is no clear evidence to support the use of any of the
endoluminal therapies…in routine clinical practice.”
This is what we advocate:
- All
health care professionals who care for patients need to recognize
that the vast majority of published research in even the best
medical journals is so flawed in design, execution and/or reporting,
or is not clinically useful, that it should not be used to inform
health care decision-making. Patients’ quality of care —
and even their lives — depend upon this understanding. Of
the approximately 12,000 articles published in the medical literature
each week, it is our estimate, having evaluated thousands of published
studies, that fewer than 10 percent of the published studies in
health care are both valid and clinically useful, while others
have estimated that the number is less than 5 percent.[10]
- And
most users of the medical literature lack the skills to tell.
The vast majority of physicians, clinical pharmacists and nurses
fail our simple critical
appraisal pre-test. (Read more shocking facts here.)
- It
is incumbent upon every such professional to attain the skills
to discern valid and clinically useful research from that which
is not.
- This
means that —
- Health
care researchers need these skills before subjecting patients
to medical research.
- Medical
editors, peer reviewers and health care writers need these
skills in order to avoid passing on the flood of misleading
information. It is one thing to publish work that may legitimately
raise hypotheses. We, personally, think it is perilous to
publish work (and even perform research) that is clearly severely
flawed.
- Publishers
also need to change their editorial policies which frequently
emphasize “discussion” over “reporting of
methods.” Users need to see methodological details to
determine whether they agree that the study is valid. It is
not sufficient to tell a reader that a study was blinded.
Readers need to evaluate the likelihood of success of blinding
— and only details will do. Transparency is vital.
- Physicians,
clinical pharmacists, nurse practitioners and others directing
or influencing patient decision-making need to attain the
skills to evaluate the health care information put before
them.
- Leaders
need to attend to this urgent need and provide the appropriate
culture and work elements, such as structures, processes,
methods, staff roles, skills and tools for evidence-
and value-based practice.
- Professional
societies, advocacy groups, experts writing reviews, systematic
reviewers, guideline developers and others who create secondary
studies and secondary sources need to develop these skills
before producing “guidance.”
- Policy
makers and payers need to be aware of the huge problems in
the quality of health care information and attain a true understanding
of the evidence before dictating sweeping changes with far
reaching impact that may only contribute to the cost and quality
problems in health care.
- Manufacturers
need to gain an understanding of what makes for valid and
clinically useful science and how it needs to be reported.
- Legal
professionals need to become responsible and help raise the
bar for evidence that is valid and clinically useful.
- And
much of it all starts in the schools. Academicians must gain
these skills themselves. And they must strive to identify
ways to successfully inculcate evidence-based practice knowledge
and habits in their students.
While some would say if there is no evidence of
benefit, don’t do it, we, personally, do not wish to tell
others how to practice medicine. That is not what we see our role
to be. No evidence of benefit is not the same thing as evidence
of no benefit. And it is possible that something for which there
is no evidence works. (In fact, some topics are so inherently difficult
to study that we will never have good evidence — and this
doesn’t mean that practitioners will stop trying to find solutions
and offering options ot patients.)
We do, of course, advocate that a science-based
approach be taken in all possible instances. It is important to
try and study — correctly — cause and effect for interventions.
It is important to report findings responsibly. It is important
that all users of such information have skills to evaluate science
and that they do so.
Know and clearly state the evidence first! Then
decisions can be made taking into account other factors.
However, it is our strong belief that patients should
be provided with honest information to enable them to make choices
that are in their best interest in accordance with their values
and preferences. This requires that all working in health care decision-making
acquire the appropriate skills. There are many, many resources available
to help achieve this — including information we make freely
available on our site from our own work and the work of others.
Our patients deserve this — and indeed —
their lives depend upon it. And that is why we are doing this work.
|
Improvement
of gastroesophageal reflux symptoms after radiofrequency energy:
a randomized, sham-controlled trial. Corley DA, Katz P, Wo
JM, Stefan A, Patti M, Rothstein R, Edmundowicz S, Kline M,
Mason R, Wolfe MM. Gastroenterology. 2003 Sep;125(3):668-76.
PMID 12949712
Review
& Critique by Sheri A. Strite & Michael E. Stuart
MD, Delfini Group
January 14, 2006
Commentaries in PubMed:
See editorial by Kahrilas (Kahrilas PJ. Radiofrequency energy
treatment of GERD. Gastroenterology. 2003 Sep;125(3):970-3.
No abstract available. PMID: 12949740). Editorial urges caution
until further studies become available.
Issues
with Study:
N
- Small
N – small N may be especially prone to non-representation
Inclusions
- External
validity issue: patients appear to be candidates for the
procedure before demonstrating failure of PPI and H2 blocker
treatment. While this is not a threat to validity for this
study showing value for treatment with radiofrequency energy
(RF) for GERD as compare to sham, it is worth noting.
Randomization
- Sequential
randomization method was used which is a weaker method more
prone to “gaming” especially when concealment
of allocation is not adequate.
- There
is no mention of concealment of allocation which would prevent
“reassigning” patients, such as use of opaque
envelopes or a 1-800 call-in center for example
- Studies
(Egger, M, Juni P, Bartlett C, Holenstein, Sterne J. How
important are comprehensive literature searches and the
assessment of trial quality in systematic reviews? Empirical
study. Health Technology Assessment 2003; Vol.7: No.1) have
shown that inadequate or unclear concealment of allocation
show more beneficial effects than adequately concealed trials.
- One
patient (active treatment group) was determined to be ineligible
post-randomization (see discussion in Intention-to-Treat
Analysis).
Baseline
Characteristics
- No
p-values for baseline characteristics, and authors state
that there was a slight difference in age and PPI use between
groups. However, upon visual comparison the groups appear
equal for demographics and disease dimensions. Plus, this
is suggestive that randomization was successfully achieved.
Blinding
- Authors
state study is double-blinded, however, no details of blinding
were reported excepting to state that an envelope was used
(again, no mention of opacity) and to state that staff blinded
to the treatment assignments performed subsequent outcome
assessments.
- The
endoscopist opened the envelope to determine assignment
during the endoscopy.
- Of
some concern is that the authors state that that “subsequent
contact between the subject and the endoscopist was minimized…”
[emphasis ours] — See Intention-to-Treat Analysis.
The only outcome that remained statistically significant
through the toughest test was subjective (health-related
quality of life) which could have been influenced if patients
had any direct, indirect or even subliminal idea they had
received RF. More in ITT Analysis…
- Authors
themselves state that they “cannot exclude a possible
bias from subjects guessing their treatment assignments
because of difference in post-treatment symptoms between
the active and sham groups.”
- Maintaining
blinding is highly important even when measures are subjective.
Chalmers TC et al. Bias in Treatment Assignment in Controlled
Clinical Trials. N Engl J Med 1983;309:1358-61. has shown
that results of non-blinded studies may be biased in favor
of the intervention even with objective measures
Concealment
of Allocation
- No
details of concealment reported
Deviations
from Protocol
- 1
patient determined ineligible post-randomization.
Statistical
Analysis
- No
analysis performed by Delfini of statistics used.
Loss
of Data Points
- Significant
loss of data points. However, authors performed sensitivity
analysis using ITT methods including an imputation method
that put the intervention through the toughest test. In
that test, health-related quality of life remained statistically
significant. However, there are no details reported of the
actual outcomes in this analysis — further see comments
that potential subsequent contact with endoscopist could
have affected subjective measurement of patients.
Analysis
Methods Used
- Authors
did not use Intention-to-Treat (ITT) as their primary analysis
method. They performed a completer analysis as the primary
method — and because they had over 5% loss to follow-up,
these results are not dependable.
- Anturane
review by FDA demonstrated that even a 4.4% differential
loss of data points can result in meaningful differences
as the outcome went from a p-value of 0.058 to 0.16 —
which represents a difference of a chance of ~1 in 17 to
~1 in 6 that findings are due to chance.
- The
authors, however, also did a sensitivity analysis that included
a last-observation-carried forward (LOCF) method plus an
extreme case analysis (see below for ITT discussion).
ITT
Analysis & Imputation Methods
- Authors
state that they performed sensitivity analyses on “all
patients” using LOCF and extreme case analysis —
putting the intervention through the toughest test. It would
be preferable to be able to better see the numbers used
and outcomes than is reported in the paper; however, overall
authors have demonstrated sophistication and a great deal
of clarity in reporting this study (including with ITT methods)
— indicating knowledge with key methodologic principles.
- Eligibility
was determined after randomization, further contributing
to patient loss — this, however, equaled one patient,
and was 3.5%. Because actual numbers of what equal “all”
patients were not reported, it is not possible to tell if
this patient was included in the various ITT analyses. However,
this is less than 5% and so probably would not significantly
affect study results.
- Delfini
reviewers are most focused on results of sensitivity analysis
as these were ITT. LOCF seems reasonable for imputing missing
values for this condition and extreme-case scenario reporting
is always welcome as it presents the highest bar.
- Authors
report statistically significant improvement in the primary
outcomes of heart burn symptoms (LOCF and extreme-case)
and quality of life (extreme-case). There were no differences
in any analysis in daily medication use or esophageal acid
exposure times — however, study would not be powered
for these outcomes.
Mechanism
of Action
- Per
editorial, uncertain whether due to reduced gastroesophageal
reflux (which paper suggests not) or due to diminished esophageal
sensitivity.
Clinical
Significance
- This
study does not provide sufficient evidence to conclude that
radiofrequency is effective and safe. Because of the above-mentioned
threats to validity and the patient population, conclusions
regarding symptom relief and improved quality of life should
be drawn cautiously. Further confirmatory studies including
patients who have not responded to H2-blockers and PPIs
would be useful. Undetected long term harms remain a possibility.
Patients should be aware that the medical evidence is sparse
and that uncertainty remains regarding the appropriate role
of radiofrequency therapy in the treatment of gastroesophageal
reflux symptoms.
Safety
- There
were more ADEs reported in the active treatment group than
in the sham group — no p-values are reported. Specific
ADEs were not determined a priori which increases likelihood
that any reported outcomes are due to chance.
- One
active treatment patient with an esophageal ulcer had self-limited
bleeding and had inpatient observation overnight.
- A
patient in the sham group developed pneumonia and received
inpatient antibiotics; the pneumonia was thought to be due
to aspiration during the endoscopcopy
- One
active treatment patient experienced increased bloating;
a nuclear medicine gastric emptying study showed worsening
of a pre-existing delayed gastric emptying disorder.
- Some
active and sham patients experienced temporary post-procedure
retrosternal discomfort that required oral analgesics. These
transient symptoms were not fully quantifiable with the
measurement instruments.
Other
- In
many respects, this is a very well done and very well reported
study. The authors clearly have understanding of research
excellence in design and execution.
- A
study such as this cannot be sufficient to “prove”
a therapy’s efficacy. Proof is not possible with main
issues such as small N in a single study, questions about
maintenance of the blind, plus a need for assurance that
the LOCF procedure was truly done as intention-to-treat
with all randomized patients especially given the subjective
measures of heartburn and HRQOL. Furthermore, even in this
research setting, the reported benefits are minimal and
extrapolating benefits to clinical practice should be done
with caution. Without confirmatory studies, radiofrequency
therapy for GERD symptoms should be considered unproven.
However, for a small study, this is a very thoughtfully-designed,
apparently well-executed study as reported. It is suggestive
that there may be benefits of this procedure. Further research
is clearly needed to determine if this treatment can help
patients avoid more invasive and risky procedures.
|
References
1. Delfini Primer: Problems
with Case Series
2. Delfini Primer: Problems
with the Use of Observational Studies to Draw Cause and Effect Conclusions
About Interventions
3. Anvari, M. Endoscopic treatments for gastro-oesophageal
reflux disease. [Comment] The Lancet. Vol 371. March 22, 2008. 965-966
4. Fry LC, Mönkemüller K, Malfertheiner
P. Systematic review. Endoluminal therapy for gastro-oesophageal
refl ux disease: evidence from clinical trials. Eur J Gastroenterol
Hepatol 2007; 19: 1125–39.
5. Pleskow D, Rothstein R, Lo S, et al. Endoscopic
full-thickness plication for the treatment of GERD: a multicenter
trial. Gastrointest Endosc 2004; 59: 163–71.
6. Noh KW, Loeb DS, Stockland A, Achem SR. Pneumomediastinum
following enteryx injection for the treatment of gastroesophageal
refl ux disease. Am J Gastroenterol 2005; 100: 723–26.
7. Wong RF, Davis TV, Peterson KA. Complications
involving the mediastinum after injection of Enteryx for GERD. Gastrointest
Endosc 2005; 61: 753–56.
8. Tintillier M, Chaput A, Kirch L, Martinet JP,
Pochet JM, Cuvelier C. Esophageal abscess complicating endoscopic
treatment of refractory gastroesophageal refl ux disease by enteryx
injection: a fi rst case report. Am J Gastroenterol 2004; 99: 1856–58.
9. Tuebergen D, Rijcken E, Senninger N. Esophageal
perforation as a complication of EndoCinch endoluminal gastroplication.
Endoscopy 2004; 36: 663–65.
10. Ebell M. An introduction to information mastery,
July 15, 1988. http://www.poems.msu.edu/InfoMastery/default.htm.
Accessed December 21, 2007.
|
| Variations
in Experts' Recommendations
For more than two decades, observers like Jack Wennberg
have pointed out geographic variations in health care practices
and recommendations in the United States. David Eddy has elegantly
demonstrated extreme variation in experts’ recommendations
for health care interventions. And yet, many clinicians rely on
expert opinion as their primary source of information. EBM emphasizes
the need to critically appraise information sources for validity
and clinical relevance as a way of dealing with such variation.
Recently Schaafsma FG, Verbeek JH, Hulshof CT, van
Dijk FJ have reported on how expert opinion differs from information
gathered from an evidence-based approach (Caution required when
relying on a colleague's advice; a comparison between professional
advice and evidence from the literature. BMC Health Serv Res. 2005
Aug 31;5(1):59. PMID: 16131405)
The authors presented 12 occupational medicine cases
to 14 occupational medicine specialists from differing geographical
areas in the Netherlands. The authors also searched the literature
and critically appraised the evidence on the 12 problems. Questions
regarding management were stated so that specialists could answer
with yes/no responses. [Example: For a 36-year old caretaker at
a secondary school with a lateral ankle ligament rupture treated
with tape for three weeks, is it safe to resume work? Evidence-based
answer: Yes]
The authors found that 53% (95% confidence interval
42% to 65%) of all professional advice was not in line with the
research literature. In 18 cases (24%), professional advice explicitly
referred to up-to-date research literature as their source of information.
These cases were substantially less incorrect (17%) than advice
that had not mentioned the literature as a source (65%). The authors
conclude that less than half of the given professional advice by
experts to a practical occupational health problem was in line with
evidence from the research literature.
This article points out the need for caution in
accepting information from “experts” without evaluating
the actual evidence from the published medical literature.
Full
text of this article is available at --
http://www.biomedcentral.com/content/pdf/1472-6963-5-59.pdf
|
| Variations
in Clinicians' Estimates of Pretest Probability
Newer
EBM texts, when addressing the areas of diagnosis and screening,
frequently state that diagnostic tests that produce large changes
from pretest to post-test probabilities are very useful clinically.
The
teaching goes like this: Likelihood ratios can be utilized along
with practitioners’ clinical estimate of the pretest probability
of disease, incorporating the results of the diagnostic tests and
then deriving an individual patient’s probability of having
a disease or condition.
Phelps
and Levitt point out (Acad Emerg Med. 2004 Jun;11(6):692-4.) that
this approach to generating post-test disease probabilities has
never been validated and that the approach may result in very inconsistent
results between clinicians.
In
a cross-sectional cohort study of emergency and internal medicine
residents and faculty, the authors presented clinical vignettes
and asked clinicians to estimate the likelihood of common diseases
based on history and physical exam findings. No lab or imaging results
were provided. They reported that the mean pretest probability estimates
of disease ranged from 42% (95% confidence interval [95% CI] = 36.6%
to 47.4%) to 77% (95% CI = 72.9% to 81.1%). The smallest difference
in pretest probability magnitude for a single vignette was 70% (range
30-100%; interquartile range [IQR] 64-80%), whereas the largest
was 95% (range 3-98%; IQR 30-60%).
Their
conclusions were that there is wide variability in clinicians’
estimates of pretest probability of disease and that post-test disease
estimates are also likely to be inconsistent.
Because
of such wide variability, we prefer to concentrate on more traditional
statistics derived from the classical 2x2 table such as sensitivity,
specificity, positive predictive value and negative predictive value
and skip the hassle of estimating pretest probability and using
nomograms or tables to generate post-test probabilities.
The
abstract of this article can be found at:
Go
to http://www.pubmed.gov and
enter or copy and paste 15175211 in the search
field. |
| Underuse
of Proven Interventions
One
of the main reasons for using valid, relevant evidence in health
care is to more accurately predict outcomes from various interventions
and thus be equipped to make informed choices. For example, it is
very useful to know the evidence for outcomes by risk category for
cardiovascular disease. The higher an individual’s risk for
a cardiovascular event, the greater the benefit from interventions
such as lipid-lowering with statins.
Because
of increased benefits, we would expect patients who are at the highest
baseline risk for CV events to receive the most aggressive lipid-lowering
treatment. Ko et al. (JAMA.2004;291:1864-1870), however, have demonstrated
that the prescription of statins in Canadian patients older than
66 years of age with a history of CV disease or diabetes mellitus
decreased progressively as baseline CV risk increased.
In
their database study:
-
Only 19% of the 75,617 patients received statins for secondary
prevention;
-
In patient 66-74 years of age, the adjusted probabilities for
receiving statin Rx was 37.7% (95% CI 37.3-38.2), 26.7% (26.4-27.1),
and 23.4% (22.8-23.9) in the categories of low, intermediate and
high baseline risk, respectively.
Why
would such a small percentage of elderly patients with CV disease
or diabetes be prescribed lipid-lowering drugs? Why would the likelihood
that physicians prescribe statin therapy be inversely correlated
with CV risk?
Comment:
This is yet one more study demonstrating the huge problem of underuse
of proven interventions in health care. However, this is apparently
the first article pointing out the treatment-risk paradox and is
worth reading. The authors suggest that the treatment risk paradox
in these elderly patients might be explained by:
-
Clinicians not understanding tradeoffs between benefits and harms
of statins—clinicians may believe that the results of RCTs
of lipids cannot be applied to patients with comorbidites on the
grounds that these patients may experience fewer benefits and
greater harms than the study subjects;
-
Physicians may prejudge patients with co-morbidities—they
may believe that such patients will not adhere to lipid-lowering
treatment regimens; and/or,
-
Physicians may be inattentive to preventive care measures in elderly
patients with chronic conditions.
|
| Class
Effect: Caution Urged
Pharmacy
and Therapeutics committees and clinicians are frequently faced
with issues regarding substitution for agents in a specific drug
class. CD Furburg and BM Psaty, using ace-inhibitors (ACEIs) as
an example, make a number of interesting points in an article entitled,
“Should evidence-based proof of drug efficacy be extrapolated
to a “class of agents”? (Circulation 2003;108:2608-2510).
Below is a summary of some key points made by the authors of the
article and also some points made by the editorialists, EM Antman
and JJ Ferguson (Circulation 2003;108:2604-2607).
Generic
substitution is the act of dispensing a different brand
or an unbranded drug product that is the same chemical entity and
meets US Food and Drug Administration criteria for bioequivalence
(e.g., generic lisinopril in place of brandname Prinivil [Merck
& Co.] or Zestril [Zeneca Pharmaceuticals]).
Therapeutic
substitution is the dispensing of an alternate chemical
entity for the original drug prescribed by the physician from the
same general therapeutic class (e.g., a physician orders enoxaparin,
but the hospital pharmacy dispenses dalteparin; a physician writes
a prescription for ramipril, and the pharmacy dispenses moexipril).
The
HOPE trial, using ramipril 10mg, reported the following relative
risk reductions in patients with vascular disease or diabetes and
one additional risk factor:
-
16% reduction over placebo in all cause mortality
-
20% reduction in MI
-
32% reduction in stroke
However,
the evidence is less robust for some of the other ACEIs:
- There
was no decrease in ischemic events in subjects who had undergone
angioplasty and took Quinapril 20mg when compared to placebo over
3 years
-
In the PROGRESS trial, there was no difference in stroke or coronary
events in the Perindopril 4 mg and placebo groups. (However, in
the EUROPA trial using 8 mg, there was a reduction in coronary
mortality and morbidity of 20%.)
-
4 ACEIs have not been shown to reduce mortality or morbidity—2
agents have not been tested and 2 have not shown a reduction in
clinical events.
Does
it make sense to declare all of the ACEIs equivalent? What about
differing doses that have not been tested?
Furburg
and Psaty point out that all 7 drugs have FDA approval for treating
hypertension, heart failure and LV dysfunction. All 7 have been
shown to lower BP and improve hemodynamic measurements. However,
only 4 of these drugs—enalapril, captopril, ramipril and trandolopril—have
been shown to improve long term survival or clinical outcomes compared
to placebo in large, RCTs.
They
point out that:
-
The actions specific to an individual agent may add to, subtract
from or have a neutral effect on the safety and efficacy attributed
to the common class actions of a drug class.
-
Concluding that equipotency exists rests on surrogate measures
of efficacy.
-
Equipotency is difficult to establish in the absence of relevant
outcome data.
- The
term “class effect” has never been defined from a
regulatory perspective.
-
The class effect concept is often overinterpreted to mean that
all agents in a class are interchangeable.
-
Me-too drugs can be produced without the risk and expense of long
term morbidity and mortality trials.
-
Regulatory agencies do not accept extrapolations, and FDA regulations
prevent manufacturers from making claims for unapproved indications.
-
Safety is an issue to consider.
Given
all of the above, the authors state that efforts should be made
to use drugs with the best-documented outcomes and in full doses.
Further, the use of me-too drugs without adequate documentation
should be restricted. Comparability should be established in trials
where two agents in a class are compared.
The
editorialists list the following minimal criteria they would require
for defining a class effect of a drug:
-
A clearly defined biological target or pathway.
-
Comparable efficacy demonstrated for multiple agents within the
class (with multiple randomized, controlled clinical trials for
each agent).
-
Absence of convincing evidence that there is a member of the class
that does not have comparable clinical benefit to that of other
agents within the class.
They
also recommend considering the following before deciding whether
drugs within a class are interchangeable:
- The
absolute and relative degree of benefit for each agent;
- The
clinical circumstances in which benefit has been demonstrated;
- The
extent and the depth of the evidence in favor of a particular
agent;
- The
safety profile and tolerability of agents within a class;
- The
cost of the alternatives, viewed not only in terms of cost per
quality life-year saved versus placebo or an established control,
but also in terms of cost per quality life-year lost in the case
of less expensive, but also less effective, alternatives;
- The
specific details of the inclusion/exclusion criteria and the exact
management protocols used in the individual supporting clinical
trials; and,
- The
particular subgroups that show benefit (or lack of benefit) for
agents within the class.
|
|