Evidence-Based Critical Care

Published on 22/03/2015 by admin

Last modified 22/04/2025

Print this page

This article have been viewed 1567 times

227 Evidence-Based Critical Care

Mary E. Hartman, John A. Kellum, Derek C. Angus

The practice of critical care, like all fields of medicine, is changing constantly, and the pace of change is ever increasing. Among the many forces for change, the rapid increase in information is one of the most important. Although the majority of practitioners do not engage in research themselves, they are consumers of research information and must therefore understand how research is conducted to apply this information to their patients. Fellowship programs in critical care medicine emphasize education in this area to varying degrees. The traditional approach has been to require fellows to actively participate in a research project, either clinical or basic science. However, there has also been a growing interest in instructing fellows in the methods of clinical epidemiology.¹ The practical application of clinical epidemiology is evidence-based medicine (EBM), which Sackett defines as “the conscientious and judicious use of current best evidence in making decisions about the care of individual patients.”² The clinical practice of EBM involves integrating this evidence with individual physician expertise and patient preferences so informed, thoughtful medical decisions are made.³ In this chapter we present the methodology of EBM and its application in critical care medicine.

Asking a Question

The first step in practicing EBM is asking a well-constructed clinical question. To benefit the patient and aid the clinician, clinical questions must be both directly relevant to patients’ problems and constructed in a way that guides an efficient literature search to relevant and precise answers. The Centre for Evidence Based Medicine (CEBM) in Oxford, England, provides an excellent description of the four essential elements of an EBM question, summarized in Table 227-1.

TABLE 227-1 Four Essential Elements of a Well-Constructed Clinical Question

Developing a specific, thoughtful question leads to a much more efficient search for the answer. Search results themselves can be used to further refine a question. For example, too many results may indicate the question is too broad, and too few results often necessitate a broader description of the patient population, intervention, or outcome.

Types of Evidence

After the question is formulated, one must consider the type of question being asked. Different types of studies, based on their size, design, and methodology, provide evidence of differing quality and relevance to a research question. For example, is the question about therapy, prevention, etiology, or harm? A randomized controlled trial (RCT) or (better yet) systematic review of RCTs will provide the best evidence for this kind of question. Is the investigator interested in the prevalence of a specific disease or symptom in the general population? If so, a large cohort study will best answer this question.

Randomized clinical trials, also referred to as experimental or interventional studies, are the cornerstones of medical evidence. Physicians place considerable faith in the results of randomized control trials.^4,⁵ This faith is placed with good reason, as randomization remains perhaps the best solution to avoid misinterpreting the effect of a therapy in the presence of confounding variables.⁶ When participants are randomly allocated to groups, factors other than the variable of interest (e.g., a new therapy for sepsis) that are likely to affect the outcome of interest are usually distributed equally to both groups. For example, with randomization, the number of patients with underlying comorbidity that may adversely affect outcome should be similar in each study arm, presuming sample size is appropriate. A special advantage of randomization is that this equal distribution will occur for all variables (excluding the intervention) whether these variables are identified by the researcher or not, thus maximizing the ability to determine the effect of the intervention.

However, RCTs are expensive, difficult, and sometimes unethical to conduct, with the consequence that less than 20% of clinical practice is based on the results of RCTs.⁷ Moreover, many important questions such as determining the optimal timing of a new therapy or determining the effects of health care practices cannot practically be studied by RCTs.

Observational Studies

The principal alternative approach to the RCT involves observation rather than experimentation. Prior experience has biased us to favor RCTs, but partly in response to the increasing need to answer questions unanswerable by the RCT, the design and execution of observational outcomes studies have become much more sophisticated.

Observational outcomes studies are very powerful tools for addressing many questions that RCTs cannot address, including measuring the effect of harmful substances (e.g., smoking and other carcinogens), organizational structures (e.g., payer status, open versus closed ICUs), or geography (e.g., rural versus urban access to health care). Because of their cost and the regulatory demands on drug and device manufacturers, RCTs are frequently designed as efficacy studies in highly defined patient populations with experienced providers and therefore provide little evidence about effectiveness in the “real” world.⁸ Alternatively, observational studies can generate hypotheses about the effectiveness of treatments that can be tested using other research methods.⁸ Investigators have also explored the effects of different therapies that are already accepted but used variably in clinical practice.⁹

There are a number of different kinds of observational studies, each designed to address a different type of clinical question. These include case-control, cross-sectional surveys, and cohort studies. Case-control studies compare a group of patients with a disease or symptom of interest to a selected control group. They have the advantage of being quick and relatively inexpensive to perform and are often the only feasible study method for very rare disorders or when the lag time between an exposure and the related disease is very long. They can also be conducted with a relatively small number of patients. Cross-sectional studies provide a snapshot of a population at one point in time. They can also be conducted inexpensively and in a short time. Cohort studies prospectively identify an at-risk group (the inception cohort) and follow them through time, recording exposures and development (or not) of the disease under investigation. Cohort studies have a number of strengths, including the ability to match subjects to controls for some confounders, establish the timing and sequence of events, and standardize eligibility criteria and outcome assessments; they are easier and less expensive to conduct than RCTs.

However, observational studies have several significant limitations. First, the data source must be considered. Observational outcomes studies are often performed on large data sets wherein the data were collected for purposes other than research. This can lead to error owing to either a lack of pertinent information or bias in the information recorded.¹⁰ Second, one must consider how the authors attempt to control for confounding. The measured effect size of a variable on outcome (e.g., the effect of the pulmonary artery catheter on mortality rate) can be confounded by the distribution of other known and unknown variables. More specifically, case-control studies are subject to recall and selection bias, and the selection of an appropriate control group can be difficult. Cross-sectional studies can only establish association (at most), not causality, and are also subject to recall bias. Cohort studies have a number of limitations, including difficulty in finding appropriate controls and difficulty determining whether the exposure being studied is linked to a hidden confounder, and the requirement of large sample size or long follow-up to sufficiently answer a research question can be timely and expensive.

Case Reports or Case Series

The last form of primary research is the case report or case series. A case is a published account of a single or small number of patients and their response to a particular therapeutic intervention. The inability to generalize from a case report makes it the weakest form of clinical evidence available. However, case reports may be the only available or practical information in support of a therapeutic strategy, especially in the case of rare diseases when the evolution of the therapy predates the common use of randomized study designs in medical practice. This is also true for new therapies that have not yet been tested in clinical trials.

Summaries of Primary Research

Another valuable source of information, especially for the busy clinician with limited time for reading and research, is primary research that has already been summarized and evaluated. There are a number of high-quality, peer-reviewed sources of summary information, including those that summarize the results of individual trials and those that combine and summarize the results of multiple trials addressing the same topic. The following is a description of the most common types of literature summaries.

Single-Study Results—Critically Appraised Topics

Determining which studies provide information useful in the care of patients is largely a question of deciding whether a study is valid and, if so, can its results be applied to the patients in question. One format for appraising individual studies is the critically appraised topic (CAT) format that has been popularized as part of EBM. The purpose of the CAT is to evaluate a given study or set of studies using a standardized approach. Studies that address diagnosis, prognosis, etiology, therapy, and cost-effectiveness all have a separate CAT format.³ An example is shown in Box 227-1 for studies that address therapy. The CAT format for studies on therapy asks several questions intended to address the issues of validity and clinical utility. Studies that fail to achieve these measures are not generally useful, although studies do not necessarily have to fulfill every criterion, depending on the nature of the topic. For example, a study that examined the effect of walking once a day for the prevention of stroke would not be expected to include a detailed examination of side effects or a cost-effectiveness analysis. However, a study comparing streptokinase to placebo for treatment of stroke would likely be required to include a detailed examination of side effects and a cost-effectiveness analysis because of the excessive risks and costs associated with such therapy. Similarly, blinding may not always be possible, and the effects of the investigators being unblinded can be minimized by separating them from the clinicians making the treatment decisions or by establishing standard treatment protocols that are applied equally to both the study and control groups. Alternatively, a study would be “fatally flawed” if it failed in terms of randomization or was not analyzed as “intention to treat.” There are a number of other useful tools for assessing study design and for quantifying effect size and cost-effectiveness. In general, these are the tools of epidemiology and biostatistics, and their discussion is beyond the scope of this chapter. A basic primer and glossary of terms is included in Table 227-2.

Box 227-1

Critical Appraisal of the Literature

Are Results Clinically Useful?

• How large was the treatment effect?

• How precise was the estimate of the treatment effect?

• Are the patients similar to the “norm”?

• Were all clinically important outcomes considered?

• Was a cost-benefit analysis performed?

Adapted from Sackett DL, Straus SE, Richardson WS et al. Evidence-based medicine: how to practice and teach EBM. London: Harcourt; 2000.

TABLE 227-2 Definitions and Equations

Systematic Reviews of Multiple Studies

A systematic literature review combines the results of multiple studies through the systematic search, assembly, and appraisal of existing primary research on a given subject. Meta-analysis is a type of systematic review that incorporates a quantitative summary of the data, which combines actual data from several small although high-quality studies. Criteria for reviews to be systematic as opposed to narrative (see later) are quite explicit. All systematic reviews should start with a four-part (three-part when applicable) question, as described previously. Both the search criteria and inclusion and exclusion criteria should be predefined. The review should combine only RCTs or discuss how and why it is combining different types of evidence. Additionally, the methods section should provide search terms and key words, thus establishing some degree of reproducibility.

The advantages of systematic reviews are that by pooling many studies, the power to find a true effect is increased. This is particularly important when many well-done but small and inconclusive studies have attempted to answer a particular question. Systematic reviews often represent an exhaustive effort to find all related information in a given area. In this regard, they provide an excellent summary of the literature up to the date of the review.

The disadvantage of systematic reviews is that they are only as good as the studies they include and can only be interpreted if all the criteria just mentioned have been met. Unfortunately, there is considerable variability in the quality and comprehensiveness of available systematic reviews. Much of this dilemma stems from a lack of commonly accepted methodology for conducting and writing systematic reviews. For example, there are no standard exclusion criteria for studies in systematic reviews. Each author establishes the criteria, which the reader must assess to determine the quality and utility of the review to answer his clinical question. In addition, there is publication bias. Popular search techniques to identify studies are inherently limited by the fact that unpublished studies are unaccounted for in any review. Issues such as these have led authors to propose the development and maintenance of study registries where all RCTs are registered irrespective of their publication status.¹¹ This would enable review of smaller studies and those studies published in journals not listed in cumulative Index Medicus, MEDLINE, and other popular databases in systematic reviews.

Narrative Reviews of Multiple Studies

The most common system of non–peer-reviewed pooling of study results is the familiar “review” article or collection of reviews. This textbook is an example of the latter. Articles or chapters combine information from several primary articles, sometimes a few hundred, in a way that is digestible by the average reader. Reviews may be focused on recent advances, or they may provide a complete tutorial on a given subject. In either case, in the traditional method known as the narrative review, the methodology is the same: an author, presumably someone knowledgeable of the subject matter, reviews the existing literature in some way, formulates an opinion, and disseminates this opinion along with references to support each argument. This approach is also used in the discussion section of most original articles, in which the authors attempt to discuss their findings in the context of the existing literature.

The advantage of narrative reviews is that they provide a detailed qualitative discussion, usually by an expert with years of experience. However, they do have several limitations. The most important of these is that evidence used to support the author’s positions is not collected, evaluated, and compared in an organized and reproducible manner. That information is complete or that it is judged in an unbiased manner cannot be assured. Journal articles are often peer reviewed, which provides some limited oversight for completeness and lack of bias, but this is far from perfect. Furthermore, review articles and textbook chapters are not generally subject to vigorous review and therefore may be the least reliable sources of information, particularly current information. For example, by 1988, fifteen studies had been reported on the use of prophylactic lidocaine in acute myocardial infarction. While no single study was definitive, pooled data from the nearly 9000 patients showed that the practice was useless at best. Nonetheless, by 1990 there were still more recommendations for its use than against it appearing in textbooks and review articles.¹²

Appraising Evidence

For a piece of evidence to be useful, it has to be valid, have clinically important findings, and be applicable to the particular patient. Guides for assessment of validity, like that shown in Box 227-1, exist for different types of studies (e.g., therapy, diagnosis, prognosis) and are presented in detail in Evidence-Based Medicine.¹³ Worksheets to determine whether a study is valid are also available from a number of sources including the Centre for Evidence Based Medicine (www.cebm.net). The importance of findings again depends on the type of study. For studies on therapy, the clinician must decide if there was a true treatment effect and, if so, how large an effect. For studies on diagnosis, the characteristics of a test must be presented, and the clinician must decide if the test characteristics (sensitivity, specificity, positive and negative predictive values) would make the same test useful for current patients. Again, a number of guides exist to help physicians make these decisions. In the last few years, the GRADE (Grades of Recommendation Assessment, Development and Evaluation) Workgroup has proposed a mythology for evidence appraisal that has been widely adopted.¹⁴ Table 227-3 summarizes the GRADE System. High-grade evidence should, in theory at least, be adopted into clinical practice and forms the basis of guidelines, whereas a more nuanced approach is needed for lesser-quality evidence.

TABLE 227-3 Grade System for Grading Quality of Evidence

Applying Evidence

The strongest evidence available remains useless until it is effectively applied. Application of EBM can occur directly at the patient level or be implemented on a larger scale through guidelines and protocols. Although bedside decision making has been the traditional focus of EBM, guidelines and protocols are important means to promote the standardization of care at an institutional or regional level.

Bedside Decision Making

The goal of EBM is to facilitate bedside decision making by placing evidence in the context of clinical judgment and the preferences of the patient.¹⁵ There is often sufficient medical evidence to influence a number of daily decisions. Therefore, the clinician should always ask, “Is my patient receiving the best level of care as indicated by the evidence in the literature? Are there any study protocols or results that could be applied to this patient that currently are not?” Clinicians should also recognize knowledge deficits and be alert for opportunities to formulate EBM questions during daily rounds or routine patient care. Once the evidence is found and deemed useful, it must be judiciously applied. Clinicians must use their knowledge and experience to understand how to apply the results of studies to individual patients. Some cases, owing to patient- or environment-specific circumstances, may be sufficiently unique to render even good evidence inappropriate. Individual patient or family values and expectations could also direct therapy in one direction when medical evidence and physician judgment would have led it in another.

Guidelines and Protocols

Perhaps a natural extension of EBM is the desire to standardize care when evidence can be found for treatments or diagnostic procedures that are cost-effective. When such therapeutic or diagnostic strategies exist, they should be widely applied. A convenient way to ensure this is to develop a protocol or guideline. Protocols and guidelines are especially useful for common illnesses and procedures and have the advantage of allowing an institution to implement EBM even in the presence of physician lack of expertise in EBM. However, developing and maintaining protocols and guidelines is extremely labor intensive because the EBM criteria for guideline validity are explicit. Sackett states, “We should think of [a guideline] as having two distinct components: first the evidence summary, and second, the detailed instructions for applying that evidence to our patient.”¹³ The evidence summary consists of a recent review of the literature both for and against the guideline.³ The applicability of the guideline in each clinical situation with particular patient and institutional characteristics is assessed in the same manner as other evidence.

Problems with Evidence-Based Medicine in Critical Care

Although EBM faces challenges when applied in many fields, there are some unique challenges for its implementation in critical care medicine. These include difficulty in collecting high-quality evidence on which physicians can base decisions, difficulty in determining what to do when there is a general lack of evidence, and difficulty applying evidence to patient care.

Generating Evidence

It is impossible to practice EBM without a body of evidence in the literature. Until recently, there was little strong evidence supporting particular care paradigms in the critically ill. There are now a large number of studies guiding a wide set of critical care problems,¹⁶^–²¹ whereas other elements of care remain largely empirical. Why has our field had such difficulty conducting clinical trials? There are a number of reasons. First, critical illness occurs in a heterogeneous group of patients in whom treatment effects may be small. Narrow selection criteria may introduce bias, and smaller sample sizes may not show an effect. Second, investigators must ensure the novel therapy is tested against “current best methods of care.” Since a study will be interpreted in the light of likely treatment patterns at the completion of a trial rather than the initiation, recent strong evidence should be promoted in both arms of a trial. But the large number of recent critical care trials combined with the financial and practical difficulties of implementing all of the changes has made “current best methods of care” an evolving process that remains a constantly moving target. Third, the choice of appropriate outcomes continues to be debated in critical care. The historic choice of 28-day (or 30-day) mortality rate, which has been used as the primary outcome in most critical care trials,²² has been criticized as arbitrary and incomplete. There is growing recognition that clinical research has to define and focus on the outcomes most meaningful to patients and society, including quality of life, functional status, freedom from pain and other symptoms, and satisfaction with medical care.⁸

Reporting Results

Another threat to the validity of EBM is the accessibility to evidence as a function of study results reporting. Randomized trials can yield biased results if they lack methodological rigor, and it may be difficult to determine their flaws if they are not reported accurately. Unfortunately, authors of many trial reports neglect to provide lucid and complete descriptions of critical information needed to judge the methodological rigor and hence the validity of the results. In response to this problem, a series of Consolidated Standards of Reporting Trials (CONSORT) statements were published beginning in 1996.²³ Most recently, these statements have been updated with CONSORT 2010.²⁴ Figure 227-1 shows a flow diagram for reporting information on research subjects in a parallel randomized trial of two groups. CONSORT 2010 also provides a 25-point checklist for information to include when reporting a randomized trial. The hope is that by improving and standardizing trial reporting, evidence appraisal will be more objective and overall evidence quality will improve.

Figure 227-1 Flow diagram of progress through phases of a parallel randomized trial of two groups (i.e., enrollment, intervention allocation, follow-up, and data analysis).

(From Schulz KF, Altman DG, Moher D; for the CONSORT Group. CONSORT 2010 statement: updated guidelines for reporting parallel group randomized trials. Ann Intern Med 2010;152:726-32.)

Practicing When Evidence Is Lacking

Although the application of EBM has produced very useful information to guide therapy ²⁵ and further research,²⁶ it has also generated considerable controversy.²⁷^–²⁹ The disagreement is not over recommendation of practices based on sound evidence, but instead whether these practices should be avoided when evidence is lacking. Thus clinicians are weary of being told they and their patients cannot pursue diagnostic and therapeutic choices because there is no evidence these practices work. In this regard, it is important to note one of the basic principles of EBM: “not finding an effect is not the same as finding no effect.” Stated differently, the lack of evidence that something works is not evidence that it does not work. This issue is particularly relevant to critical illness where, by definition, patients are seriously ill and often do not respond to therapy. Should treatment that is possibly effective be withheld from patients with otherwise lethal conditions on grounds that it is unproved?

For new therapies, there are already evidence-based standards in place for evaluation and approval.³⁰ However, numerous therapies are in use in the ICU today without proven efficacy, and many others, for which there may be proof in one patient population, are being prescribed in another. Unfortunately, there may be significant barriers to obtaining evidence for these practices. For example, funding agencies and corporations may be unwilling to study therapies that are no longer patented. Furthermore, placebo-controlled studies are often impossible to conduct because clinicians find it unethical to withhold “standard” therapies. Efforts to use “lack of evidence” to justify withholding these therapies should be tempered by these and the following considerations:

1 Are alternatives available that are proven to be effective?

2 Is there evidence that the treatment or procedure is potentially harmful?

3 What is the natural history of the disease being treated?

4 In the case of prophylaxis, what is the risk of developing disease?

5 What is the cost of treatment as well as not treating?

Clinicians routinely grapple with these issues even for therapies that have proven to be effective. The risk-benefit ratio for any therapy is patient specific, and the clinician must judge the probability for benefit or harm to each individual patient. Evidence-based guidelines can be useful in helping clinicians and patients make these decisions, but they cannot take the place of clinical judgment. Treatments that are proven to be useless or even harmful should be avoided unless compelling evidence exists for their use in a specific patient. However, restrictions on existing therapy on the grounds that this therapy is unproven must be developed with great caution.

Barriers to Applying Evidence

When we see patients, we are responsible for applying EBM in the management of their clinical problems. But even when the evidence is strong and the patient is in agreement with the plan, powerful impediments often bar our way.³¹ As has been experienced by other specialties, critical care medicine has many logistical barriers to implementing EBM at the level of the clinician or the institution and regionally/nationally.

At the level of the clinician, there are a number of potential barriers. First, each step of EBM practice is difficult. For example, generating specific, patient-centered questions is difficult when patients suffer from poorly defined conditions with unknown underlying pathophysiology and uncertain outcome. Because of the relative paucity of available evidence, searching for the right article can be something akin to searching for a needle in a haystack. Second, there are time pressures in clinical practice. This is true both for finding and appraising evidence and for developing the skills of practicing EBM. Whereas this may be true in some circumstances, many non-urgent decisions (e.g., when to restart feeds, ventilator weaning, ulcer prophylaxis) are made in ICUs every day. All such decisions could benefit from thoughtful consideration of the evidence. And after an emergent situation, the clinician (in training) could identify any questions about the course of action taken and review later what evidence exists in such a circumstance. Fortunately, electronic databases are increasingly making this concern less of an issue. Last, clinicians are largely responsible for implementing EBM on their own initiative. If they lack the skills or confidence to apply best evidence to their patient care, nothing will change.

At the institutional level, commitment to practicing EBM requires both philosophical and financial support. Regular revision of institutional guidelines and protocols is time consuming to conduct, expensive to implement, and full implementation of best care practices may require changes in the array of clinical services available to patients. Purchase of new medical technology or establishing new clinical units can be very expensive. In the current healthcare environment, many hospitals are likely to be hesitant about such expenses, especially if the evidence is relatively young and the practice not firmly established.

Regionally and/or nationally, implementing EBM requires enormous resources in the effort to continually educate physicians and insurers and to update policies. State and federal governments have to consider medical education requirements and how compliance with policies and guidelines will be defined and enforced. Effective strategies to communicate policy changes and updates in guidelines must also be developed. Because regional and national systems are responsible for socially and geographically diverse healthcare environments, the aforementioned issues must be adaptable to local needs and circumstances.

Although the obstacles are significant, we are not without resources to overcome them. Paralleling the evolution of EBM has been research into how evidence can be implemented into practice—so-called implementation research.³² Understanding how individuals and institutions absorb evidence and implement change has, in select cases, translated into fundamental improvements in health care. We have also learned much about barriers to research transfer through our failed attempts to modify behavior. Success in modifying a discrete aspect of medical practice has invariably been achieved through multidisciplinary strategies that meld concepts and techniques from epidemiology, education, marketing, psychology, sociology, and economics.