CHILDREN'S HOSPITAL ORTHOPAEDIC CLINICAL EFFECTIVENESS RESEARCH GROUP
MININDER S. KOCHER, MD, MPH
DEPARTMENT OF ORTHOPAEDIC SURGERY, CHILDREN'S HOSPITAL

Print Manuscript

Download PDF version

View References
Close window

INTRODUCTION
The evidence-based clinical practice and patient-derived outcomes assessment movements burst onto the scene of clinical medicine in the 1980's and 1990's as a result of contemporaneous technical, societal, and economic influences. Work by Wennberg and colleagues revealed large small-area variations in clinical practice, with some patients thirty times more likely to undergo an operative procedure than other patients with identical symptoms merely because of geographic location. 4-9 Further critical research suggested that up to 40% of some surgical procedures might be inappropriate and that up to 85%of common medical treatments were not rigorously validated. 1-3 Meanwhile, the costs of health care were rapidly rising to over two billion dollars per day, increasing from 5. 2% of the gross domestic product in 1960 to 16. 2%in 1997. As a result of the lack of evidence-based practice and the escalating costs of care, there has been an increased focus on the clinical effectiveness of care.

Clinical epidemiology provides the methodology to assess the clinical effectiveness of care. Epidemiology is the study of the causation and distribution of disease, using statistical and experimental methods. Epidemiologic methods focus on study design, data interpretation, and quantitative methods.

  Figure 1: Children's Hospital Orthopaedic Clinical Effectiveness Research Group

CHOCERG
We are currently developing the Children's Hospital Orthopaedic Clinical Effectiveness Research Group (CHOCERG) . The research group's mission is to improve the effectiveness of management of musculoskeletal conditions in children and adolescents via applied epidemiologic research. The research group's goals are to provide epidemiologic and biostatistical support for pediatric orthopaedic clinical research, to apply advanced epidemiologic methods to pediatric orthopaedic clinical research, and to develop independent fields of inquiry into musculoskeletal conditions affecting public health. Specifically, we are developing study sections in trauma, spinal disorders, hip disorders, upper extremity injuries, and sports medicine. We are developing a clinical research team (Figure 1) and resources to apply advanced
  Figure 2: Diagnostic Performance

epidemiologic methodology including performing randomized clinical trials, prospective cohort studies, and quantitative synthesis methods such as meta-analysis, decision-analysis, and cost-effectiveness analysis. In addition, we intend to pursue fields of inquiry into musculoskeletal conditions affecting public health, including racial variations in the utilization of orthopaedic care, the volume-outcome relationship in orthopaedic care, and the development and validation of patient-derived outcome assessment instruments in pediatric orthopaedics.

EXAMPLES
DIAGNOSTIC PERFORMANCE
The diagnostic performance of an ancillary test involves establishing the sensitivity, specificity, and predictive value of that test (Figure 2) . In a clinical study to evaluate the diagnostic performance of MRI of the knee in children and adolescents, we studied a consecutive series (118 knees in 113 patients) of pediatric patients (<16 years old) treated for intraarticular knee disorders who had an initial clinical diagnosis, followed by a MRI diagnosis, followed by arthroscopic findings. 10 There were 139 clinical lesions, 128 MRI lesions, and 135 arthroscopic lesions. There were no significant differences between clinical exam and MRI with respect to agreement with arthroscopic findings (clinical exam:70%;MRI:74%) , overall sensitivity (clinical exam:71%; MRI:72%) and overall specificity (clinical exam:92%;MRI:94%) . Stratified analysis by diagnosis revealed significant differences only for sensitivity of lateral discoid meniscus and specificity of medial meniscal tears (Figure 3) . MRI had significantly lower sensitivity for younger children (<12 years old:62%;12-16 years old: 78%;p =0. 05) . MRI also demonstrated lower specificity for younger children (<12 years old:90%;12-16 years old: 96%;p =0. 02) . We concluded that selective MRI does not provide enhanced diagnostic utility over clinical exam, particularly in children, and should be used judiciously in cases where the clinical diagnosis is uncertain and MRI input will alter the treatment plan.

Figure 3:Diagnostic Performance of MRI of the Knee in Children and Adolescents Reproduced with permission from the American Journal of Sports Medicine

Diagnosis Sensitivity (%) Specificity (%) Positive Predictive
Value (%)
Negative Predictive
Value (%)
 
Anterior Cruciate Ligament Tear
Clinical
81.3
MRI
75.0
P
0.55
Clinical
90.6
MRI
94.1
P
0.39
Clinical
49.0
MRI
58.6
Clinical
97.8
MRI
97.1
Medial Meniscus Tear 62.1 79.3 0.15 80.7 92.0 0.03* 14.5 34.3 97.6 98.8
Lateral Meniscus Tear 50.0 66.7 0.24 89.2 82.8 0.21 34.0 30.1 94.1 95.7
Osteochondritis Dissecans 77.3 90.9 0.22 97.9 97.9 0.99 66.0 69.5 98.8 99.5
Lateral Discoid Meniscus 88.9 38.9 0.02* 98.0 100.0 0.15   31.0 100.0   99.9    99.4
OVERALL 71.2 72.0 0.89 91.5 93.5 0.26        
*p<0.05                    

MULTIVARIABLE METHODS:REGRESSION
Regression analysis is a method of mathematical modeling used to determine independent associations of multiple independent variables with a binary categorical dependent variable (logistic regression) , a continuous dependent variable (linear regression) , an ordinal dependent variable (ordinal logistic regression) , censored time-to-event data (Cox regression) , or count data (Poisson regression) . Regression is commonly used to predict outcomes, or to establish independent associations, controlling for confounding and colinearity.

Figure 4:Clinical Prediction Algorithm for the Differentiation of Septic Arthritis and Transient Synovitis of the Hip in Children Reproduced with permission from the Journal of Bone and Joint Surgery , Kocher MS, Zurakowski D, Kasser JR. Differentiating between septic arthritis and transient synovitis of the hip in children: An evidence-based clinical prediction algorithm. Journal of Bone and Joint Surgery , 1999, 81A:1662-1670
History
of Fever
Non-Weight
Bearing
Erythrocyte
Sedimentation Rate >
40 (mm/hr)
Serum White Blood
Cell count >12,000
(x1000/mm3)
Predicted Probability
of Septic Arthritis
yes yes yes yes 99.8%
yes yes yes no 97.3%
yes yes no yes 95.2%
yes yes no no 57.8%
yes no yes yes 95.5%
yes no yes no 62.2%
yes no no yes 44.8%
yes no no no 5.3%
no yes yes yes 93.0%
no yes yes no 48.0%
no yes no yes 33.8%
no yes no no 3.4%
no no yes yes 35.3%
no no yes no 3.7%
no no no yes 2.1%
no no no no 1 in 700

We have used logistic regression to develop a clinical prediction rule to differentiate between septic arthritis and transient synovitis of the hip in children. 11 We reviewed children who presented to Children's Hospital from 1979 to 1996 with an acutely irritable hip. Diagnoses of true septic arthritis, presumed septic arthritis, and transient synovitis were explicitly defined based on joint fluid white blood cell count, joint fluid and blood cultures, and clinical course. We found that septic arthritis patients differed significantly (p<0. 05) from transient synovitis patients with regard to ESR, serum WBC count and differential, weight-bearing status, history of fever, temperature, effusion on radiograph, history of chills, history of recent antibiotic use, hematocrit, and gender. Four independent multivariate clinical predictors were identified to differentiate between septic arthritis and transient synovitis: history of fever, non weight-bearing, ESR >40mm/hr, and serum WBC count >12, 000/mm 3 . The predicted probability of septic arthritis was determined for all sixteen combinations of these four predictors (Figure 4) and can be summarized as:<0. 2% for 0 predictors, 3% for 1 predictor, 40% for 2 predictors, 93%for 3 predictors, and 99. 8% for 4 predictors. The chi-square test for trend and the area under the receiver operating characteristic (ROC) curve indicated excellent diagnostic performance of this group of multivariate predictors in identifying septic arthritis.

 

Figure 5: Partial Tears of the ACL in Children and Adolescents

  Figure 6: Decision Tree. Operative versus Nonoperative Management of Acute Achilles Tendon Ruptures:Expected Value Decision Analysis.

We have also used logistic regression to determine independent associations, controlling for confounding. In a prospective cohort study of 9,410 professional skiers who underwent preseason knee screening from 1992-1997, we studied the effect of functional knee bracing on subsequent knee injury in ACL deficient (ACLd) skiers. 12 ACL deficiency was operationally defined as an abnormal Lachman or pivotshift examination with >5mm KT-1000 manual maximum side-to-side difference in a skier without prior ACL surgery in either knee. ACLd skiers self-selected the use of functional knee brace during skiing. Controlling for knee laxity, gender, age, ski occupation, Lachman and pivot-shift grades, we found that absence of bracing remained an independent, multivariate risk factor for knee injury. ACLd skiers without bracing were estimated to be over six times more likely to sustain a subsequent knee injury (OR =6. 6;95%CI:1. 5-31. 7) . Because of the increased risk of subsequent knee injury in nonbraced skiers, we recommend functional bracing for ACL deficient skiers.

  Figure 7: One-Way Sensitivity Analysis:Operative versus Nonoperative Management of Acute Achilles Tendon Ruptures:Expected Value Decision Analysis.

SURVIVORSHIP ANALYSIS
Survivorship analysis is used to analyze time-censored data, which poses unique challenges to univariate methods. We have used survivorship analysis to study the functional outcome of partial ACL injuries in children and adolescents treated without reconstruction. 13 A cohort of 45 skeletally mature and immature patients <17 years old who had an acute hemarthrosis, MRI signal changes within the ACL, an IKDC grade A or B Lachman and pivot-shift examination under anesthesia, an arthroscopically documented partial ACL injury, and minimum 2-year follow-up was studied prospectively. All patients were treated without ACL reconstruction and underwent a structured rehabilitation program. Fourteen patients (31%) underwent subsequent reconstruction. Using survivorship analysis, we found that >50%tear (p=0. 01) (Figure 5) , predominantly posterolateral tear (p=0. 01) , and older chronological age (p=0. 01) were independent, timedependent risk factors for subsequent reconstruction.

DECISION ANALYSIS
Expected-value decision analysis is a methodological tool, based in gaming theory, that allows for the quantitative analysis of decision-making under conditions of uncertainty. The process of expected-value decision analysis involves the creation of a decision tree to structure the decision problem, roll-back analysis to calculate expected-value and determine the optimal decision-making strategy, and sensitivity analysis to determine the effect on decision-making of varying outcome probabilities and utilities. We used expected-value decision analysis to assess to determine the optimal management strategy, operative versus nonoperative, for acute Achilles tendon rupture. 14 Outcome probabilities were determined from a previously published quantitative synthesis of 83 published studies of acute Achilles tendon rupture from 1954-97. Patientderived utility values were obtained from a visual analog scale questionnaire of 76 active males, age 30-50 years old without a history of Achilles tendon injury. Roll-back analysis revealed operative treatment to be the optimal management strategy. (EV:6. 52 vs 6. 28;marginal value:0. 24) (Figure 6) . Threshold values were determined for the probability of rerupture from nonoperative treatment (0. 06) , the probability of a moderate complication from operative treatment (0. 16) , the utility of rerupture (3. 3) (Figure 7) , and the utility of a moderate complication (1. 3) . We concluded that operative management was the optimal management strategy for acute Achilles tendon rupture given the outcome probabilities and patient utilities that we studied. Lowering rates of rerupture from nonoperative management or increasing rates of moderate complications from operative management would favor nonoperative management. Patients with a low utility for rerupture favor operative management, whereas patients with a low utility for complications favor nonoperative management. We advocate a model of doctor-patient shared decision-making in which both outcome probabilities and patient preferences are considered in order to optimize the decision process.

COST-EFFECTIVENESS ANALYSIS
Economic analysis of medical procedures includes costidentification studies which determine costs and cost-evaluation studies which assess both costs and benefits. Cost evaluative study designs include cost-benefit analysis, costeffectiveness analysis, and cost-utility analysis. In cost-benefit analysis, both costs and benefits are analyzed in terms of monetary units. In cost-effectiveness analysis, a ratio is determined of cost per health effect. In cost-utility analysis, this health effect is measured as quality-adjusted life years (QALY's) . We studied the cost and effectiveness of routine pathological examination of surgical specimens from primary total hip and knee replacement for osteoarthritis by comparing clinical and pathological diagnoses in 1,234 consecutive cases of primary THR (n=471) and TKR (n=763) performed between 1992 and 1995 at the Brigham and Women's Hospital for the clinical diagnosis of osteoarthritis. 15 We found a low prevalence of discrepant (2. 3%) and discordant (0. 1%) diagnoses. The cost per discrepant diagnosis was $4, 383 and the cost per discordant diagnosis was $122, 728. We concluded that routine pathological examination of surgical specimens from primary THR and TKR for the clinical diagnosis of osteoarthritis had limited cost-effectiveness at our hospital due to the low prevalence of findings that altered patient management.

  Figure 8: Randomized Clinical Trial study schema

OUTCOMES ASSESSMENT
Health care outcomes assessment measures include generic measures, condition-specific measures, and measures of patient satisfaction. The legitimacy and importance of patient satisfaction as an outcome measure has burgeoned over the past decade because of its use for a variety of purposes such as indicating quality of care, assessing health care delivery, developing patient care models, impacting general health status, and allowing for continuous quality improvement. Furthermore, as the paradigm for health care has shifted toward a market model, patient satisfaction has become an outcome measure with great clinical and economic implications. We studied the determinants of patient satisfaction with outcome after ACL reconstruction in a cohort of 202 patients with minimum two year follow-up and complete demographic, surgical, subjective, and objective data. 16 We found that some specific objective variables were associated with satisfaction, however subjective variables of symptoms and function had the most robust associations with patient satisfaction. The seven independent multivariate determinants (adjusted R 2 = 0. 83, p<0. 001) of patient satisfaction included Lysholm score, overall subjective knee function, IKDC range-of-motion subscale, patella tenderness, full giving-way, flexion contracture, and swelling.

STUDY DESIGN
Associations between exposures and outcomes can be spurious due to chance, confounding, and bias. In hypothesis testing, the probability of a finding an association in a study by chance when, in reality, there is no association (type I error) is estimated by alpha, which is the p-value. The probability of finding no association in a study when, in reality, there is an association (type II error) is estimated by beta, which is 1 -power. Bias is nonrandom systematic error in design or conduct of a study. There are many forms of bias including selection bias, nonresponder bias, state of health bias, detection bias, performance bias, recall bias, acceptability bias, and publishing bias. A confounding variable has associations with both the exposure and the outcome and thus may distort their relationship. Frequent confounders include age, gender, socioeconomic status, and comorbidities.

Studies may be performed prospectively or retrospectively. Study designs may be observational, such as case reports, retrospective case series, case-control studies, prospective cohort studies, and cross-sectional studies, or they may be experimental, such as randomized clinical trials. Bias and confounding can be minimized through study design, with methods such as randomization, blinding, stratification, and striving for similar follow-up and assessment of groups.

The randomized clinical trial (RCT) is the highest level of clinical evidence because it minimizes bias and confounding. All variables between the groups are theoretically distributed equivalently, except the allocated study variable, because of randomization. Disadvantages of RCT's include difficulty, expense, length of time for study, ethical considerations, lack of acceptance by patients and clinicians, and lack of external generalizability. The design of a randomized clinical trial involves establishing explicit inclusion criteria, pre-hoc sample size calculation, patient accrual, blinded outcome assessment, interim analysis with stopping rules, and intent to treat analysis.

Mininder S. Kocher, MD, MPH is an Instructor in Orthopaedic Surgery at Harvard Medical School, Program in Clinical Effectiveness at Harvard School of Public Health, and Director of Children's Hospital Orthopaedic Clinical Effectiveness Research Group.

Address correspondence to:

Mininder S. Kocher, MD, MPH
Department of Orthopaedic Surgery
The Children's Hospital
300 Longwood Avenue
Boston, MA 02115

Print Manuscript   •  View References  •  Download PDF version   •  Close window