|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
MEDICAL TECHNOLOGYIs Technological Change In Medicine Always Worth It? The Case Of Acute Myocardial Infarction
We examine Medicare costs and survival gains for acute myocardial infarction (AMI) during 19862002. As David Cutler and Mark McClellan did in earlier work, we find that overall gains in post-AMI survival more than justified the increases in costs during this period. Since 1996, however, survival gains have stagnated, while spending has continued to increase. We also consider changes in spending and outcomes at the regional level. Regions experiencing the largest spending gains were not those realizing the greatest improvements in survival. Factors yielding the greatest benefits to health were not the factors that drove up costs, and vice versa.
IN A SERIES OF IMPORTANT PAPERS, David Cutler, Mark McClellan, and their colleagues argue persuasively that the benefits from many technological innovations more than justify the rising costs of health care.1 Although they recognize the presence of waste in the U.S. health care system, the policy message is clear: Dont kill the golden goose of future medical advances in pursuit of shortsighted cost savings. Cutler and McClellan summarized research for five diseases, but the most striking evidence came from the rapid decline in mortality following heart attacks or acute myocardial infarction (AMI). Briefly, they found that between 1984 and 1998, the costs of treating heart attacks rose by $10,000 in real terms, but life expectancy increased by about one year. In short, technological innovations in the treatment of cardiac disease provided terrific value for the dollar; in this case, the rising costs were "worth it."2 In contrast to the studies by these authors and others studying the benefits of technological advances in health care, another set of studies calls into question the assumption that spending more on medical care leads inexorably to improved outcomes.3 These have focused on differences in costs and outcomes across regions.4 Elliott Fisher and his colleagues, for example, found that Medicare patients in regions with higher health care spending levels do not experience better health outcomes, nor do they gain better access to care or report greater satisfaction. More recently, a study using state-level data demonstrated a negative association between quality of care and health care spending.5 This paper strives to reconcile these two seemingly contradictory sets of findings by returning to the original analysis on AMI mortality and spending by Cutler, McClellan, and their coauthors, but with an expanded data set that stretches from 1986 to 2003. We first ask whether the survival gains observed in the earlier data have continued into the current century: Have recent increases in health care costs still been worth it? We then ask whether regions experiencing the most rapid improvements in health outcomes were also those experiencing the most rapid increases in costs. Were there specific regional characteristics, such as early adoption of low-cost, highly effective treatments such as beta-blockers, aspirin, and reperfusion, or a reliance on care by multiple medical specialists, that might have been associated with unusually rapid survival gains or with unusually low spending increases? In the final section we suggest a simple framework that reconciles the two prevailing views of health care technology.
Data. Medicare Part A hospital claims data from 19862003 were merged with the Medicare Denominator File through 2003 to create a longitudinal cohort of fee-for-service (FFS) enrollees age sixty-five and older coded with having a new AMI.6 During 19861991 the sample of Part A data was 20 percent of the FFS Medicare population, rising to 100 percent since 1992. Patients with a code of "old MI," or those identified from the panel data as having had an AMI previously, were excluded from the sample.7 Overall, there were 2,872,050 valid AMI events. For pedagogical reasons, we find it helpful to present our results in terms of survival, or the percentage of AMI patients surviving to one year following their AMI. Spending data are available only through the end of 2003, so one-year survival and spending data are analyzed from 1986 to 2002. Methods. The primary analysis followed that of Cutler and McClellan by using just the Medicare Part A hospital expenditures, correcting for inflation using the U.S. implicit price deflator and expressing all results in 2003 dollars.8 Both spending and survival rates were determined for the same one-year horizon. We also performed secondary analysis using a smaller sample of Part B spending data: a 5 percent sample for 19931997 and a 20 percent sample for 19982002. To adjust for both secular and cross-sectional differences in health status, we adjusted survival rates and spending for a variety of comorbidities (diabetes, diabetes with complications, pulmonary disease, liver disease, liver disease with complications, dementia, nonmetastatic cancer, metastatic cancer). Also included were age-sex-race effects consisting of five age categories (6569, 7074, 7579, 8084, and 85+) interacted with sex and with two race variables (black and nonblack), and the type of MI (inferior, anterior, subendocardial, and other). The regression analysis explains survival and spending as a linear function of demographic variables, comorbidities, type of AMI, and year categorical variables.9 All estimated survival and spending measures are expressed in terms of the representative patient with average characteristics during the entire period of analysis using the ADJUST command in STATA version 9. Thus, the regression adjusts for changes over time in the severity of the disease, demographic changes in the Medicare population, and general increases in price levels. The regional unit of analysis is the Hospital Referral Region (HRR), which was constructed for the Dartmouth Atlas of Health Care to reflect Medicare patients actual hospital migration patterns for tertiary care.10 There are 306 HRRs in the United States, and each must include at least one hospital that performs cardiac surgery and neurosurgery. Each U.S. ZIP code is assigned to an HRR depending on the hospital at which the majority (or in some cases, the plurality) of Medicare enrollees seek their hospital care, so the HRR may cross county or state boundaries. Individuals were assigned to HRRs depending on their ZIP code of residence, not whether they were actually admitted to hospitals in those HRRs. Region-specific measures of annual survival rates and cost measures were constructed from the estimated linear regressions mentioned previously that control for comorbidities and demographics. We interpreted these region-year-specific measures as the risk-adjusted survival rate and spending for the representative Medicare AMI patient in that region and year. Regions will differ both with regard to their initial adjusted survival and spending and with respect to changes over time in these variables, but our approach will, as far as possible, ensure that the results reflect regional practice patterns rather than regional differences in patient characteristics. We used these adjusted measures in both the cross-sectional analysis (using just 2002 data) and the longitudinal analysis that examines changes over time (19862002). There are a variety of approaches to treating patients with AMI, and as we show below, regions differed dramatically in their adoption of treatment strategies as well as their reliance upon multiple physicians per patient. We hypothesized that regional differences in the diffusion of new treatment strategies will be associated with survival gains and spending increases during 19862002. Note that we conducted our hypothesis tests at the regional level rather than at the individual patient level. Unobservable aspects of specific patients will lead to unmeasured confounding factors and resulting biases. Differences across regions, on the other hand, are small with regard to the average severity of heart attacks, but large with regard to treatment strategy. In the regression analysis, we focused on two region-level dimensions of care. The first was an index of low-cost, highly effective treatments for AMI: aspirin at discharge, beta-blockers at discharge, and reperfusion within twelve hours of admission (whether surgical reperfusion or thrombolysis). Aspirin reduces platelet aggregation and is known to reduce the risk of mortality following AMI.11 Beta-blockers are an inexpensive drug that by blocking the beta-adrenergic receptors reduces the demands upon the heart; they have been known since the mid-1980s to be effective in reducing post-AMI mortality by 25 percent or more.12 Compliance in the use of beta-blockers has lagged among many regions, even as late as 200001.13 Reperfusion encompasses either thrombolytics"clot-busting" drugs designed to improve blood flow in the blocked arteriesor percutaneous transluminal coronary angioplasty (PTCA) within twelve hours of the AMI, again with well-established reductions in the risk of death.14 Measures were the percentage of patients in each region deemed ideal for treatment who actually did receive treatment, and were based on chart reviews from the Cooperative Cardiovascular Project (CCP) survey and reported in the Dartmouth Atlas of Cardiovascular Health Care.15 These data were available only for 199495, but these were years marked by a remarkable divergence in the adoption of these treatments. For example, beta-blocker prescription at discharge among ideal heart attack patients was 20 percent in San Antonio, Texas; 42 percent in Orange County, California; and 82 percent in Rochester, New York. Thus, 199495 data allowed us to identify early and laggard adopters.16 Regional quality was determined by the number of quality measures for which that region was above the national median. The quality measures range from 1 (below median for all three measures) to 4 (above median for all three measures). Our second dimension of care was the average number of different physicians treating the patient within one year following the AMI, averaged across all patients in the HRR in 199495. This both measured the degree of reliance on specialists and provided a marker for continuity of care. There are potential gains from the specialization of medical knowledge, but there are also "network" costs associated with the larger number of interactions necessary among physicians who are coordinating a patients care.17 For example, when four physicians are treating a given patient, there are [4 x 3] / 2 = 6 possible interactions (whether communication among physicians or potential for drug-regimen interactions). With eight physicians, the number of potential interactions rises to twenty-eighta 367 percent increase. The potential adverse effects are illustrated in a recent Newsweek column written by the wife of a severely ill man: The cardiologist told me that Doug was doing reasonably well, and I naïvely took solace in this mild pronouncement. That is, until a lung specialist zipped into the room, put his stethoscope to Dougs chest and said, "Hes not getting any better. Hes worse. He may die. Any questions?" I was too stunned to be coherent. Later a nephrologist informed me that Dougs kidneys were failing and he needed dialysis. I told this doctor what the prior two specialists had said, hoping he could reconcile their conflicting reports. Instead, he plied me with questions about their findings that I could not answer.18 In other words, the apparent lack of communication among the specialists could attenuate (or even offset) the advantages of specialization.19 We hypothesized that regions where quality measures were adopted early would experience the greatest improvement in survival with small influences on Medicare spending. By contrast, we hypothesized that a larger number of separate physicians should have uncertain effects on survival (depending on whether the network effects overwhelm the gains from specialization) but were likely to be associated with more rapid spending increases during the period. We did not estimate a traditional economic "production function," in part because we did not have sufficient data on inputs in each year during 19862002. Instead, we tested whether characteristics of regions in 199495 were predictive of productivity growth; that is, whether survival gains were high relative to spending growth during the period 19862002. To quantify the importance of these two region-level measures (quality of treatment and number of different physicians), we estimated a least-squares regression at the HRR level. The dependent variable was the risk-adjusted change, 19862002, in one-year survival, one-year spending, or one-year log (or proportional) spending. The index of quality was entered flexibly with separate categorical variables for the quality index from 1 to 4, and for quartiles (or twenty-fifth-percentile groupings) of the average number of different physicians per patient, with all estimates reported using the ADJUST command in STATA 9.
Exhibit 1
Focusing on just Part A (inpatient) spending could be misleading, because some regions could provide a greater fraction of care through Part B (physician). However, the correlation between total (Parts A and B) spending and survival is similar (correlation coefficient = .36, p < .001), largely because Part A and Part B spending are themselves so highly correlated (correlation coefficient = .77, p < .001). In sum, these results are at least consistent with the interpretation that U.S. health care is on the "flat of the curve," where spending more does not yield additional health benefits; indeed, it seems to imply (from the least-squares regression line) that more spending yields worse health outcomes.
Exhibit 2
Exhibit 3
As a simple benchmark, we note that Part A spending increased by $12,399 during 19862002 (in 2003 dollars). At the same time, expected one-year survival has risen by 9.8 per 100 AMI patients. Using the adjustment in the 1998 study by Cutler and colleagues to transform one-year mortality to expected life years, the average cost-effectiveness ratio is $20,806 per life year, which in 2003 dollars means that the health care costs over the entire period were "worth it" in the sense of meeting standard cost-effectiveness criteria. Since 1996, however, the cost-effectiveness ratio for Part A spending was just under $300,000, a number well above the traditional cost-effectiveness hurdle and thus no longer "worth it." Still, we are left with a puzzle: Why do the cross-sectional data tell such a different story from the time-series patterns from 1986 to 2002? A better understanding of this puzzle can be gained by considering region-specific changes over time in survival and spending, to which we turn next.
One simple hypothesis is to ask whether regions that experienced the greatest cost increases during 19862002 also experienced the greatest gain in survival during the same period. We consider this question using both graphical and statistical approaches. As before, in the graph we include only regions with at least 250 AMI patients in 1986, while the statistical analysis uses all data at the HRR level, weighted by the number of AMI patients in 1986.
Exhibit 4
If simply spending more on health care improved survival, we would expect a positive association between survival and spending, with regions lining up from the southwest to the northeast of the graph. However, if anything, the association is reversed: Regions experiencing the greatest spending increase during 19862002 tended to lag behind in survival gains. The graphical result is borne out in statistical correlations using the entire data set. The correlation coefficient between growth in survival and in expenditures is .21 (p < .001). These dollar increases reflect changes both in reimbursement rates across regions and in the quantity of care provided. Focusing just on quantity, we created an index of the billing weights from the inpatient diagnosis-related groups (DRGs). This index is not affected by either regional differences in reimbursement rates or changes in reimbursement rates. Again, the correlation coefficient is .14 (p = .02). Finally, the correlation between the proportional growth rate in spending and in survival is .04 (p = .49). At best, there is no association between spending growth and survival gains.
Exhibit 4
Regression results are shown in Exhibit 5
The right-hand section of Exhibit 5
The left-hand section of Exhibit 5
Dramatic progress has been made in the treatment of heart attacks among the elderly during the past two decades. Between 1986 and 2002 the average one-year survival rate following AMI increased by nearly 10 per 100 elderly AMI patients at an estimated cost of less than $25,000 per life year saved. But underlying these numbers is tremendous heterogeneity across time and space: There was little improvement in survival after 1996, despite continued growth in costs, and there was much variation in survival gains across regions and over time, with regional gains that were (if anything) negatively related to costs. These facts and others like them have generated a debate over the value of additional medical care spending. On the one hand, aggregate trends in patient outcomes suggest that the technological innovations were "worth it." In contrast, the apparent lack of any strong association between costs and patient outcomes or quality of care across regions suggests that aggressive cost-control policies might benefit society by eliminating unnecessary medical care for patients in high-cost regions.24 One approach to reconciling the two views in this debate is "flat of the curve": Changes in patient outcomes over time reflect valuable technological change in medicine, but incremental spending across regions (beyond some minimum) provides little benefit or could even harm patients through medical errors or iatrogenic disease.25 But "flat of the curve" explanations are not sufficient to explain the patterns we found. Simple overuse of unnecessary treatments cannot explain why high-spending regions are less likely to provide inexpensive but effective treatments as found in the Fisher studies and studies by Katherine Baicker and Amitabh Chandra. Similarly, the "flat of the curve" hypothesis is not consistent with our findings that areas with lower gains in survival and higher cost growth lagged behind in adopting relatively low-cost, noninvasive treatments.
We suggest a different approach. Local regions where patients receive their care differ along numerous dimensions; thus, each develops its own local constrained "production function" relating spending to health outcomes.26 The bottom curve in Exhibit 6
A key assumption is that uneven diffusion of cost-effective innovations is a key factor driving differences in patient costs and outcomes.27 Support for this view comes from Paul Heidenreich and McClellan, who concluded that the vast majority of the increase in thirty-day survival following AMI between 1975 and 1995 was the consequence of low-cost treatments such as aspirin, beta-blockers, angiotensin-converting enzyme (ACE) inhibitors, and thrombolytics.28 Furthermore, both observational and clinical trial evidence suggests that use of these noninvasive treatments reduces the incremental benefit of more-expensive treatments such as invasive surgery.29 Therefore, adoption of such innovations might have caused a reduction in health care costs for many patients and thus resembled more closely those advancessuch as seen in computersthat both reduce costs and improve quality. The policy implications of this new approach are also quite different. In the "flat of the curve" approach, spending in high-cost regions could be reduced without an obvious adverse impact on outcomes. However, our new model does not preclude the possibility that sharp cutbacks in low-efficiency, high-cost regions could adversely affect quality of care.30 Instead, policies should be directed toward improving productivity: restructuring hospital resources, improving the efficiency of physician treatment patterns, and accelerating the diffusion of highly effective treatments. Put more simply, the benefits of health spending depend on how one spends the money.31 It is reassuring that our views are in concordance with those of Cutler and others, who also recognize the potential excess costs arising from marginally efficient modes of care and coordination problems.32 It is important to note that our new view does not apply solely to "low-tech" effective treatments such as aspirin and beta-blockers, but rather it applies equally to any highly effective treatment, whether high-tech or low-tech. Indeed, we have found in earlier research that regional rates of high-tech surgical treatments such as angioplasty and back surgery are not correlated with per capita Medicare spending.33 This paper has not examined diffusion in surgical interventions, but preliminary results suggest that this was not the primary cause of spending growth during the study period. The benefits of health care technology are often substantial. However, as health care costs continue to rise, squeezing consumers, producers, and the federal budget alike, principles of accountabilitythat each incremental dollar should provide something of real value to patientsbecome increasingly important. That some regions could implement technological innovations at remarkably low cost is a reminder that waste and inefficiency are not inevitable by-products of technological growth. Thus, efforts to develop measures of quality and efficiency that can encourage hospitals or provider groups to adopt low-cost, highly effective care, while discouraging incremental spending with no apparent benefits, might allow us to keep the golden goose of technological progress alive and well nourished.
Jonathan Skinner (Jonathan.skinner{at}dartmouth.edu) is the John French Professor of Economics, Dartmouth College; a senior research associate at the Center for Evaluative Clinical Sciences (CECS), Dartmouth Medical School; and a research associate with the National Bureau of Economic Research (NBER). Douglas Staiger is a professor of economics at Dartmouth, an adjunct professor at the CECS, and a research associate with the NBER. Elliott Fisher is a professor of medicine and of community and family medicine at Dartmouth Medical School and a senior associate at the Veterans Affairs (VA) Outcomes Group in the White River Junction VA Hospital, White River Junction, Vermont. The authors are grateful to National Institute on Aging (NIA) Grant no. PO1-AG19783 for financial support; to Amitabh Chandra, David Cutler, Victor Fuchs, Mark McClellan, John Wennberg, and two anonymous referees for helpful and constructive comments; and to Weiping Zhou and Daniel Gottlieb for expert research assistance.
This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||