Published ahead of print on May 8, 2008, doi:10.1164/rccm.200801-090OC
© 2008 American Thoracic Society doi: 10.1164/rccm.200801-090OC
Intersession Variability in Single-Breath Diffusing Capacity in Diabetics without Overt Lung Disease1 Division of Pulmonary and Critical Care Medicine, The Johns Hopkins University School of Medicine, Baltimore, Maryland; 2 Pfizer Global Research and Development, New London, Connecticut; 3 University of Iowa College of Medicine, Iowa City, Iowa; 4 University of Utah, and Pulmonary Division, LDS Hospital, Salt Lake City, Utah; and 5 Duke University Medical School, Durham, North Carolina Correspondence and requests for reprints should be addressed to M. Bradley Drummond, M.D., The Johns Hopkins University School of Medicine, Division of Pulmonary/Critical Care Medicine, 5501 Hopkins Bayview Circle, JHAAC 4B.70, Baltimore, MD 21224. E-mail: mdrummo3{at}jhmi.edu
Rationale: American Thoracic Society guidelines state that a 10% or greater intersession change in diffusing capacity of the lung (DLCO) should be considered clinically significant. However, little is known about the short-term intersession variability in DLCO in untrained subjects or how variability is affected by rigorous external quality control. Objectives: To characterize the intersession variability of DLCO and the effect of different quality control methods in untrained individuals without significant lung disease. Methods: Data were pooled from the comparator arms of 14 preregistration trials of inhaled insulin that included nonsmoking diabetic patients without significant lung disease. A total of 699 participants performed repeated DLCO measurements using a highly standardized technique. A total of 948 participants performed repeated measurements using routine clinical testing. Measurements and Main Results: The mean intersession absolute change in DLCO using the highly standardized method was 1.45 ml/minute/mm Hg (5.64%) compared with 2.49 ml/minute/mm Hg (9.52%) in the routine testing group (P < 0.0001 for both absolute and percent difference). The variability in absolute intersession change in DLCO increased with increasing baseline DLCO values, whereas the absolute percentage of intersession change was stable across baseline values. Depending on the method, 15.5 to 35.5% of participants had an intersession change of 10% or greater. A 20% or greater threshold would reduce this percentage of patients to 1 to 10%. Conclusions: Intersession variability in DLCO measurement is dependent on the method of testing used and baseline DLCO. Using a more liberal threshold to define meaningful intersession change may reduce the misclassification of normal variation as abnormal change.
Key Words: respiratory function testing diffusing capacity intersession variability inhaled human insulin methodology
Measurement of single-breath diffusing capacity of the lung (DLCO) is an integral component of the assessment of lung function in many diseases (1–5). As inhaled medications for nonpulmonary diseases are developed, DLCO will play a more important role in monitoring for early lung toxicity in individuals. This assessment requires adequate characterization of the expected intersession variability in DLCO in individuals without lung disease. Intersession variability in DLCO measurements exists as a consequence of biological and measurement variability (6, 7). Current American Thoracic Society/European Respiratory Society (ATS/ERS) guidelines state that a change of more than 10% in DLCO over 1 year should be considered clinically significant (8). This value is derived from the work of Hathaway and colleagues, who reported on the intraindividual variability in five DLCO measurements in eight healthy trained pulmonary function technicians over 1 year (9). Additional variability can be introduced by using different testing centers to serially monitor an individual. Wanger and Irvin have shown that healthy trained subjects had a substantial variation in measured DLCO when tested at 13 pulmonary function laboratories in a major metropolitan center (10). To minimize this measurement variability, published guidelines provide standard methods for DLCO measurement and quality control (8, 11). Although standard testing methods improve reproducibility within a single session at a single center (12), there are few data about the intersession reproducibility in DLCO measurements in untrained individuals free of substantial lung disease. Moreover, it is unclear how much intersession reproducibility can be improved by instituting a centralized structured program of laboratory quality control. Recently, a series of multicenter trials of inhaled human insulin (Exubera [insulin human (rDNA origin)] Inhalation Powder; Pfizer, Inc., New York, NY; and Nektar Therapeutics, San Carlos, CA) was completed in individuals with type 1 and 2 diabetes (13–22). Serial DLCO measurements were obtained in participants receiving inhaled insulin and in comparator participants receiving no inhaled therapy. In the initial drug safety trials, lung function was measured in community pulmonary function laboratories (hereafter referred to as "routine testing"). In later trials, a comprehensive program of centralized quality control monitoring was instituted (referred to as "highly standardized testing") (23–27). Individuals enrolled in the comparator groups did not receive inhaled therapies during the study protocol. The present analyses used DLCO measurements from the comparator groups of these preregistration trials of inhaled insulin to address the following questions: What is the mean intersession difference in DLCO measurements obtained using routine as compared with highly standardized testing in individuals free of overt lung disease? Is the magnitude of intersession difference best described as absolute change or percentage of change from baseline DLCO? Are the intersession differences observed in absolute and percent changes affected by the baseline DLCO level? Is the current ATS recommendation of a 10% threshold to define clinically significant change appropriate for an untrained population free of overt lung disease?
Study Participants Data for this analysis were pooled from a database of participants enrolled as control subjects in 14 preregistration trials of inhaled insulin (Table 1). All participants were individuals with type 1 or type 2 diabetes using either subcutaneous insulin or oral hypoglycemic medications. Individuals enrolled in type 1 diabetes studies ranged from 18 to 65 years of age, whereas individuals enrolled in type 2 diabetes studies ranged from 35 to 75 years of age. Individuals with active lung disease, self-reported smoking within 6 months, previous abnormal lung function, inability to perform pulmonary function tests, or major organ system disease were excluded. Individuals were screened for lung disease with physical examination, chest radiograph, and pulmonary function testing (PFT). Individuals with stable mild asthma (determined by self-report) or mild chronic obstructive pulmonary disease (COPD) (ratio of FEV1 to FVC <70%, FEV1 70% predicted, and a history of smoking) were permitted to participate in these studies. Participants underwent baseline and serial PFT.
Monitoring and Measurements In the preregistration trials, individual participants performed DLCO measurements at baseline and serially for 3 months to 3 or more years, depending on the specific trial protocol. For this analysis, the baseline and the last measurement at or before the Month 6 visit were used. Individuals underwent PFT measurements using either a highly standardized or routine approach. Methods for the highly standardized approach have been previously described (23). In brief, all sites used the same type of lung function analyzer (Collins CPL; Ferraris Respiratory, Louisville, CO), and all analyzers were calibrated and leak-tested daily. Technicians, many of whom had no previous experience in performing PFTs, underwent a 2-day training course followed by a written test and performance evaluation. For all PFT measurements in the highly standardized method, the prediction equations of Miller and colleagues (28), Crapo and colleagues (29), and Hankinson and colleagues (30) were used for DLCO, total lung capacity (TLC), and FEV1, respectively. A 12% adjustment in TLC and DLCO predicted values was applied for African-American participants. All study data were transmitted to a reading center (Ferraris Respiratory), where they were reviewed within 24 hours for test quality, and feedback was given to technicians. The pulmonary function laboratories using the routine testing approach were all community and hospital pulmonary function laboratories. By protocol, these testing laboratories were required to follow published guidelines for performance and reproducibility as recommended by the ATS (11). However, formal technician training and central review of results were not performed at these sites. For calculation of predicted values, the prediction equations in place at the particular PFT lab were used. In the routine testing group, performing serial testing at different pulmonary function laboratories or on different pieces of equipment was possible. Although not specifically tracked, testing at a different site was a very rare event insofar as patients were monitored at the same clinical site throughout. In both the highly standardized and routine testing approach, values of DLCO were corrected for hemoglobin concentration per ATS guidelines (11).
Statistical Analysis
Baseline Demographic and Clinical Characteristics A total of 739 participants were enrolled in four studies using the highly standardized procedure. Of these, 699 participants had both baseline and follow-up DLCO measurements within 6 months (Table 2). For 82% of these individuals, the first and last DLCO measurements were 6 months apart, whereas 18% had a 3-month interval between the two tests. A total of 1,098 participants were included in 10 studies using the routine testing procedures. Of these, 948 had both baseline and follow-up DLCO measurements within 6 months. For 76% of the participants in this group, 6 months elapsed between the first and last test, whereas 24% of participants had a 3-month interval between tests. Sex and smoking status were not significantly different between the highly standardized and routine testing groups. Subjects in the highly standardized group were slightly younger (45 vs. 52 yr), had a lower body mass index (27.2 vs. 29.1 kg/m2), and more frequently had type 1 diabetes (58 vs. 25%). The mean age for all type 1 subjects was 38.0 ± 10.9 years. For the subjects with type 2 diabetes included in this analysis, the mean age was 56.16 ± 10.1 years. Of the 699 individuals in the highly standardized group, 36 (5.4%) had mild stable asthma (n = 9) or COPD (n = 27). In the routine testing group, 38 (4.0%) of the 948 individuals with this information available had mild asthma (n = 19) or COPD (n = 19). Absolute DLCO measurements at baseline were similar for the two groups (25.90 ± 6.26 ml/min/mm Hg for the highly standardized group, 26.01 ± 6.52 ml/min/mm Hg for the routine testing group). At baseline, percent predicted DLCO was lower in the highly standardized group compared with routine testing (92.44 ± 12.6 vs. 98.31 ± 17.16% predicted, P < 0.0001).
Mean Intersession Change in DLCO Measurements The mean ± SD intersession change in DLCO measurements for participants in the highly standardized group was –0.31 ± 1.86 ml/minute/mm Hg. For participants in the routine testing group, the mean intersession change was –0.54 ± 3.50 ml/minute/mm Hg. The mean absolute intersession change in DLCO measurements (change expressed as positive, regardless of direction) for individuals using the highly standardized approach was smaller than in the routine testing group (1.45 vs. 2.49 ml/min/mm Hg, P < 0.0001) (Figure 1, top). The highly standardized group also had a smaller interquartile range (75th–25th percentile) than did the routine testing group (1.51 vs. 2.61 ml/min/mm Hg). The 90th percentile for mean absolute intersession change was lower in the highly standardized group than in the routine testing group (3.06 vs. 5.40 ml/min/mm Hg).
The mean intersession change was also expressed in terms of the percentage of change from baseline (final value minus baseline value, divided by baseline value, times 100%). The mean absolute percentage of intersession change for the highly standardized approach was statistically significantly smaller than that for the routine testing group (5.64 vs. 9.52%, P < 0.0001) (Figure 1, bottom). Similarly, the highly standardized group had a smaller interquartile range than did the routine testing group (5.82 vs. 9.73%). The 90th percentile for mean absolute percentage of intersession change from baseline was lower in the highly standardized group than in the routine testing group (11.3 vs. 20.0%). When the diffusing capacity was separated into components Kco (DLCO/VA) and VA, both KCO and VA components had smaller variability in the highly standardized method when compared with routine method. Both components contributed approximately equally to the observed variability. Because the age of participants in the routine testing method was slightly older than those in the highly standardized method, we assessed the possible interaction of age on intersession variability in these groups. When analyzing mean absolute intersession change across age strata within a testing method, no difference was seen between age groups in the highly standardized (P = 0.39) or routine testing group (P = 0.10). Similarly, no difference in mean absolute percentage of intersession change was seen across age strata in the highly standardized (P = 0.59) or routine testing group (P = 0.71). To assess the potential effect of individuals with mild asthma or COPD on the observed variability, a sensitivity analysis was performed excluding those individuals. After excluding the 5.4% of individuals in the highly standardized approach, we observed no difference in the mean absolute intersession change (1.43 ml/min/mm Hg) or interquartile range for mean absolute intersession change (1.53 ml/min/mm Hg). Similarly, there was no difference in mean absolute percentage of intersession change (5.57%) or interquartile range for mean absolute percentage of intersession change (5.89%). After excluding the 4% of participants in the routine testing group with mild asthma or COPD, we observed no difference in the mean absolute intersession change (2.50 ml/min/mm Hg) or interquartile range for mean absolute intersession change (2.65 ml/min/mm Hg). Similarly, there was no difference in mean absolute percentage of intersession change (9.52%) or interquartile range for mean absolute percentage of intersession change (10.12%).
Intersession Change in DLCO as a Function of Baseline DLCO
In contrast, the mean absolute percentage of intersession change across categories of baseline DLCO did not differ significantly, regardless of testing method used (P = 0.28 for the highly standardized group, P = 0.39 for routine testing) (Figure 2, bottom). The intersession percentage of change in the highly standardized group ranged from 5.28% in the >30–35 ml/minute/mm Hg category to 8.25% in the 15 ml/minute/mm Hg category. In the routine testing group, the percentage of change ranged from 7.06% in the 15 ml/minute/mm Hg category to 10.91% in the >35 ml/minute/mm Hg category.
Cumulative Frequency Distribution Curves
The primary finding of this study was that there is substantial intersession variability in DLCO measurement in individuals free of overt lung disease and the degree of variability is dependent on the method of quality control. We found that in a large population of diabetic patients with no substantial lung disease, the 90th percentile for mean absolute intersession change could be as high as 3.06 ml/minute/mm Hg using a highly standardized approach and 5.40 ml/minute/mm Hg in routine testing. The observed absolute intersession change increased with increasing baseline DLCO value. However, when we expressed the intersession change as a function of percentage of baseline, the intersession variability did not change as a function of baseline DLCO. Because of this observation, the intersession percentage of change, rather than absolute intersession change from baseline DLCO, proved to be a more stable metric for the measure of variation in diffusion capacity. Our analysis involved more than 1,600 participants with diabetes, the majority of whom were free of any lung disease. Before this analysis, the available data regarding the intersession variability in healthy individuals were limited to measurement of small numbers of trained individuals at one or a few centers (9, 10). The current guidelines for interpretation of clinically important changes in DLCO measurement over time are limited by these small numbers of measurements. Well-characterized normative ranges of variability will become essential as new inhaled pharmacotherapies requiring serial DLCO monitoring in untrained patients without underlying lung disease are developed. The data in this study will assist physicians in determining whether an observed change in DLCO measurement is likely to be within observed variability in a population of healthy individuals not exposed to potential toxic agents. Although it may be desirable for pulmonary function laboratories to develop more rigorous centralized quality control procedures, such as those used in the highly standardized group, this may not be realistic in clinical practice. More importantly, the currently recommended 10% value for determining clinically significant change may be too sensitive. Approximately 15% of individuals who underwent highly standardized testing and 35% of individuals who underwent routine testing would have been characterized as having a clinically significant change according to the current ATS/ERS guidelines, with no other indication of a change in lung function. Therefore, it is reasonable to define a less restrictive threshold (e.g., 20% change from baseline DLCO) to determine a clinically meaningful change. Using a 20% threshold would reduce the percentage of individuals misclassified as having a clinically significant change to 1% in the highly standardized testing group and 10% among those undergoing routine testing, providing a better estimate of clinically important changes between two test sessions. However, expanded criteria for normal variation will reduce sensitivity of the test to detect clinically meaningful changes. Recognizing that practitioners may desire different thresholds in different clinical situations, our study provides the percentage of individuals misclassified as having a clinically significant change across a spectrum of thresholds ranging from 10 to 25%. It should be noted that, although the participants included in this study were free of overt lung disease, all had underlying diabetes mellitus. Some have speculated that diabetics may be prone to microvascular injury of the lung, similar to that seen in the kidney (31, 32). Such a process could potentially affect the intersession variability in DLCO measurements. Although it is possible that this situation may impair the ability to generalize our findings to a population free of any systemic illnesses, the mean baseline DLCO in our study population was within the normal range, and each participant acted as his or her own control over the study period. In addition, the mean change over the short period of observation did not differ from zero, suggesting that the populations did not experience a significant decline or improvement in their lung function. These factors should minimize any potential impact that diabetic microangiopathy of the lung might have on our interpretation. One might also speculate that shifts in pulmonary blood volume due to changes in glycemic control might increase the variability of DLCO in patients with diabetes, but such an effect has yet to be established. Although our study included a follow-up observation period of only 3 to 6 months, analysis of 2-year data from two protocols included in our study (studies 1022 and 1029) demonstrated an annualized change in DLCO of –0.222 ± 0.073 ml/minute/mm Hg for patients with type 1 diabetes (study 1022) and –0.292 ± 0.071 ml/minute/mm Hg in patients with type 2 diabetes (study 1029). These observed annual changes are consistent with what has been reported in nondiabetic patients (33), and counter the possibility of the presence of a diabetic lung syndrome accounting for the variability observed in the present study. Further studies are necessary to determine whether these measures of intersession variability would be similar in individuals with lung disease. When reanalyzing our population after excluding the 5% of individuals with mild stable asthma or COPD, we found no change in the observed intersession variability in either testing method. We suspect that in the more severely diseased lung, intersession variability would be even higher, thereby necessitating more liberal demarcation of a significant percentage of change in DLCO. Regardless of whether our results apply to other populations of patients with chronic nonpulmonary diseases, these results can clearly be applied to the interpretation of monitoring of inhaled insulin in diabetic populations if this therapy becomes available. Current U.S. prescribing information for Exubera recommends measuring DLCO but does not provide guidance as to the magnitude of change that would require further action. On the basis of our study, when using routine PFT methods, we suggest a 20 to 25% change in intersession DLCO as a more useful threshold for determining a clinically meaningful change in intersession DLCO, because this cutoff would minimize the percentage misclassified to 5 to 10%. In summary, we have shown that the expected variability in DLCO measurements over time in stable individuals is substantial and depends on the method of testing used. Using the percentage of change from baseline, rather than absolute change, minimizes the effect of baseline DLCO on the assessment of intersession variability. Finally, by studying a large population of individuals with diabetes, we have characterized the expected variability in intersession change in DLCO measurements using both highly standardized and routine testing conditions. The current recommendation of a clinically significant change, defined as 10% from baseline, appears too stringent and may misclassify as many as one out of three normal subjects as abnormal. On the basis of these findings, we propose that a 20 to 25% change from baseline DLCO is a more reasonable threshold when using routine PFT methods, thereby allowing the practicing physician to better estimate the clinical significance of an observed change in intersession single-breath diffusing capacity.
The authors thank Dr. Deborah McClellan and Janet Embry for their assistance in the preparation of this manuscript.
Supported by grant 1KL2RR025006-01 from the National Center for Research Resources, a component of the National Institutes of Health (NIH), and the NIH Roadmap for Medical Research. The data discussed in this article were pooled from clinical trials funded by Pfizer Global Research and Development. Originally Published in Press as DOI: 10.1164/rccm.200801-090OC on May 8, 2008 Conflict of Interest Statement: M.B.D. does not have a financial relationship with a commercial entity that has an interest in the subject of this manuscript. P.F.S. has been a full-time employee of Pfizer, Inc., since 1995. W.T.D. has been a salaried employee of the sponsor Pfizer since 1999 and has less than $2,000 of Pfizer stock in his 401K plan. In addition, W.T.D.'s spouse also works at Pfizer and has less than $1,000 of Pfizer stock in her 401K plan. J.G.T. is a full-time employee of Pfizer since 2000 and holds stock options. R.J.R. is a full-time employee of Pfizer, Inc., since December 2004. R.C.A. received $21,000 in 2005 and $6,000 in 2007 from Pfizer for providing advice related to the clinical study of inhaled insulin. R.O.C. does not have a financial relationship with a commercial entity that has an interest in the subject of this manuscript. R.D.E. is a full-time employee at Pfizer, Inc., and owns Pfizer stock and Pfizer stock options. N.R.M. consults for the Viasys Health Care Pulmonary Function Division and receives $5,000 per year. R.L.J. has participated on an "oversight" committee for Pfizer to oversee the quality of pulmonary function reports obtained during clinical trials for inhaled insulin. R.A.W. has served as a paid consultant to Pfizer for matters related to pulmonary function testing in clinical trials of Exubera. Received in original form January 14, 2008; accepted in final form May 8, 2008
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||