Published ahead of print on November 25, 2003, doi:10.1164/rccm.200310-1376OC
© 2004 American Thoracic Society Diagnosing AsthmaComparisons between Exhaled Nitric Oxide Measurements and Conventional TestsOtago Respiratory Research Unit, Dunedin School of Medicine, University of Otago Medical School, Dunedin, New Zealand Correspondence and requests for reprints should be addressed to D. Robin Taylor, M.D., Dunedin School of Medicine, University of Otago Medical School, P.O. Box 913, Dunedin, New Zealand. E-mail: robin.taylor{at}stonebow.otago.ac.nz
International guidelines recommend a range of clinical tests to confirm the diagnosis of asthma. These focus largely on identifying variable airflow obstruction and responses to bronchodilator or corticosteroid. More recently, exhaled nitric oxide (FENO) measurements and induced sputum analysis to assess airway inflammation have been highlighted. However, to date, no systematic comparisons to confirm the diagnostic utility of each of these methods have been performed. To do so, we investigated 47 consecutive patients with symptoms suggestive of asthma, using a comprehensive fixed-sequence series of diagnostic tests. Sensitivities and specificities were obtained for peak flow measurements, spirometry, and changes in these parameters after a trial of steroid. Comparisons were made against FENO and sputum cell counts. Sensitivities for each of the conventional tests (047%) were lower than for FENO (88%) and sputum eosinophils (86%). Overall, the diagnostic accuracy when using FENO and sputum eosinophils was significantly greater. Results for conventional tests were not improved, using a trial of steroid. We conclude that FENO measurements and induced sputum analysis are superior to conventional approaches, with exhaled nitric oxide being most advantageous because the test is quick and easy to perform.
Key Words: asthma diagnosis exhaled nitric oxide induced sputum lung function Bronchial asthma is common and in Western populations its prevalence is about 20% in young adults (1). Diagnosis is based on a history of wheeze, shortness of breath, and cough, which are variable in severity and over time (2), but it may not always be straightforward and clinicians frequently wish to validate the diagnosis using objective tests. This is particularly so if maintenance treatment with long-term inhaled corticosteroid therapy is being considered. International guidelines recommend the measurement of serial peak expiratory flows or spirometry to confirm the diagnosis of asthma (35). The response to either inhaled bronchodilator or to a trial of corticosteroid is also included as part of current recommendations. However, these traditional approaches are problematic. First, they are primarily based on demonstrating abnormal airway physiology. Particularly in mild asthma, this is often not present and for this reason the sensitivity of each of these tests is low (68). Second, obtaining serial peak flow recordings or repeated spirometry over an interval of 1014 days (as recommended) requires adequate patient compliance, and it is the experience of many clinicians that this is not easily achieved (7). As an alternative, supportive evidence may be sought using laboratory-based tests for nonspecific bronchial hyperresponsiveness (BHR) (3, 5, 9). BHR is a recognized feature of asthma and is associated with increased vulnerability to exogenous stimuli. But again, the sensitivity of tests such as methacholine challenge is variable (8, 10), and access to testing is not always available. Updates to current guidelines have highlighted the potential diagnostic role of exhaled nitric oxide (FENO) measurements and induced sputum analysis (5). FENO is increased in patients with bronchial asthma (11) and has been shown to distinguish subjects with asthma from those without asthma (11, 12) with a high degree of discriminatory power (1315). Moreover, FENO correlates well with airway eosinophilia and with BHR (1618). Thus FENO is a potential surrogate measurement of abnormal airway pathology in asthma, and as such may be a more appropriate diagnostic tool than conventional approaches based on abnormal physiology. The test is easily performed in both adults and children (19). A similar rationale applies to the use of induced sputum analysis, but the latter is much more technically demanding, and is not routinely available. To date, there has been no systematic comparison between FENO measurements and each of the "routine" tests recommended for diagnosing asthma. In this study, we have compared FENO measurements and induced sputum cell counts against serial peak flow recordings, spirometry, and airway responses to inhaled bronchodilator and oral steroid, in an unselected population of patients being investigated by their family practitioner. Our principal aim was to evaluate the diagnostic utility of FENO measurements as a test for asthma. Our data have previously been presented in part as an abstract (20).
Subjects We recruited 47 consecutive patients aged 875 years referred by their family practitioner to the Dunedin Hospital (Dunedin, New Zealand) pulmonary function laboratory for investigation of possible bronchial asthma. The laboratory offers spirometry and BHR testing to community-based clinicians as a routine service. No patient had been referred for specialist consultation as such, and there was no prior selection of patients except that each subject was required to have had respiratory symptoms for a minimum of 6 weeks. Subjects were excluded if they had used oral or inhaled corticosteroid in the preceding 4 weeks or if they had had a typical respiratory tract infection in the previous 6 weeks. The study was approved by the Otago Ethics Committee and all subjects gave written informed consent.
Study Design
At the final visit, the diagnosis of asthma was ascertained on the basis of the following: relevant symptom history (present in all patients), using American Thoracic Society criteria (2), and a positive test for BHR and/or a positive response to bronchodilator. These were defined as: provocative dose of hypertonic saline resulting in a 15% fall in FEV1 (PD15) of less than 20 ml (21) and an increase in FEV1 of 12% or greater from baseline 15 minutes after inhaled albuterol, respectively (6).
Study Procedures Exhaled nitric oxide was measured before any forced expiratory maneuvers, according to current guidelines (19). All readings were obtained by technical staff who were blinded as to the clinical status of the patients. Two exhalation flow rates were used (50 and 250 ml/second). For the flow rate of 50 ml/second, FENO levels were read at the first NO plateau, and for the flow rate of 250 ml/second, they were read at the end-of-exhalation carbon dioxide plateau (23). The correlation between FENO50 and FENO250 was high (r = 0.99, p < 0.0001). On the basis of these and other data (24), only results for FENO50 are reported. Bronchodilator reversibility was calculated as percent change in FEV1 15 minutes after inhaling 400 µg of albuterol. An increase of 12% or greater was considered significant (6). Bronchial hyperresponsiveness to hypertonic saline (4.5%) was measured by means of a modified version of an earlier protocol, from which PD15 was established (21). Simultaneously, sputum was collected for cell analysis according to a standardized method (25).
Statistical Analysis Receiveroperator characteristic (ROC) curves were constructed to compare the diagnostic utility for each test, except for percent change in FEV1 with bronchodilator and PD15, both of which had been used as diagnostic criteria. The areas under the ROC curves were compared statistically by the method described by Hanley and McNeil (27). Predictive accuracy was defined as the percentage of all test results that correctly identified presence or absence of asthma.
Fifty-one consecutive patients were enrolled. Four patients withdrew after the first study visit because of the time commitment. The baseline characteristics of the remaining 47 are shown in Table 2 . Sputum induction was possible in only 40 of 47 patients.
Seventeen patients (36%) were diagnosed with asthma, 15 having symptoms and evidence of BHR, and a further 2 having symptoms and a positive response to bronchodilator without evidence of BHR. On the basis of Global Initiative for Asthma (GINA) criteria, 1 patient was classified as having severe asthma, 4 had moderately severe asthma, and 12 had mild asthma. Thirty patients (64%) were deemed not to have asthma. The final diagnosis for these patients was as follows: chronic rhinosinusitis (13; 28%); extended postviral respiratory syndrome (8; 17%); gastroesophageal reflux disease (6; 13%); eosinophilic bronchitis (2; 4%); and chronic obstructive pulmonary disease (1; 2%). Thirteen of the 17 patients with asthma (76%) were atopic compared with 13 of 30 (43%) in the nonasthmatic group. Compared with the nonasthmatic group, the mean FEV1 (percent predicted) and FEV1/FVC ratio were significantly lower with asthma (Table 2). FENO and sputum eosinophils were significantly higher in the asthmatic group (Table 2). The peak flow and spirometric responses to a trial of oral prednisone did not differ significantly between the two groups: mean (SD) morning peak flows increased by 7.0% (9.8) and 2.2% (6.7) and FEV1 increased by 5.4% (12.6) and 2.1% (4.2) in the asthmatic and nonasthmatic patients, respectively. The sensitivities, specificities, and positive and negative predictive values for each of the diagnostic tests are shown in Table 3 . ROC curves and comparisons of areas under the ROC curves are shown in Figure 1 and Table 4 , respectively. Both FENO and sputum eosinophils provided significantly higher degrees of diagnostic accuracy than did tests based on lung function. For FENO, the optimum cut point for diagnosing asthma, based on calculating the predictive accuracy for a range of different FENO50 levels, was 20 parts per billion (see the online supplement for detailed data).
In contrast, conventional lung function tests (spirometry and peak flow recordings) provided substantially lower degrees of diagnostic accuracy. Measurements taken before and after a trial of oral steroid did not offer an improvement in this overall picture. Given that bronchodilator reversibility and BHR were used to define the diagnosis of asthma, similar comparisons could not be made for these tests. However, significant bronchodilator reversibility was present in only 7 of 17 patients with asthma (41%) (poorly sensitive). There were significant correlations between FENO50 and sputum eosinophils (r = 0.67, p < 0.001) and PD15 (r = -0.56, p < 0.001).
In this prospective study, we have evaluated a comprehensive range of tests for confirming the diagnosis of asthma and compared those that are recommended in international guidelines with more recent "state-of-the-art" approaches, notably exhaled nitric oxide measurements. Our results show that single measurements of FENO in patients with undiagnosed chronic respiratory symptoms are strongly predictive of a diagnosis of asthma. On the basis of ROC curve analysis, FENO was also superior to the majority of conventional tests of lung function, such as peak flow recordings and spirometry, as well as changes in these parameters after a trial of steroid. In addition, we assessed the value of induced sputum analysis, given that it has been shown to reflect the underlying pathology of asthma (25, 28). FENO measurements were comparable to this technically more demanding approach, but with the advantage of being noninvasive, and quick and easy to perform. "Best practice" for confirming the diagnosis of asthma is currently based on assessing abnormal airway physiology, using relatively simple tests, despite a relative dearth of supportive evidence (35). Peak flow variation, spirometric values, and responses to bronchodilator have been shown to differ significantly between subjects with asthma and those without asthma (7). For the most part, this was confirmed in the present study (Table 2). However, our results demonstrate that despite significant between-group differences, the diagnostic usefulness of these conventional tests, including the response to a trial of steroid, is limited. They were poorly sensitive and their predictive values were much lower than for FENO and sputum eosinophils. This is in keeping with the results of two other studies showing that peak flow variation (8) and bronchodilator response (29) are poorly sensitive in diagnosing asthma, particularly in patients who present with mild disease. Thus, although it may be desirable for clinicians to confirm the diagnosis of asthma "on the spot," the overall yield of currently advocated, easily accessible tests is likely to be low. It might be argued that the thresholds we used to define "abnormal" for peak flow measurements and spirometry were inappropriate in a population with only mild asthma. We examined this issue by altering the threshold of abnormal for both FEV1 (percent predicted) and FEV1/FVC ratio. For FEV1 (percent predicted), no significant improvement in sensitivity was obtained by setting the threshold at 90% (Table 3). Similarly, for the FEV1/FVC ratio, a cut point of 80% improved the sensitivity to only 47%. This pattern of results strengthens our conclusion that FENO measurements are of even greater diagnostic value in patients presenting with only mild disease. The advent of FENO measurements to evaluate airway inflammation in asthma represents a significant advance in respiratory medicine, but its application in day-to-day clinical practice as a diagnostic tool has not yet been defined. This was recognized in the most recent update to GINA guidelines (5). Asthma is an inflammatory disease, for the most part, although not exclusively characterized by eosinophilic airway inflammation, variable airways obstruction, and BHR (30, 31). Given that a significant relationship between FENO and both sputum eosinophils and BHR has been confirmed in this and previous studies (16, 18), it is therefore much more logical to use FENO as a surrogate test for airway inflammation than to rely on physiological changes, the measurements of which are variable over time, are often undetectable in mild cases, and correlate only poorly with clinical symptoms. Our results provide evidence in support of this view. The predictive value of any test is dependent on the "gold standard" used for the diagnosis. The GINA guidelines describe asthma as a "chronic inflammatory disorder characterised by recurrent respiratory symptoms against a background of increased responsiveness to external stimuli giving rise to variable airflow limitation which is reversible either spontaneously or with treatment" (5). On the basis of this description, we elected to define asthma in our study as follows: relevant respiratory symptoms (which were present in all subjects and therefore could not be regarded as discriminatory) plus a positive test for bronchial hyperresponsiveness (BHR) and/or bronchodilator reversibility. A similar approach has been used previously (9, 32). In fact, we performed additional analyses (not reported) using two alternative gold standards. These were "relevant symptoms plus sputum eosinophilia (greater than 3%)" and "relevant symptoms plus one or more of the following: greater than 20% peak flow variability; positive bronchodilator response (spirometry); positive steroid response (peak flows or spirometry); positive test for BHR." By using these alternative definitions even greater accuracy for FENO as a diagnostic test was obtained. Thus the present results are conservative and have not been influenced by the definition of asthma used. Of necessity, our particular choice of gold standard also meant that neither bronchodilator response nor BHR could be assessed against FENO in terms of their diagnostic utility, because both were used to define asthma in the first place. The former is often used to confirm "reversibility." Despite this apparent lack, it is worth noting that even in the asthmatic group, fewer than half (7 of 17) demonstrated reversibility to bronchodilator, indicating low sensitivity for this test. The choice of hypertonic saline rather than another challenge agent was pragmatic in that it permitted simultaneous induced sputum collection. Compared with histamine or methacholine, hypertonic saline appears to be less sensitive but more specific for diagnosing asthma (33, 34). If anything, this will have resulted in a more conservative outcome than if methacholine or histamine had been used. The present study provides evidence of relevance in a population of patients that is most likely to be encountered in day-to-day clinical practice. By prospectively studying consecutive patients in primary care with hitherto undiagnosed symptoms, and who had relatively mild disease, we have demonstrated that even with low pretest probabilities, FENO measurements perform well when compared with conventional tests. Our results are consistent with those of three other studies (1315) in which the diagnostic role of FENO measurements has been assessed. However, these earlier investigations offer somewhat different and more limited perspectives. In the studies by Dupont and coworkers (13) and Malmberg and coworkers (14), although the diagnostic accuracy for FENO was similarly high, no direct comparisons with other clinical and laboratory-based tests for asthma were made. Similarly, Deykin and coworkers (15) studied selected rather than unselected patients and focused on the impact of different expiratory flow rates during FENO measurements on diagnostic accuracy. In summary, our study confirms the overall superiority of FENO measurements and induced sputum analysis in the diagnosis of asthma compared with conventional tests. FENO measurements are quick and easy to perform and may be readily incorporated into routine pulmonary function test procedures. This advance offers the possibility that a diagnosis of asthma may be performed more easily and confirmed with much greater confidence than has been possible to date.
Supported by the Otago Medical Research Foundation and the Otago Respiratory Research Trust. GlaxoSmithKline provided a personal educational grant to A.D.S. as GSK Research Fellow. This article has an online supplement, which is accessible from this issue's table of contents online at www.atsjournals.org Conflict of Interest Statement: A.D.S. has no declared conflict of interest; J.O.C. has no declared conflict of interest; S.F. has no declared conflict of interest; C.M. has no declared conflict of interest; G.M-S. has no declared conflict of interest; P.J. has no declared conflict of interest; D.R.T. has no declared conflict of interest. Received in original form October 8, 2003; accepted in final form November 24, 2003
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||