© 2002 American Thoracic Society
Validation of Nasal Pressure for the Identification of Apneas/Hypopneas during SleepDepartments of Medicine and Sociology, Foothills Hospital and University of Calgary, Calgary, Alberta, Canada Correspondence and requests for reprints should be addressed to W. Ward Flemons, Foothills Hospital/University of Calgary, 1403 29th Street NW, Calgary, Alberta, T2N 2T9 Canada. E-mail: flemons{at}ucalgary.ca
The reference standard for identifying apneas and hypopneas is a pneumotachograph, but using this can disrupt sleep. Nasal airflow estimation by measuring nasal pressure via nasal prongs is better tolerated by patients. However, nasal pressure has not been validated, using an event-by-event analysis, for detecting apneas/hypopneas during sleep. Eleven patients undergoing polysomnography wore a nasal mask capable of measuring nasal airflow (via pneumotachograph) and nasal pressure simultaneously. Each study was screened for respiratory disturbances, and from these 550 were randomly selected and blindly scored as an apnea/hypopnea or no event each using the pneumotachograph, nasal pressure, square root nasal pressure, and respiratory inductance sum signals independently. Agreement was measured using Cohen's kappa statistic. Intermeasurement agreements between the pneumotachograph and nasal pressure, square root nasal pressure, and respiratory inductance plethysmography sum were 0.76, 0.73, and 0.50, respectively. Inter- and intrarater agreements were, respectively, 0.68 and 0.60 for the pneumotachograph, 0.66 and 0.82 for nasal pressure, 0.61 and 0.78 for square root nasal pressure, and 0.47 and 0.76 for respiratory inductance plethysmography sum. These results indicate that nasal pressure has excellent agreement compared with a pneumotachograph and very good inter-/intrarater agreement. Square root transformation of the nasal pressure signal does not improve these levels of agreement, indicating that it is unnecessary in routine clinical practice for scoring apneas/hypopneas.
Key Words: sleep apnea syndromes diagnostic techniques and procedures evaluation studies
The reference standard for identifying and quantifying reductions in airflow during sleep is a pneumotachograph (1). Direct measurement of airflow using a pneumotachograph is seldom used in sleep laboratories because it requires patients to wear a tight-fitting facemask, which is uncomfortable and can disrupt sleep. Therefore, a number of surrogate measures have been developed to identify apneas and hypopneas during sleep. Respiratory inductance plethysmography has been used as a surrogate of tidal volume (2, 3), and thermistors (4) and nasal pressure (2, 5, 6) are often used as surrogates of airflow. Fluctuations in nasal pressure during inspiration and expiration reflect changes in inspiratory and expiratory airflow. Nasal prongs are well tolerated by patients, and the nasal pressure signal has excellent dynamic response (7). These factors make nasal pressure monitoring a promising new technique for detecting breathing events during sleep. However, nasal pressure has not been adequately validated to recommend it as a primary signal for detecting apneas/hypopneas (1). During wakefulness, nasal pressure has been shown to measure minute ventilation with moderate accuracy compared with body plethysmography (2). Nasal pressure has better negative predictive value but poorer positive predictive value than respiratory inductance plethysmography for identifying hypopneas in awake subjects (2). Thurnheer and colleagues have recently shown reasonably close agreement between the apneahypopnea index calculated from a pneumotachograph and that from nasal pressure in 20 sleeping subjects (8). Their study also suggested excellent agreement between these two methods in the apnea/hypopnea scores when analyzed epoch-by-epoch (using 2.7-minute epochs) (8).
The identification and classification of breathing events such as apneas and hypopneas depends on human judgment. Therefore, proper validation of a new measurement technique requires demonstration of good agreement with a reference standard, as well as good inter- and intrarater agreement. This is done best using an event-by-event analysis and Cohen's The relationship between nasal pressure and actual airflow is nonlinear. This may result in an underestimation of airflow and reduce event detection (false-negatives). If patients experience nasal obstruction or start mouth breathing, there may be false-positive identification of hypopneas. The square root of a nasal pressure signal provides a more linear estimate of actual airflow than nasal pressure alone (7, 8). Using an experimental apparatus with a flow generator to produce inspiratory waveforms, it has been shown that the square root of nasal pressure is more accurate than the untransformed nasal pressure signal for quantifying the reduction in flow amplitude during hypopneas and the degree of inspiratory flow limitation (9). Whether this transformation improves its inter- and intrarater agreement for identifying apneas and hypopneas during sleep in the clinical setting is not known. Standard laboratory pressure transducers are expensive and require large amplifiers, which limits their use in both routine laboratory and ambulatory settings. Less expensive and smaller transducers are now available but have not been compared with the laboratory standards. The objectives of the present study were the following: (1) to determine the intermeasurement agreement between the pneumotachograph and nasal pressure, square root nasal pressure, and respiratory inductance plethysmography sum for identifying apneas/hypopneas during sleep; (2) to compare the inter- and intrarater agreement of nasal pressure, square root nasal pressure, and respiratory inductance plethysmography sum with that of the pneumotachograph for identifying apneas/hypopneas during sleep; and (3) to compare a less expensive pressure transducer with a standard laboratory pressure transducer.
Eleven patients underwent overnight diagnostic polysomnography conducted in a standard fashion (10). Subjects wore a nasal mask designed to measure nasal airflow and nasal pressure simultaneously (Figure 1) . Nasal airflow was measured with a pneumotachograph (Fleisch, Lausanne, Switzerland) connected to a differential pressure transducer (Validyne, Northridge, CA) and nasal pressure was measured via nasal cannulae connected to a differential pressure transducer (Validyne). Nasal pressure was referenced to mask pressure. Square root nasal pressure was transformed from the nasal pressure signal, and respiratory inductance plethysmography sum was calculated through summation of chest and abdominal inductance bands (Respitrace; Ambulatory Monitoring, Ardsley, NY). The inductance bands were uncalibrated. To avoid heat and moisture build-up under the mask, cool air was circulated through the mask every 30 minutes for 5-minute cycles via an external port. Pneumotachograph and nasal pressure data were collected only during periods when this airflow was off. All signals were collected using Sandman NT software (Mallinckrodt Inc., St. Louis, MO). Sleep staging was performed according to standard criteria (11).
The study was approved by the University of Calgary Conjoint Medical Research ethics committee, and all participants provided written informed consent.
Protocol
Classification of events.
Comparison of standard laboratory and ambulatory pressure transducers.
Statistics
Eight male and three female subjects aged 3272 were studied. The mean respiratory disturbance index (RDI) for the group was 48.6 (range 9.9143). A total of 1,278 breathing events were identified; 550 of the events identified (50 per patient) were randomly selected for analysis. All 550 events were analyzed using a pneumotachograph, nasal pressure, and respiratory inductance plethysmography sum signals. During two of the studies, the square root transformation could not be performed due to technical difficulties; therefore, only 450 events were analyzed using square root nasal pressure.
Intermeasurement Agreement between a Pneumotachograph and Nasal Pressure, Square Root Nasal Pressure, and Respiratory Inductance Plethysmography Sum
Inter- and Intrarater Agreement The pneumotachograph and nasal pressure had the highest inter-rater agreement, and both were higher, including the 95% confidence interval, than respiratory inductance plethysmography sum. Square root nasal pressure also had good intermeasurement agreement, which was also higher than that of respiratory inductance plethysmography sum, but the 95% confidence intervals overlapped. Square root nasal pressure did not have better intermeasurement agreement than the untransformed nasal pressure signal.
Nasal pressure, square root nasal pressure, and respiratory inductance plethysmography sum all had higher intrarater agreement than the pneumotachograph. However, the intrarater agreement of the pneumotachograph was still good, with a Overall, the pneumotachograph, nasal pressure, and square root nasal pressure had higher reliability than respiratory inductance plethysmography sum. Although respiratory inductance plethysmography sum had good intrarater agreement, it had only moderate inter-rater agreement. Results are summarized in Table 1.
Comparison of Standard Laboratory and Ambulatory Pressure Transducers
The process of detecting apneas and hypopneas during sleep depends on human judgment to recognize changes in breathing amplitude and pattern; differences in scoring events occur both within and between individuals. Because a pneumotachograph measures flow accurately and precisely, it is assumed that human event recognition would be best using this technology (1). However, it is seldom used in sleep laboratories because its use requires patients to wear a facemask, making it cumbersome and disruptive to sleep.
The process of validating a surrogate measurement to be used for breathing event recognition involves demonstrating that it has good agreement with a pneumotachograph and that it has good reliability (both inter- and intrarater) (1). This is done best through an event-by-event analysis using Cohen's Others have attempted to validate nasal pressure by comparing it with a pneumotachograph during sleep (8, 16), but not on an event-by-event basis. Nasal pressure has also been compared with thermistors and respiratory inductance plethysmography (2, 5, 17). Although these studies have often reported high levels of agreement, most have tended to use correlation to measure agreement. Correlation coefficients measure the strength of a relation between two variables, not the agreement between them. Variables that are well correlated do not necessarily agree. Correlation coefficients are not recommended for this purpose because changes in the scale of measurement, although having no effect on correlation, do affect agreement. In addition, if the range over which measurements are taken is wide, the correlation will be greater than if the range of measurements is narrow (15). The American Academy of Sleep Medicine (AASM) Task Force suggested that differentiation of apneas from hypopneas was not necessary in clinical practice because both event types share a common pathophysiology and clinical consequences (1). However, separating apneas and hypopneas may be useful for research studies designed to compare their health consequences or to elucidate potential differences in pathogenic mechanisms. Because our study was designed to evaluate event scoring in clinical practice, we considered apneas and hypopneas a single entity and scored them visually rather than using an automated computer algorithm. The AASM criteria for event recognition are based on reductions in peak signal amplitude. However, other signal characteristics such as the shape of the inspiratory curve, inspiratory time, and rib cage and abdomen asynchrony in the case of respiratory inductance plethysmography can be useful. We focused solely on changes in signal amplitude in the presence or absence of O2 desaturations and arousals because it is common to all the signals used and is recommended by the AASM Task Force (1).
We attempted to select a broad range of event morphologies to be analyzed by studying subjects with a wide range of disease severity. The average RDI of subjects used in our study was 48.6. Although this would be generally considered high, the range of RDI was 9.9143 and the median was 31.4. Three of the 11 patients had an RDI less than 20. During our initial collection of events, we included all events that had a discernable change ( Bias was also limited through our randomization and blinding process. We randomly selected 50 events per subject for analysis out of a total of 1,278 events for all subjects. The order in which these events were analyzed was also determined randomly. It has been our experience that breathing events frequently occur in clusters that are morphologically similar. Had we analyzed all the events from one subject in succession, it is possible that neighboring events might have influenced our event classification. Event scoring using each signal occurred independently and was completely blinded, as were the results obtained by each scorer. One of the limitations of nasal pressure is false-positive detection of apneas/hypopneas due to nasal obstruction or mouth breathing. Reductions in airflow can occur through the nose, mouth, or both. It is possible that false-positive detection of apneas/hypopneas using the pneumotachograph or nasal pressure signals occurred in our study during transitions from nose or nose plus mouth breathing to periods of more predominant mouth breathing. Due to our choice of a nasal mask rather than a full facemask, we are unable to account for this potential problem. We elected to use a nasal mask because we felt that patients would be more likely to sleep, and a full facemask has the potential to alter airway mechanics by placing pressure on the lower jaw. Nevertheless, predominant mouth breathing appears to occur very rarely during sleep, and even when it occurs, there is often sufficient fluctuation in the nasal pressure signal to detect respiratory events (8, 18). Of the 550 events analyzed by Scorer 1, 334 (61%) were identified as true events using the pneumotachograph and 346 (63%) were identified as true events using respiratory inductance plethysmography sum. Although this does not eliminate mouth breathing as a potential confounder in our study, it is at least reassuring that the positive event detection rate using nasal flow was not significantly high compared with respiratory inductance plethysmography sum. Furthermore, during the initial event collection when all four signals were available, we did not identify lengthy periods of dampened nasal flow, which would have indicated possible mouth breathing. Nasal pressure is known to have a nonlinear relationship with nasal airflow. The square root transformation creates a near-linear approximation of actual nasal airflow (7), which could make it more sensitive in identifying events in situations of low flow. However, we have shown in the present study that the square root transformation does not improve agreement with the pneumotachograph and does not improve the degree of inter- or intrarater agreement for the identification of apneas/hypopneas during sleep (Table 1). The square root of nasal pressure has been shown to be superior to untransformed nasal pressure for quantifying the reduction in flow amplitude during hypopneas (9), and it might also be important for measuring ventilation. Nevertheless, we have shown that the transformation does not improve the ability of nasal pressure to detect relative changes in breathing pattern and amplitude, which remains the crucial factor for clinical event identification. Our results are in keeping with those of Thurnheer and colleagues, who showed that despite the fact that the square root transformation of the nasal pressure signal provided a near linear relationship with airflow, it was not essential for estimating the apneahypopnea index (8). There are a number of advantages to the use of nasal pressure over the pneumotachograph and other surrogate measures of flow. Nasal cannulae used to measure nasal pressure are well tolerated by patients, compared with a facemask. Thermistors have long time constants (19) and tend to underestimate changes in nasal flow. In contrast, nasal pressure has excellent dynamic response, which makes the interpretation of each different component within the breathing cycle more precise (7, 20). There is recent evidence to suggest that nasal pressure is capable of detecting airflow limitation (2123). Identification of a plateau on the inspiratory waveform, which correlates with elevated upper airway resistance (23), may be useful in identifying patients with the upper airway resistance syndrome. The inter-rater agreement for identifying respiratory effort-related arousals is very good when compared with esophageal manometry (24). When properly calibrated, the sum of the respiratory inductance plethysmography abdominal and chest signals provides a measure of tidal volume. However, displacement of the inductance bands can alter signal amplitude, resulting in loss of one or both of the chest and abdominal channels. It can also be difficult to obtain good thoracoabdominal signals in obese patients, many of whom have obstructive sleep apnea. Nasal pressure does not have these limitations. It has been recently recommended that a relative reduction of 50% from baseline in the uncalibrated respiratory inductance plethysmography sum can be used to define a hypopnea (1). This is based on the assumption that a 50% reduction in tidal volume is approximately equivalent to a 50% reduction in inspiratory flow. This assumption may not be valid in all circumstances and could partially explain our findings that respiratory inductance plethysmography sum was inferior to nasal pressure when compared with the pneumotachograph. Another possible reason why respiratory inductance plethysmography sum had inferior inter-rater agreement and intermeasurement agreement with the pneumotachograph is that we used an uncalibrated sum signal. This was done because it is the method most often used in clinical practice. In addition, had mouth breathing been a major factor in our study (we believe that it was not), one would have expected reduced agreement between the pneumotachograph and respiratory inductance plethysmography sum. However, this factor should not have altered the expected degree of inter-rater agreement, which was the lowest using the uncalibrated respiratory inductance plethysmography sum signal. Nasal pressure is measured by connecting cannulae to a differential pressure transducer. The most widely used transducers are expensive, which may limit the use of this promising signal in many diagnostic sleep laboratories and in the increasingly popular ambulatory setting. Therefore, we compared a less expensive pressure transducer with a standard laboratory transducer. The standard laboratory transducer used in the present study is approximately 10 times as expensive as the ambulatory device. The transducers were calibrated to ± 3 cm H2O, which is more than the expected range of pressure changes at the nares during quiet breathing. The flow tracings in Figure 2 and the Bland and Altman plots in Figure 3 demonstrate that the two transducers agree very well over a range of physiologic nasal pressures. The Bland and Altman plots suggest that the differences in pressures between the two transducers are greatest near 0 cm H2O and beyond ± 1.5 cm H2O. The relatively large differences around 0 cm H2O occurred because the absolute pressure measurements were very small, so that small oscillations in pressure show as large differences. Poor agreement near 0 cm H2O may indicate that nasal pressure alone would be inadequate for differentiating between apneas and hypopneas, but this is not necessary in routine clinical practice. Poor agreement beyond ± 1.5 cm H2O is also not likely to be important as this falls outside the range of physiologic pressure changes during normal breathing and certainly during periods of reduced breathing. The ambulatory transducer was not used during the sleep studies that constitute the present experiment. However, given the results in Figures 2 and 3, we believe that the two transducers could have been used interchangeably in our study. Following an event-by-event analysis in sleeping subjects using an appropriate statistical method, we conclude that nasal pressure is a valid device for identifying apneas/hypopneas during sleep. In the present study, all sensors were used simultaneously to gather potential breathing events such that an event-by-event analysis on both true and nonevents could be performed. Given the high level of agreement between nasal pressure and its square root with the pneumotachometer, we propose that these signals would be equally useful for detecting apneas/hypopneas from an unscored sleep study. Although the square root transformation improves the linearity of the nasal pressure signal (7), it is not essential for the purpose of clinical event recognition. Nasal pressure is well tolerated by patients, and with the advent of commercially available, portable, and more affordable pressure transducers, it has the potential to be used widely.
The authors thank Amalu Iyer, Michele Ostrowski, and John Laprairie for helping with patient recruitment and data collection. They are also grateful to Drs. W. A. Whitelaw and J. E. Remmers for their suggestions regarding the study. Finally, the authors thank Braebon Medical Corporation for donating two of the Ultima Airflow Pressure Sensors. Supported by grants from the Alberta Lung Association. The ambulatory pressure transducers were donated by Braebon Inc., ON, Canada. Received in original form May 18, 2001; accepted in final form April 9, 2002
This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||