help button home button
AJRCCM
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by WANG, M.-L.
Right arrow Articles by PETSONK, E. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by WANG, M.-L.
Right arrow Articles by PETSONK, E. L.
Am. J. Respir. Crit. Care Med., Volume 162, Number 6, December 2000, 2134-2138

Design Strategies for Longitudinal Spirometry Studies
Study Duration and Measurement Frequency

MEI-LIN WANG, ERDOGAN GUNEL, and EDWARD L. PETSONK

Division of Respiratory Disease Studies, National Institute for Occupational Safety and Health (NIOSH), Morgantown; Section of Pulmonary and Critical Care Medicine, West Virginia University School of Medicine, Morgantown; and Department of Statistics, West Virginia University, Morgantown, West Virginia




    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES

Measuring the longitudinal change in FEV1 is useful for assessing the adverse effects of respiratory exposures and pulmonary diseases. Investigators seek to estimate the "true" mean FEV1 slope (µbeta ) of an infinite population. The difference between the estimated mean FEV1 slope (^µ beta ) and the true mean slope, resulting from biological variation and measurement errors, can be minimized by increasing the number of subjects (N), years of follow-up (D), or the frequency of measurements (P). We defined maximum error emax such that P[|<A><AC>μ</AC><AC>ˇ</AC></A>beta  - µbeta | =< emax] = 0.95, and thus emax is one-half the width of the 95% confidence interval for µbeta . We computed the values of emax on the basis of actual data obtained from 160 coal miners and working nonminers who had completed 11 spirometry measurements, using recommended equipment and procedures, at 6-mo intervals over 5 yr. Individual 5-yr FEV1 slopes (Delta FEV1) were calculated by linear regression. For a range of values of N, D, and P, tables are provided for emax, the magnitude of detectable differences in Delta FEV1 between two groups, and the recommended number of subjects needed in each of two groups to reliably detect the anticipated differences in Delta FEV1. The tables provide unique guidance for investigators in selecting among various study design options.



    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES

Longitudinal change in FEV1 over time (Delta FEV1 or slope ) is a valuable health outcome measure when assessing the adverse effects of exposures or diseases in individuals and groups. However, spirometry results are subject to many sources of measurement noise, making it difficult to detect the effects of interest (the signal) (1). Three sources of variation that affect the accuracy of estimation in FEV1 slope have been studied by previous investigators: variations arising in the measurement process, variations within an individual between occasions, and variability between individuals. With the relatively high variability associated with short-term measurements of Delta FEV1, the detection of important declines in FEV1 is difficult with short periods of observation (2). Up to 8 yr of follow-up has been recommended to appreciate with precision the rates of decline in FEV1, an intimidating prospect for most investigators (3). The absolute difference between estimated mean FEV1 slope (^µ beta ) and the "true" mean slope (µbeta ), expressed by the "error" term (e = | ^µ beta  - µbeta |), can be minimized by increasing the number of subjects (N), the years of follow-up (D), and/or the frequency of measurements (P) (4, 5).

From a longitudinal field spirometry survey, using equipment and procedures recommended by the American Thoracic Society (ATS) (6), we calculated the intra- and intersubject variability, under various available combinations of study duration and measurement frequency. We defined the term maximum error (emax) by setting the probability equal to 95% that the error (e) is less than or equal to emax. That is, P[|^µ beta  - µbeta | =< emax] = 0.95, and accordingly, emax is one-half the width of the 95% confidence interval for µbeta . Using the field test results, we present tables of the maximum error values (emax) for Delta FEV1, the anticipated accuracy in estimation of Delta FEV1, the magnitude of differences in Delta FEV1 between two groups that can be reliably detected, and finally the number of subjects needed in each of the two groups for detecting anticipated differences in Delta FEV1. These results should provide guidance for investigators in designing more efficient longitudinal spirometry studies.


    METHODS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES

Subjects and Spirometry

The subject selection and the spirometry measurements have been detailed elsewhere (7). A cohort of 411 underground coal miners and working nonminers was established in our previous study. Spirometry testing was conducted with an 8-L survey spirometer with an attached microprocessor (Eagle II; Warren E. Collins, Braintree, MA). All testing was performed at the worksite, using the 1979 spirometry standards of the American Thoracic Society (6). Forced exhalation maneuvers were done in the standing position with a nose clip in place. Three to five maneuvers were obtained from each subject. Spirometry was repeated at 6-mo intervals and individual 5-yr FEV1 slopes were calculated by linear regression. In the present study, we used data obtained from 160 subjects who had completed 11 measurements in approximately 5 yr.

Estimating the Maximum Error Values

The absolute difference between the estimated FEV1 mean slope (^µ beta ) and the unknown "true" mean slope (µbeta ) gives the amount of error in the estimation. To evaluate the precision of different study design strategies for repeated measurements of spirometry, we computed the maximum error values by setting the probability equal to 95% that the error is less than or equal to emax (P[|^µ beta  - µbeta | =< emax] = 0.95), and thus emax is one-half the width of the 95% confidence interval (CI). Values for emax were determined for the various combinations of study duration and number of equally spaced spirometry tests that were available in the entire sample of N = 160 subjects.

Among adults, the linear model for individual lung function decline provides a good approximation to the actual shape of the decay curve over a period of 4 to 6 yr (8). Assuming a 5-yr linear relation among 11 measurements, we estimated the variation around the fitted regression line for each individual j (^sigma 2j) and the average variation around the regression line for all study subjects. We also estimated the FEV1 slope for individual j (^beta j), and obtained the average of ^beta j values, giving the average slope ( ^beta ) for N individuals; µbeta = ^beta is the unbiased estimator of µbeta (the mean of an infinite population of individuals).

For all 160 subjects, the individual spirometry tests were numbered 1 through 11, from the first to the last test during the 5 yr of follow-up. We used all combinations of data with equal spacing between tests and at least three measurements. Within the 5-yr study, there are 17 separate combinations of D and P (see Table 2), and a total of 72 possible combinations of equally spaced tests with at least 3 measurements, that is, 6 monthly, yearly, or at the beginning, middle, and end of the study. For example, there are five available combinations of tests for D = 3 yr and P = 3 tests, including tests 1-4-7, 2-5-8, 3-6-9, 4-7-10, and 5-8-11; similarly, there are three different combinations of tests available for D = 4 and P = 9; and so forth. We computed the maximum error values using within- and between-subject variation (^sigma 2 and ^sigma 2beta ) obtained from the full sample of N = 160 individuals, and the corresponding study duration (D) and the number of spirometry measurements (P) for all 72 available combinations of tests. We then grouped the combinations of tests into sets (with same D and P, but different test numbers); and report the average maximum error value emax for each set (see Table 2). For assessing the sample size effect, we computed the emax for sample sizes of N = 30, 50, and 100, using the values of ^sigma 2 and sigma 2beta obtained from the cohort of N = 160 individuals. All the calculations for parameters of ^beta j, sigma 2, ^sigma 2beta , and the maximum error values were conducted by using SAS (Cary, NC) software (9) (see Appendix for a detailed explanation of the formulas used).


                              
View this table:
[in this window]
[in a new window]
 

TABLE 2

WITHIN- AND BETWEEN-SUBJECT VARIATION FOR VARIOUS COMBINATIONS  OF D AND P, OBTAINED FROM N = 160 SUBJECTS

Estimating the Detectable Difference in Delta FEV1

Suppose we want to detect a difference of Delta  units between the average rates of change in two groups. Using Schlesselman's formula 14 (5), {^sigma 2beta  + 12(P - 1)^sigma 2/[D2P(P + 1)]}/N = Delta 2/[2(Zalpha  + Zbeta )2], we are able to estimate the magnitude of Delta  units under the following assumptions. Suppose we want to detect the difference at the significant level of alpha  = 0.05 (Zalpha = 1.64), using a one-sided test of significance, and beta  = 0.20 (80% power, Zbeta = 0.84). We calculated the anticipated detectable differences in Delta FEV1 by substituting the values of P and D, and the corresponding values of ^sigma 2 and ^sigma 2beta presented in Table 2, for a fixed sample size of N = 30, 50, 100, and 160 for each group, respectively. (For a two-sided test of significance, Zalpha should be replaced by Zalpha /2, and Zalpha /2 = 1.96.)

Estimating the Number of Subjects Needed

On the basis of the above formula (Schlesselman's formula 14 [5]), we calculated the anticipated number of subjects requested in each group to detect a difference in average Delta FEV1 between two groups for any fixed Delta  units. The sample size estimation was made by using the corresponding values of ^sigma 2 and ^sigma 2beta obtained from the full sample of N = 160 for various available values of D and P, at alpha  = 0.05 and 80% power, using a one-sided test of significance.


    RESULTS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES

Table 1 gives the characteristics for the original study cohort as a whole by the number of spirometry tests completed. In the present analysis, we included 160 participants who had completed all 11 spirometry measurements. The average age at baseline was about 39 yr (ranged from 25 to 65 yr); 24% were current smokers and 44% were nonsmokers at the initial survey. The mean FEV1 at baseline was 4.18 L, and 3.85 L at the last survey. A greater proportion of miners than nonminers participated in all 11 spirometry tests (59 versus 42%, p < 0.001). Otherwise, there were no significant differences in the baseline characteristics of those who were included in the study group (N = 160) versus those who were excluded from the current analyses (N = 251).

We calculated the within- and between-subject variations (^sigma 2 and ^sigma 2beta ), and the maximum error values for all 72 available combinations of tests with study duration (D) varied from 1 to 5 yr and the number of measurements (P) from 3 to 11; and then grouped the results into 17 sets with the same D and P. Table 2 presents means of ^sigma 2 and ^sigma 2beta , by various available values of D and P, for N = 160. Table 3 summarizes the maximum error values for the 17 available sets of P and D, when N = 30, 50, 100, and 160 subjects. The results indicated that the longer the study durations and the larger the sample sizes, the smaller the maximum errors. However, within the same study durations, testing every 6 or 12 mo resulted in similar maximum error values. When designing longitudinal spirometry studies, several combinations of D, P, and N are available that can achieve a desired value of maximum error. For example, to design a study in which the error in the estimation of FEV1 slope is intended to be less than or equal to 11.7 ml/yr (i.e., the length of the CI = 23.4 ml/yr), the investigator could choose 16 options (as highlighted) from Table 3, including between N = 100, D = 5, P = 11 and N = 160, D = 3, P = 7. 


                              
View this table:
[in this window]
[in a new window]
 

TABLE 3

MAXIMUM ERROR VALUES FOR ESTIMATING FEV1 SLOPE* FOR FIXED SAMPLE SIZES OF N = 30, 50, 100, AND 160 BY 17 AVAILABLE SETS OF D AND P

Table 4 illustrates the size of the anticipated difference in average FEV1 slope between two groups to be detected at alpha  = 0.05 and 80% power level when sample sizes are N = 30, 50, 100, and 160 for each of the two groups. For example, if the anticipated difference in Delta FEV1 between two groups is 25-30 ml/yr, then the study design would have nine options (as highlighted) from Table 4, including between N = 30, D = 5, P = 11 and N = 160, D = 2.5, P = 6. Again, with a longer study duration and a larger sample size, a smaller difference can be detected. However, with the same study duration and sample size, as the testing frequency increases the changes in detectable Delta FEV1 are relatively small. For longer follow-up duration and larger sample size, annual measurements are unnecessarily frequent.


                              
View this table:
[in this window]
[in a new window]
 

TABLE 4

DIFFERENCES IN AVERAGE FEV1 SLOPE BETWEEN TWO GROUPS DETECTED AT alpha  = 0.05 AND 80% POWER BY VARIOUS COMBINATIONS OF D AND P, FOR A FIXED SAMPLE SIZE OF N = 30, 50, 100, AND 160 IN EACH OF TWO GROUPS

Table 5 provides estimates of the number of subjects needed in each of two groups to detect a fixed difference in Delta FEV1 between the groups at a significance level of 0.05 with 80% power. For example, a study is planned for lung function in two groups of employees: those exposed to a chemical dust and those not exposed, in which FEV1 will be measured every 6 mo for 3 yr. If the exposure is estimated to result in a mean difference in FEV1 slope of at least 30 ml/yr between the exposed group and the unexposed group, Table 5 suggests that, to detect this effect, the sample size should be about 76 subjects in each group, as highlighted.


                              
View this table:
[in this window]
[in a new window]
 

TABLE 5

ESTIMATED NUMBER OF SUBJECTS NEEDED IN EACH OF TWO GROUPS TO DETECT A DIFFERENCE BETWEEN GROUPS IN Delta FEV1 AT alpha  = 0.05 AND 80% POWER, USING A ONE-SIDED TEST OF SIGNIFICANCE*


    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES

The rate of change in spirometric lung function is widely used as an outcome variable in investigating the possible hazard of specific exposures. The usefulness of FEV1 measurements in assessing the adverse effects of environmental exposures or disease processes is critically dependent on the intrinsic variability in the measurement. Many sources of variation can be controlled by assuring adequate training of personnel administering the tests, by selecting equipment that has been tested by accepted procedures, and by adhering to recommended test methodologies and procedures (10, 11).

After these sources of measurement variation have been controlled, the "error" term in estimating group mean FEV1 slopes can be minimized by increasing the number of subjects (N), the duration of follow-up (D), and/or the frequency of measurements (P). Because longitudinal studies are labor intensive and time consuming, investigators aim to select efficient design strategies that can reliably detect the anticipated effects.

Statistical techniques have previously been suggested in planning and in evaluating longitudinal study design choices, such as frequency of measurement and study duration. In 1974, Berry estimated the standard deviation of the FEV1 between occasions (within-subject variation) to be 0.12 L, based on several previous studies. By substituting this single value of 0.12 L as the between-occasion standard deviation into the formula given by Schlesselman (5) [Eq. (6)], Berry produced a table of the standard errors (L/yr) of the FEV1 slope for study durations ranging from 1 to 10 yr and intervals between tests of 1, 3, 6, and 12 mo (4). The results provided information on trends for precision in the estimation of Delta FEV1, and suggested that for shorter follow-up periods, the estimation error predominates, whereas for longer periods of follow-up it is negligible.

Our study indicates similar conclusions: that longer study durations and larger sample sizes give smaller maximum errors; whereas within the same study durations, testing every 6 or 12 mo results in similar emax values. However, in the current study, based on actual longitudinal spirometry tests performed with ATS-recommended equipment and procedures, we were able to calculate the within-subject and between-subject variation, ^sigma 2 and ^sigma 2beta , corresponding to a range of specific combinations of study duration and measurement frequency. Tables are reported for the maximum error (emax), a single indicator for assessing the precision of estimation for FEV1 slope. Emax values integrate the effects of D, P, N, and inter- and intrasubject variability, and offer unique guidance for investigators in selecting among various study designs. Tables are also presented for predicting the magnitude of differences in Delta FEV1 between two groups that can be reliably detected, and for estimating the number of subjects needed in each of two groups for reliably detecting anticipated differences in Delta FEV1.

This study and the study by Berry (4) provide tables recommending the number of subjects required in each of two groups for a longitudinal study of FEV1; however, the results are quite different. For example, Table VI in Berry (4) indicates that a 2-yr study with three test points would require 153 subjects in each of two groups to have sufficient power to detect a 30-ml/yr difference, whereas our study predicts the need for 229 subjects, a 50% increase. For a 5-yr study, our results suggest the need for 27 subjects, whereas Berry indicated 37 were needed. In contrast, for a 4-yr study, results from this study and those of Berry are similar (requiring 42 versus 45 subjects for 6-monthly measurements, and 54 versus 53 subjects for yearly testing). One of the reasons for the differences might be related to the values of within- and between-subject variation being used in the estimation. Berry used single values of ^sigma and ^ sigma  beta  for different combinations of study duration and measurement frequency, rather than values from actual test results, as in the current study. Schlesselman commented that actual longitudinal data were needed to use these statistical techniques, and avoids speculation about the size of ^sigma 2beta (5).

Certain cautions should be observed in using the tables to evaluate various study designs, because results from studies that use different methods or populations may not be fully comparable. First, the linear function for individual lung function decline provides a good approximation to the actual shape of the FEV1 decay curve for periods up to 4 to 6 yr. The adequacy of this approximation will decrease as the length of follow-up increases, but should still remain good for adults under 60 yr of age (8). For longer studies, basing statistical analyses solely on linear trends should be satisfactory as a first approximation, but may be insufficient for a more refined analysis, in which a second degree polynomial or other, more complicated curves may be appropriate. Second, an increase in within-subject variability in FEV1 has been observed among patients, compared with the general population (12). This might have implications for longitudinal study errors, particularly among current and former smokers, which represented about 55% of our study population. Among a currently working population, some investigators have found that smoking status has little effect on the standard deviation of FEV1 slopes (13). However, the applicability of our results to studies of populations with a high proportion of respiratory illness is not clear. Finally, in our study, testing was done throughout the 5-yr period according to ATS recommendations, by trained and motivated technicians, with ongoing quality assurance, and using an accurate volume spirometer. The results will be most suitable for studies using similar methods.

In summary, the results of this investigation provide guidance to researchers in selecting for longitudinal field spirometry studies a practical and efficient design that will result in acceptable values of maximum error and adequate power to detect anticipated differences in FEV1 slopes between two groups. The tables should be most useful for studies involving other middle-aged (25 to 65 yr), white, male working populations, with a similar proportion of smokers. It is anticipated that data from other longitudinal spirometry studies will be analyzed in a similar fashion, in order to provide data relevant to other populations.


                              
View this table:
[in this window]
[in a new window]
 

TABLE 1

CHARACTERISTICS AT INITIAL AND FINAL SURVEY FOR THE STUDY POPULATION AND BY NUMBER OF SPIROMETRY TESTS COMPLETED*


    Footnotes

Correspondence and requests for reprints should be addressed to Mei-Lin Wang, MD, NIOSH, 1095 Willowdale Road, Mail Stop H-2800, Morgantown, WV 26505. E-mail: mlw4{at}cdc.gov

(Received in original form March 31, 2000 and in revised form July 24, 2000).

Acknowledgments: The authors are grateful for the thoughtful comments on the manuscript provided by Drs. Paul Enright and Duane L. Sherrill (University of Arizona, Tucson, AZ), and by Drs. Dan S. Sharp and Eva Hnizdo, and by Ms. Patricia Schleiff (National Institute for Occupational Safety and Health).

Supported by the National Institute for Occupational Safety and Health.


    References
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
REFERENCES

1. Buist AS, Vollmer WM. The use of lung function tests in identifying factors that affect lung growth and aging. Stat Med 1988; 7: 11-18 [Medline].

2. Hankingson J, Wagner GR. Medical screening using periodic spirometry for detection of chronic lung disease. Occup Med State Art Rev 1993; 8: 353-361 .

3. Clement J, Van De Woestijne KP. Rapidly decreasing forced expiratory volume in one second or vital capacity and development of chronic airflow obstruction. Am Rev Respir Dis 1982; 125: 553-558 [Medline].

4. Berry G. Longitudinal observations: their usefulness and limitations with special reference to the forced expiratory volume. Bull Physio-path Respir 1974; 10: 643-655 .

5. Schlesselman JJ. Planning a longitudinal study: II. Frequency of measurement and study duration. J Chron Dis 1973; 26: 561-570 [Medline].

6. American Thoracic Society. Standardization of spirometry. Am Rev Respir Dis 1979;119:4-11.

7. Hodgins P, Henneberger PK, Wang M-L, Petsonk EL. Bronchial responsiveness and five-year FEV1 decline: a study in miners and nonminers. Am J Respir Crit Care Med 1998; 157: 1390-1396 [Abstract/Free Full Text].

8. Vollmer WM, Johnson LR, McCamant LE, Buist AS. Methodologic issues in the analysis of lung function data. J Chron Dis 1987; 40: 1013-1023 [Medline].

9. SAS Institute. SAS/STAT user's guide: version 6, 3rd ed. Cary, NC: SAS Institute, Inc.; 1990. p. 818-875.

10. American Thoracic Society. Standardization of spirometry: 1987 update. ATS statement. Am Rev Respir Dis 1987;136:1285-1298.

11. American Thoracic Society. ATS statement: standardization of spirometry---1994 update. Am J Respir Crit Care Med 1995;152:1107-1136.

12. Pennock BE, Rogers RM, McCaffree DR. Changes in measured spirometric indices: what is significant? Chest 1981; 80: 97 [Free Full Text].

13. Wang ML, McCabe L, Petsonk EL, Hankinson JL, Banks DE. Weight gain and longitudinal changes in lung function in steel workers. Chest 1997; 111: 1526-1532 [Abstract/Free Full Text].
    APPENDIX

Suppose the distribution of slope corresponding to an infinite population of individuals is modeled by a probability density function f(beta ). The mean and variance of this distribution, respectively, are
<AR><R><C>μ<SUB>β</SUB>=<LIM><OP>∫</OP></LIM>β <IT>f</IT>(β) <IT>d</IT>β</C></R><R><C>σ<SUB>β</SUB><SUP>2</SUP>=<LIM><OP>∫</OP></LIM>(β−μ<SUB>β</SUB>)<SUP>2</SUP> <IT>f</IT>(β) <IT>d</IT>β</C></R></AR>

The unbiased estimator of µbeta is mean of the estimated slopes for the sample of N individuals. For a given group of N individuals, the individual slopes beta 1,beta 2, . . . ,beta N can be considered as a random sample of size N from the population of beta  values. Let ^beta j denote the least-squares estimate of beta j, the true slope of the individual j. The estimate of µbeta is
<AR><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A></C></R><R><C><A><AC>β</AC><AC>ˆ</AC></A></C></R></AR>=<FR><NU><LIM><OP>∑</OP><LL>j=1</LL><UL>N</UL></LIM><A><AC>β</AC><AC>ˆ</AC></A><SUB>j</SUB></NU><DE>N</DE></FR>

and the estimate of the variance sigma beta 2 is
<A><AC>σ</AC><AC>ˆ</AC></A><SUB>β</SUB><SUP>2</SUP>=<FR><NU><LIM><OP>∑</OP><LL>j=1</LL><UL>N</UL></LIM>(β<SUB>j</SUB>−<AR><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A></C></R><R><C><A><AC>β</AC><AC>ˆ</AC></A>)</C></R></AR><SUP>2</SUP></NU><DE>N</DE></FR>

The mean of ^beta (the expected value of ^beta ) is
<AR><R><C>E<A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A><A><AC>[β]</AC><AC>ˆ</AC></A>=<FR><NU>1</NU><DE>N</DE></FR><LIM><OP>∑</OP><LL>j=1</LL><UL>N</UL></LIM>E(<A><AC>β</AC><AC>ˆ</AC></A><SUB>j</SUB>)=<FR><NU>1</NU><DE>N</DE></FR><LIM><OP>∑</OP><LL>j=1</LL><UL>N</UL></LIM>E[E(<A><AC>β</AC><AC>ˆ</AC></A><SUB>j</SUB>‖  β<SUB>j</SUB>)]</C></R><R><C>=<FR><NU>1</NU><DE>N</DE></FR><LIM><OP>∑</OP><LL>j=1</LL><UL>N</UL></LIM>E[β<SUB>j</SUB>]=<FR><NU>1</NU><DE>N</DE></FR><LIM><OP>∑</OP><LL>j=1</LL><UL>N</UL></LIM>μ<SUB>β<SUB>j</SUB></SUB>=μ<SUB>β</SUB></C></R></AR>

where E denotes expectation. That is, ^beta is an unbiased estimator of µbeta .

Similarly, the variance of ^beta is V[ ^beta ] = (1/N2) <LIM><OP>∑</OP><LL>j=1</LL><UL>N</UL></LIM>V(<A><AC>β</AC><AC>ˆ</AC></A><SUB>j</SUB>) .

Now,
V[<A><AC>β</AC><AC>ˆ</AC></A><SUB>j</SUB>]=E[V(<A><AC>β</AC><AC>ˆ</AC></A><SUB>j</SUB>‖  β<SUB>j</SUB>)]+V[E(<A><AC>β</AC><AC>ˆ</AC></A><SUB>j</SUB>‖  β<SUB>j</SUB>)]=E<FENCE><FR><NU>σ<SUP>2</SUP></NU><DE>S<SUB>tt</SUB></DE></FR></FENCE>+V[β<SUB>j</SUB>]=<FR><NU>σ<SUP>2</SUP></NU><DE>S<SUB>tt</SUB></DE></FR>+σ<SUB>β</SUB><SUP>2</SUP>

where
<AR><R><C>S<SUB>tt</SUB>=<LIM><OP>∑</OP><LL>j=1</LL><UL>P</UL></LIM>(t<SUB>i</SUB>−<A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A><SUP>t)</SUP>=<FR><NU>D<SUP>2</SUP>P(P+1)</NU><DE>12(P−1)</DE></FR></C></R><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A>=t=<FR><NU><LIM><OP>∑</OP><LL>i=1</LL><UL>P</UL></LIM>t<SUB>i</SUB></NU><DE>P</DE></FR></C></R></AR>

in where t1, . . . ,tp are the times at which the tests were performed, and sigma 2 is the variation about the regression line for the jth individual, and it is assumed to be same for all individuals. Thus,
V<FENCE><AR><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A></C></R><R><C><A><AC>β</AC><AC>ˆ</AC></A></C></R></AR></FENCE>=<FR><NU>1</NU><DE>N</DE></FR><FENCE>σ<SUB>β</SUB><SUP>2</SUP>+<FR><NU>σ<SUP>2</SUP></NU><DE>S<SUB>tt</SUB></DE></FR></FENCE>

Then, using formula 12 of Schlesselman (5), the estimated standard error of the sample mean slope of FEV1 is
SE<AR><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A></C></R><R><C><A><AC>(β)</AC><AC>ˆ</AC></A></C></R></AR>=<FR><NU><FENCE><A><AC>σ</AC><AC>ˆ</AC></A><SUB>β</SUB><SUP>2</SUP>+<FR><NU><A><AC>σ</AC><AC>ˆ</AC></A><SUP>2</SUP>12(P−1)</NU><DE>D<SUP>2</SUP>P(P+1)</DE></FR></FENCE><SUP>1/2</SUP></NU><DE>(N)<SUP>1/2</SUP></DE></FR>

where
<A><AC>σ</AC><AC>ˆ</AC></A><SUP>2</SUP>=<FENCE><LIM><OP>∑</OP><LL><AR><R><C>j=1</C></R><R><C></C></R></AR></LL><UL>N</UL></LIM><A><AC>σ</AC><AC>ˆ</AC></A><SUB>j</SUB><SUP>2</SUP></FENCE>/N,

in which ^sigma 2j is the mean square error (MSE) of the simple line ar regression model for individual j.

For large N (N >=  30),
<FR><NU><AR><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A></C></R><R><C><A><AC>β</AC><AC>ˆ</AC></A></C></R></AR>−μ<SUB>β</SUB></NU><DE>SE<AR><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A></C></R><R><C><A><AC>(β)</AC><AC>ˆ</AC></A></C></R></AR></DE></FR>

is approximately normally distributed with mean zero and variance one. Then,
P[|<AR><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A></C></R><R><C><A><AC>β</AC><AC>ˆ</AC></A></C></R></AR>−μ<SUB>β</SUB>|≤Z<SUB>0.025</SUB>SE<AR><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A></C></R><R><C>(<A><AC>β</AC><AC>ˆ</AC></A>)]</C></R></AR>=0.95

That is, if we estimate µbeta with ^beta then the maximum error in our estimate is
e<SUB>max</SUB>=Z<SUB>0.025</SUB>SE<AR><R><C><A><AC><A><AC>β</AC><AC>ˆ</AC></A></AC><AC>¯</AC></A></C></R><R><C><A><AC>(β)</AC><AC>ˆ</AC></A></C></R></AR>

The maximum error value is exactly half the width of the confidence interval for µbeta .





This article has been cited by other articles:


Home page
ChestHome page
M. L. Wang, B. H. Avashia, and E. L. Petsonk
Interpreting Periodic Lung Function Tests in Individuals: The Relationship Between 1- to 5-Year and Long-term FEV1 Changes.
Chest, August 1, 2006; 130(2): 493 - 499.
[Abstract] [Full Text] [PDF]


Home page
Occup. Environ. Med.Home page
E Hnizdo, L Yu, L Freyder, M Attfield, J Lefante, and H W Glindmeyer
The precision of longitudinal lung function measurements: monitoring and interpretation
Occup. Environ. Med., October 1, 2005; 62(10): 695 - 701.
[Abstract] [Full Text] [PDF]


Home page
Eur Respir JHome page
A. J. Mehta, P. K. Henneberger, K. Toren, and A-C. Olin
Airflow limitation and changes in pulmonary function among bleachery workers
Eur. Respir. J., July 1, 2005; 26(1): 133 - 139.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Respir. Crit. Care Med.Home page
M. J. TOBIN
Sleep-disordered Breathing, Control of Breathing, Respiratory Muscles, Pulmonary Function Testing, Nitric Oxide, and Bronchoscopy in AJRCCM 2000
Am. J. Respir. Crit. Care Med., October 15, 2001; 164(8): 1362 - 1375.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by WANG, M.-L.
Right arrow Articles by PETSONK, E. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by WANG, M.-L.
Right arrow Articles by PETSONK, E. L.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Proc. Am. Thorac. Soc. Am. J. Respir. Cell Mol. Biol.
Copyright © 2000 American Thoracic Society