Published ahead of print on September 22, 2005, doi:10.1164/rccm.200502-274OC
© 2005 American Thoracic Society doi: 10.1164/rccm.200502-274OC
Proteomic Patterns of Preinvasive Bronchial LesionsDivision of Allergy, Pulmonary, and Critical Care Medicine, Department of Medicine; Departments of Biostatistics and Pathology; Mass Spectrometry Research Center; Division of Hematology and Oncology, Departments of Surgery and Cancer Biology, Vanderbilt-Ingram Comprehensive Cancer Center, Vanderbilt University School of Medicine; Veterans Affairs Medical Center, Nashville, Tennessee; and Department of Medicine and Pathology, University of Colorado Health Science Center, Denver, Colorado Correspondence and requests for reprints should be addressed to Pierre P. Massion, M.D., Division of Allergy, Pulmonary and Critical Care Medicine, Vanderbilt-Ingram Comprehensive Cancer Center, 2220 Pierce Avenue, 640 Preston Research Building, Nashville, TN 37232-6838. E-mail: pierre.massion{at}vanderbilt.edu
Purpose: A proteomics approach is warranted to further elucidate the molecular steps involved in lung tumor development. We asked whether we could classify preinvasive lesions of airway epithelium according to their proteomic profile. Experimental Design: We obtained matrix-assisted laser desorption/ionization time-of-flight mass spectrometry profiles from 10-µm sections of fresh-frozen tissue samples: 25 normal lung, 29 normal bronchial epithelium, and 20 preinvasive and 36 invasive lung tumor tissue samples from 53 patients. Proteomic profiles were calibrated, binned, and normalized before analysis. We performed class comparison, class prediction, and supervised hierarchic cluster analysis. We tested a set of discriminatory features obtained in a previously published dataset to classify this independent set of normal, preinvasive, and invasive lung tissues. Results: We found a specific proteomic profile that allows an overall predictive accuracy of over 90% of normal, preinvasive, and invasive lung tissues. The proteomic profiles of these tissues were distinct from each other within a disease continuum. We trained our prediction model in a previously published dataset and tested it in a new blinded test set to reach an overall 74% accuracy in classifying tumors from normal tissues. Conclusions: We found specific patterns of protein expression of the airway epithelium that accurately classify bronchial and alveolar tissue with normal histology from preinvasive bronchial lesions and from invasive lung cancer. Although further study is needed to validate this approach and to identify biomarkers of tumor development, this is a first step toward a new proteomic characterization of the human model of lung cancer tumorigenesis.
Key Words: early detection mass spectrometry preneoplasia profiling The discovery of preinvasive lesions in the high-risk population places patients at increased risk of developing lung cancer (13). Although the natural history of these preinvasive lesions is poorly understood, increasing evidence suggests that, in the absence of treatment, high-grade preinvasive lesions, most commonly found in patients with a prior history or with concomitant cancer, will develop into invasive carcinoma in 30 to 50% of the cases, whereas the vast majority of low-grade preinvasive lesions remain stable or regress on follow-up (47). In addition, nonstepwise progression of preinvasive lesions makes prediction of lung cancer progression based on histologic grade unreliable. Therefore, the need for molecular biomarkers predictive of tumor progression is becoming more evident (810). In a recent report, we used conventional matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI MS) to generate protein profiles that accurately classify and predict histologic subgroups of lung cancer (11). Using biostatistical methods to select differentially expressed peaks (MS signals) and after the development of a class prediction model (12), 82 discriminatory signals were found to classify normal lung from lung cancer tissue samples in a derivation and validation study design with excellent accuracy. Other recent studies indicated the importance of using protein expression profiles as a diagnostic or prognostic biomarker for patients with early-stage lung cancer using two-dimensional polyacrylamide gel electrophoresis analysis (13, 14), protein arrays (15), or surface-enhanced laser desorption/ionization MS (16). Although new strategies are needed for the early detection of lung cancer, clinical proteomics applied directly to the airway epithelium may provide a novel approach for the selection of biomarkers. Our primary goal is to establish an MALDI MSbased proteomics approach to obtain specific patterns of protein expression of the airway epithelium at different stages of tumor development. We hypothesize that airway epithelium (alveolar and bronchial) modify its proteomic expression profile during tumor development. In this first report, we demonstrated that proteomic profiles obtained by MALDI MS allow the classification of normal alveolar or normal bronchial epithelium, preinvasive lesions, and invasive lung tumors and may provide new insights to the biology of these preinvasive lesions of unclear clinical significance. Some of the results of these studies have been previously reported in abstract form (17).
Patients and Tissue Samples A total of 53 patients from Vanderbilt University Medical Center (VUMC) participated in this study between January 2002 and November 2003. A total of 110 samples were obtained from bronchoscopic or surgical procedures. Eighteen of 29 normal bronchial epithelium samples and 18 of 20 preinvasive tissues were obtained at the time of bronchoscopy, and all 25 normal alveolar and all 36 lung tumor tissues were obtained at the time of surgery. Normal alveolar specimens were obtained from at least 2 cm away from the tumor. Endobronchial biopsies were obtained by white-light bronchoscopy and laser-induced fluorescence endoscopy from VUMC and University of Colorado Health Science Center (UCHSC) between May 2002 and December 2003. Biopsies were obtained from patients with lung cancer or at risk of lung cancer from predetermined as well as suspicious sites. Of the 10 patients without concomitant lung cancer, five were recruited from the UCHSC lung cancer cohort (with sputum atypia), one patient was a donor of bronchial epithelium as a motor vehicle accident victim, and four patients were recruited from VUMC on the basis of clinical suspicion of lung cancer. Squamous metaplasia and mild dysplasia were grouped as low-grade preinvasive lesion, whereas moderate dysplasia, severe dysplasia, and carcinoma in situ were grouped as high-grade preinvasive lesion. Bronchial epithelial lesions were graded according to the World Health Organization nomenclature (18). The project was approved by the local institutional review board, and informed consent was obtained for all individuals.
Preparation of Tissue Samples
Proteomic Analysis
Alignment, Binning, and Normalization of the MS Profiles Alignment of MALDI MS peaks across multiple spectra was accomplished by use of a computing algorithm that simultaneously determines an optimal set of " bins" for categorizing each peak as a specific protein (24). Optimal bins are identified as those that maximize the number of single peaks within each bin across the spectra while minimizing the number of bins that have multiple peaks within a spectrum. We have demonstrated that this is an effective approach in a recent proteomics study of lung cancer (11). Spectra were normalized to each other according to the maximum sum of total ion current before binning. The maximum sum was divided by the sum of each spectrum to obtain the normalization factor for each spectrum. This factor was then used to multiply each data point for each spectrum.
Statistical Analysis
Study Population To obtain specific patterns of protein expression of the airway epithelium at different stages of tumor development, we obtained MALDI MS profiles on 110 tissue samples from a total of 53 individuals, 43 of whom were patients with concomitant lung cancer. The patients' characteristics are presented in Table 1. The nature of the samples obtained in patients with specific tumor histology is provided in Table 2. Because of the heterogeneous nature of lung cancer and because the nature of the cell type of origin of these tumors remains a subject of controversy, both alveolar epithelium and bronchial epithelium were included as controls for lung tumors examined. Some patients provided more than one type of tissue sample as shown in Table 3. These samples included 25 histologically normal lung (alveolar space), 29 histologically normal bronchial epithelium, 20 preinvasive tumor tissue samples (13 low-grade and 7 high-grade), and 36 invasive tumor tissue samples.
Preinvasive Lesions Differ from Normal Airway Epithelium and from Invasive Lung Cancer For each histologic subgroup, we obtained a proteomic profile that distinguishes from other lesions with predictive accuracy between 83 and 100%. Results are summarized in Table 4. Examples of such profiles are provided in Figure 1. Classification and misclassification rates were calculated by use of leave-one-out cross-validation class prediction method based on our covariate method of analysis. To avoid the possibility of overfitting, we reported only one set of "winner" features (based on the predetermination cut-off of the p values), which was applied to the prediction model. Normal bronchial epithelium proteomic profiles clearly differ from those of invasive cancer and all other classes. Preinvasive lesions provided different proteomic patterns from normal bronchial epithelium or invasive cancer. Importantly, and despite our relatively small sample size, we found patterns characteristic of either low- or high-grade lesions based on the different expression of 14 features (Figure 2A). We performed the same comparisons using agglomerative hierarchic cluster analysis with very similar results. For example, hierarchic cluster analysis of high-grade preinvasive lesion versus invasive tumor is shown in Figure 2B. The potential issue of multiple comparisons in high-dimensional data analysis is adjusted by the FDR (see STATISTICAL ANALYSIS).
Distinguishing Normal from Preinvasive Lesions from Cancer within a Continuum In an effort to distinguish proteomic profiles of normal, preinvasive, and invasive tumors in a continuum, we analyzed the profiles obtained from normal bronchial epithelium, preinvasive bronchial epithelium, and squamous carcinomas of the lung. Results of supervised clustering analysis (Figure 3A) and of multidimensional scaling analysis of these three groups (Figure 3B) show distinct clustering of the groups in a continuum. Biomarkers found in preinvasive and in invasive lesions but not in normal airway epithelium of subjects with or at risk for lung cancer would be of particular interest in the development of markers for early detection. We searched for such features in our dataset and, although some specific features are completely absent from normal bronchial tissues, they were found in 0 to 15% of low-grade preinvasive lesions, in 14 to 43% of high-grade lesions, and in 6 to 24% of the squamous invasive tumor tissues; none of these candidate biomarkers reached statistical significance set at p < 0.0005.
Proteomic Profiles of Preinvasive Lesions Resemble Those of Normal Epithelium After training our prediction model to detect features that discriminate between normal bronchial epithelium and invasive tumor, we asked how the model would classify low- and high-grade lesions. Proteomic profiles obtained from 12 of 13 low-grade and all high-grade preinvasive lesions were classified by 54 discriminatory features as normal airway epithelium.
Validation of Prediction Model in Normal and Invasive Lung Cancer
We report for the first time proteomic expression profiles specific to different stages of lung tumor development. We detected MALDI MS signals that were discriminatory and predictive of alveolar and bronchial epithelium, low- and high-grade preinvasive lesions, and invasive lung tumors with overall 90% accuracy. When proteomic profiles of preinvasive lesions were tested using patterns that distinguish between normal airway and invasive tumors obtained in a training set, the majority of preinvasive lesions were classified as normal-appearing epithelium. We also demonstrated in a training and test paradigm that a set of discriminatory features obtained in an independent, previously published set of normal and tumor samples correctly classified 74% of our samples as tumor or normal epithelium. Although the risk of developing lung cancer increases with the presence of preinvasive lesions, the molecular determinants predicting the irreversible progression to lung cancer have not been identified. Previous studies have addressed the genetic basis for classification of preinvasive lesions in a model of tumor development (1). Yet, none of these biomarkers are predictive of progression to invasive cancer. The proteomic analysis of tumor development might provide a new understanding of the pathologic states a given tumor may undergo before acquiring its invasive phenotype. Our previous study, which defined profiles that allow classification of lung tumors from normal lung and from tumors associated with lymph node involvement (11), clearly showed the strength of tissue-based proteomics as an analytic tool. This study provides evidence for a specific phenotype of the airway epithelium as it progresses from normal to preinvasive and to invasive cancer. The findings of the supervised cluster and MDS analyses, although they do not prove true progression, suggest a continuum between proteomic patterns of lesions as they progress toward invasive lesions. Yet, we did not find significant MALDI MS features robust enough to pass our conservative statistical approach that would be present in preinvasive and invasive lesions but not in normal bronchial epithelium. Several considerations may explain this lack of specific markers of tumor development. The study design (cross-sectional as opposed to longitudinal), the sample size, and the analysis of only a fraction of the proteome offered by the MALDI MS approach need to be taken into consideration. Biological variability may also prevent us from finding markers of tumor development because only a small percentage of preinvasive lesions are known to transform into cancer (7). The samples examined may not have harbored these features. A prospective study following the natural history of a large number of these lesions may answer this question. This report is our first step toward the selection of proteomic profiles predictive of lung cancer development. We found that this tissue-based proteomics approach allows us not only to recapitulate the classification of preinvasive lesions on the basis of histologic grade but to provide biological information about these lesions. Indeed, when testing proteomic predictors of lung cancer versus normal tissue to classify profiles from preinvasive lesions (as being closer to normal or cancer), both low- and high-grade lesions clustered with normal epithelium. These findings suggest that only a subset of preinvasive lesions (yet to be identified in a prospective study) may in fact progress to an invasive phenotype, and histology by itself may not be the ideal surrogate marker of tumor progression. This observation is in agreement with higher regression and low progression of preinvasive lesions shown in earlier studies (7, 31, 32). With the current and independent set of tissues, we were able to reproduce similar results to those obtained in our previous report (11). Despite the inclusion of normal bronchial samples in the test set that were not represented in the training set (i.e., those of a separate operator [S.M.J.R.]) and of differences in the data processing (see METHODS), we were able to correctly classify 74% of the samples as normal epithelium or as invasive tumor. These results further validate our proteomic strategy. In our previous report (12), only alveolar tissues were representative of the control group. In the current study, because 18 of 20 preinvasive tissues were obtained from bronchial biopsies, we included a series of bronchial tissue as controls in addition to alveolar tissue. The difference between profiles of normal alveolar epithelium and bronchial epithelium is interesting and important for future research related to cellular origin of squamous carcinoma and adenocarcinoma. Limitations to this study include considerations about patient selection, methodology, sample size, and protein identification. First, most of our preinvasive lesions and our normal-appearing epithelium were obtained from patients with concomitant lung cancer. The natural history of these preinvasive lesions may be different from those of patients without concomitant lung cancer. Also, we lack a true biological "normal" epithelium for reference because the majority of our patients were known to have lung cancer and were current or ex-smokers. Second, technical considerations may have affected the outcome of the analysis. The tissues analyzed were obtained from different sources (surgically resected specimen vs. endobronchial biopsies) and processed slightly differently (without or with OCT embedding). Although OCT medium prevents optimal ionization of proteins (33), the necessary removal of water-soluble OCT could introduce a difference in methodology, yet across all histologic subtypes and therefore unlikely to affect the analysis. In addition, the resolution of the MS, daily variation of the vacuum, and laser intensity of the instrument itself and its overall performance are variables that have been addressed before (11). Third, a relatively small sample size of preinvasive lesions requires confirmation in a larger dataset. Nevertheless, our data strongly illustrate the strength of proteomic profiles to discriminate pathologic states along the various steps of tumor development. And fourth, we report the predictive value of protein profiles and not proteins or peptides. The identification of the proteins or peptides making these specific signatures, though critical, is beyond the scope of this study because we first need to validate them in an independent test set. Ultimately, selection and identification of proteins predictive of what preinvasive lesion is likely to develop to lung cancer is a critical goal of future investigation. In conclusion, the original selection of these discriminatory profiles by MALDI MS is a critical first step in the identification of proteomic expression patterns along lung cancer development. Ultimately, this MALDI MSbased approach in tissue may provide characterization of the molecular determinants leading preinvasive lesions to an invasive phenotype.
The authors thank the individuals who provided their informed consent and participated in the study. They thank Lynne Fenner, Blake Mann, Darienne Adkins, Heather Templeton, Candace Murphy, and Anthony Frazier for their assistance in consenting individuals, in obtaining clinical data elements and biological samples. They also thank Hans Rudolf Aerni and Lisa Manier for their expertise in proteomic analysis.
Supported by the Damon Runyon Cancer Research Foundation (P.P.M. is a Damon Runyon-Lilly Clinical Investigator, CI 19-03). Also supported in part by the Lung SPORE P50 CA 90949 to D.P.C. and by the Flight Attendant Medical Research Institute to P.P.M. Originally Published in Press as DOI: 10.1164/rccm.200502-274OC on September 22, 2005 Conflict of Interest Statement: None of the authors have a financial relationship with a commercial entity that has an interest in the subject of this manuscript. Received in original form February 19, 2005; accepted in final form September 20, 2005
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||