About this item:

686 Views | 670 Downloads

Author Notes:

Corresponding author: John J Heine, john.heine@moffitt.org

MB is the primary author and was instrumental in designing and developing the work and performed data analyses.

EEF is the secondary author, performed data analyses, assisted in developing the differential evolution and statistical learning methods with MB and JJH.

TKO reviewed and edited the manuscript.

WHL provided expertise used in the statistical learning developments, reviewed and edited the manuscript.

WM was instrumental in the data collection protocol implementation, reviewed and edited the manuscript. ZC reviewed and edited the manuscript.

FRK reviewed and edited the manuscript.

SSR was the Principal Investigator for the protocol involving the tumor tissue and clinical data collection, reviewed and edited the manuscript.

JJH is the senior author and played an important role in the development of the statistical learning system and data analyses.

All authors have read and approved the final manuscript.

The authors thank Dr. Robert C. Hermann, Northwest Oncology Center, Marietta GA, for his efforts in the data collection for this project.

Drs. Owonikoko, Khuri, and Ramalingam are Distinguished Cancer Scholars of the Georgia Cancer Coalition.

The authors declare that they have no competing interests.

Subjects:

Statistical learning methods as a preprocessing step for survival analysis: evaluation of concept using lung cancer data

Tools:

Journal Title:

BioMedical Engineering OnLine

Volume:

Volume 10, Number 97

Publisher:

, Pages 1-15

Type of Work:

Article | Final Publisher PDF

Abstract:

Background Statistical learning (SL) techniques can address non-linear relationships and small datasets but do not provide an output that has an epidemiologic interpretation. Methods A small set of clinical variables (CVs) for stage-1 non-small cell lung cancer patients was used to evaluate an approach for using SL methods as a preprocessing step for survival analysis. A stochastic method of training a probabilistic neural network (PNN) was used with differential evolution (DE) optimization. Survival scores were derived stochastically by combining CVs with the PNN. Patients (n = 151) were dichotomized into favorable (n = 92) and unfavorable (n = 59) survival outcome groups. These PNN derived scores were used with logistic regression (LR) modeling to predict favorable survival outcome and were integrated into the survival analysis (i.e. Kaplan-Meier analysis and Cox regression). The hybrid modeling was compared with the respective modeling using raw CVs. The area under the receiver operating characteristic curve (Az) was used to compare model predictive capability. Odds ratios (ORs) and hazard ratios (HRs) were used to compare disease associations with 95% confidence intervals (CIs). Results The LR model with the best predictive capability gave Az = 0.703. While controlling for gender and tumor grade, the OR = 0.63 (CI: 0.43, 0.91) per standard deviation (SD) increase in age indicates increasing age confers unfavorable outcome. The hybrid LR model gave Az = 0.778 by combining age and tumor grade with the PNN and controlling for gender. The PNN score and age translate inversely with respect to risk. The OR = 0.27 (CI: 0.14, 0.53) per SD increase in PNN score indicates those patients with decreased score confer unfavorable outcome. The tumor grade adjusted hazard for patients above the median age compared with those below the median was HR = 1.78 (CI: 1.06, 3.02), whereas the hazard for those patients below the median PNN score compared to those above the median was HR = 4.0 (CI: 2.13, 7.14). Conclusion We have provided preliminary evidence showing that the SL preprocessing may provide benefits in comparison with accepted approaches. The work will require further evaluation with varying datasets to confirm these findings.

Copyright information:

© 2011 Behera et al; licensee BioMed Central Ltd.

This is an Open Access work distributed under the terms of the Creative Commons Attribution 2.0 Generic License (http://creativecommons.org/licenses/by/2.0/).

Creative Commons License

Export to EndNote