Volume 42, Issue 5 pp. 897-903
Research Article

Assessment of robustness and transferability of classification models built for cancer diagnostics using Raman spectroscopy

Martina Sattlecker

Martina Sattlecker

Cranfield University, College Road, Cranfield, Bedfordshire, MK43 0AL, UK

Search for more papers by this author
Nick Stone

Nick Stone

Biophotonics Research Group, Gloucestershire Royal Hospital, Great Western Road, Gloucester, GL1 3NN, UK

Search for more papers by this author
Jennifer Smith

Jennifer Smith

Biophotonics Research Group, Gloucestershire Royal Hospital, Great Western Road, Gloucester, GL1 3NN, UK

Search for more papers by this author
Conrad Bessant

Corresponding Author

Conrad Bessant

Cranfield University, College Road, Cranfield, Bedfordshire, MK43 0AL, UK

Cranfield University, College Road, Cranfield, Bedfordshire, MK43 0AL, UK.Search for more papers by this author
First published: 26 September 2010
Citations: 11

Abstract

Over recent years, Raman spectroscopy has been demonstrated as a prospective tool for application in cancer diagnostics. The use of Raman spectroscopy for this purpose relies on pattern recognition methods that have been developed to perform well on data achieved under laboratory conditions. However, the application of Raman spectroscopy as a routine clinical tool is likely to result in imperfect data due to instrument-to-instrument variation. Such corruption to the pure tissue spectral data is expected to negatively impact the classification performance of the diagnostic model. In this paper, we present a thorough assessment of the robustness of the Raman approach. This was achieved by perturbing a set of spectra in different ways, including various linear shifts, nonlinear shifts and random noise and using previously optimised classification models to predict the class membership of each spectrum in a testing set. The loss of predictive power with increased corruption was used to calculate a score, which allows an easy comparison of the model robustness. For this approach, three different types of classification models, including linear discriminant analysis (LDA), partial least square discriminant analysis (PLS-DA) and support vector machine (SVM), built for lymph node diagnostics were the subject of the robustness testing. The results showed that a linear perturbation had the highest impact on the performance of all classification models. Among all linear corruption methods, a gradient y-shift resulted in the highest performance loss. Thus, the factor most likely to affect the predictive outcome of models when using different systems is a gradient y-shift. Copyright © 2010 John Wiley & Sons, Ltd.