A data-mining approach to biomarker identification from protein profiles using discrete stationary wavelet transform |
| |
Authors: | Hussain Montazery-Kordy Mohammad Hossein Miran-Baygi Mohammad Hassan Moradi |
| |
Affiliation: | [1]Department of Electrical and Computer Engineering, Tarbiat Modares University, P.O. Box 14115-111, Tehran, Iran; [2]Faculty of Biomedical Engineering, Amir Kabir University of Technology, P.O. Box 15875-4413, Tehran, Iran |
| |
Abstract: | Objective: To develop a new bioinformatic tool based on a data-mining approach for extraction of the most informative proteins that could be used to find the potential biomarkers for the detection of cancer. Methods: Two independent datasets from serum samples of 253 ovarian cancer and 167 breast cancer patients were used. The samples were examined by surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF MS). The datasets were used to extract the informative proteins using a data-mining method in the discrete stationary wavelet transform domain. As a dimensionality reduction procedure, the hard thresholding method was applied to reduce the number of wavelet coefficients. Also, a distance measure was used to select the most discriminative coefficients. To find the potential biomarkers using the selected wavelet coefficients, we applied the inverse discrete stationary wavelet transform combined with a two-sided t-test. Results: From the ovarian cancer dataset, a set of five proteins were detected as potential biomarkers that could be used to identify the cancer patients from the healthy cases with accuracy, sensitivity, and specificity of 100%. Also, from the breast cancer dataset, a set of eight proteins were found as the potential biomarkers that could separate the healthy cases from the cancer patients with accuracy of 98.26%, sensitivity of 100%, and specificity of 95.6%. Conclusion: The results have shown that the new bioinformatic tool can be used in combination with the high-throughput proteomic data such as SELDI-TOF MS to find the potential biomarkers with high discriminative power. |
| |
Keywords: | Proteomics Discrete stationary wavelet transform Data mining Feature selection Biomarker Cancer classification |
本文献已被 维普 等数据库收录! |
|