
Lung cancer prediction from microarray data by gene expression programming

Lung cancer is a leading cause of cancer-related death worldwide. Early diagnosis has been shown to greatly improve the chances of treating the disease effectively. Microarray technology provides a promising approach to exploiting gene expression profiles for cancer diagnosis. In this study, the authors propose a gene expression programming (GEP)-based model to predict lung cancer from microarray data. The authors use two gene selection methods to extract the significant lung cancer-related genes and, accordingly, propose different GEP-based prediction models. The authors' GEP models were evaluated on real lung cancer microarray datasets and compared with three representative machine learning methods: support vector machine, multi-layer perceptron and radial basis function neural network. Reliability was assessed by cross-dataset validation. The experimental results show that the GEP model using fewer feature genes outperformed the other models in terms of accuracy, sensitivity, specificity and area under the receiver operating characteristic curve. The authors conclude that the GEP model is a better solution to lung cancer prediction problems.
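The four evaluation measures named above can be illustrated with a minimal Python sketch. This is not the authors' evaluation code: the function names, the 0.5 decision threshold and the toy labels and scores are all assumptions made for illustration. The area under the ROC curve is computed here through its rank interpretation, that is, the probability that a randomly chosen positive sample scores higher than a randomly chosen negative one, with ties counted as one half.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity and specificity from binary labels (1 = cancer)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        # Sensitivity: fraction of true cancer samples that were detected.
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        # Specificity: fraction of normal samples correctly left unflagged.
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


def roc_auc(y_true, scores):
    """AUC as P(score of a positive > score of a negative); ties count 0.5."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical, not taken from the paper's datasets).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.7, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(metrics, auc)
```

Unlike the other three measures, the AUC is computed from the continuous scores rather than the thresholded predictions, so it does not depend on the choice of decision threshold.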

http://iet.metastore.ingenta.com/content/journals/10.1049/iet-syb.2015.0082