In silico discovery of significant pathways in colorectal cancer metastasis using a two-stage optimisation approach

Accurate and reliable modelling of protein–protein interaction networks for complex diseases such as colorectal cancer can help to better understand disease mechanisms and potentially to discover new drugs. Machine learning methods, such as empirical mode decomposition combined with least squares support vector machines and the discrete Fourier transform, have been widely used as classifiers and for the automatic discovery of biomarkers for diagnosis of the disease. The existing methods are, however, less efficient as they tend to ignore the interaction with the classifier. In this study, the authors propose a two-stage optimisation approach to effectively select biomarkers and discover interactions among them. In the first stage, particle swarm optimisation (PSO) and differential evolution (DE) are used to optimise the parameters of the support vector machine recursive feature elimination algorithm; in the second stage, a dynamic Bayesian network is used to predict temporal relationships between biomarkers across two time points. Results show that the 18 and 25 biomarkers selected by the PSO- and DE-based approaches, respectively, yield the same accuracy of 97.3% and F1-scores of 97.7% and 97.6%, respectively. The stratified analysis reveals that Alpha-2-HS-glycoprotein was a dominant hub gene with multiple interactions with other genes, including Fibrinogen alpha chain, which is also a potential biomarker for colorectal cancer.
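To make the first-stage idea concrete, the following is a minimal sketch of a basic PSO loop searching a two-dimensional parameter space. It is not the authors' implementation: the objective `toy_cv_error` is a hypothetical stand-in for the cross-validated error of a classifier, the choice of `log10(C)` and `log10(gamma)` as the tuned parameters is an assumption (the abstract does not name them), and the inertia and acceleration coefficients are common textbook values rather than values taken from the study.

```python
import random


def pso_minimise(objective, bounds, n_particles=20, n_iters=60,
                 inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimise `objective` inside the box `bounds` with a basic PSO."""
    rng = random.Random(seed)
    dim = len(bounds)
    # Positions start uniformly inside the bounds; velocities start at zero.
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    # Each particle remembers its own best position and value so far.
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity = inertia + pull to personal best + pull to global best.
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                lo, hi = bounds[d]
                # Move the particle and clamp it to the search box.
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


def toy_cv_error(params):
    """Hypothetical surrogate for a classifier's cross-validated error.

    params = [log10(C), log10(gamma)] (assumed parameters); the minimum
    is placed at (1, -2) purely for illustration.
    """
    log_c, log_gamma = params
    return (log_c - 1.0) ** 2 + (log_gamma + 2.0) ** 2


best_params, best_error = pso_minimise(toy_cv_error,
                                       bounds=[(-3.0, 3.0), (-5.0, 1.0)])
```

In a real pipeline the surrogate would be replaced by a function that trains the feature-elimination classifier with the candidate parameters and returns its validation error; a DE-based search could be substituted for the PSO loop with the same objective interface.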

Inspec keywords: Bayes methods; recursive functions; support vector machines; medical computing; molecular biophysics; evolutionary computation; genetics; proteins; particle swarm optimisation; cancer

Other keywords: dynamic Bayesian network; colorectal cancer metastasis; differential evolution; Fibrinogen alpha chain; particle swarm optimisation; biomarkers; stratified analysis; hub gene; support vector machine recursive feature elimination; two-stage optimisation approach; protein–protein interaction networks; Alpha-2-HS-glycoprotein

Subjects: Biomedical engineering; Physics of subcellular structures; Optimisation techniques; Biomolecular interactions, charge transfer complexes; Knowledge engineering techniques; Biology and medical computing; Other topics in statistics

http://iet.metastore.ingenta.com/content/journals/10.1049/iet-syb.2015.0031