
Prediction of cancer using customised fuzzy rough machine learning approaches

This Letter proposes a customised approach to attribute selection based on the fuzzy rough quick reduct algorithm. The unbalanced data are balanced using the synthetic minority oversampling technique (SMOTE), and the high dimensionality of the cancer data is reduced using a correlation-based filter. The balanced, dimensionality-reduced gene subset is then used to compute the final minimal reduct set by applying a customised fuzzy triangular norm operator within the fuzzy rough quick reduct algorithm. The customised operator is used together with a Lukasiewicz fuzzy implicator to compute the fuzzy approximation, and it selects the smallest number of informative feature genes from the dimensionality-reduced datasets. Using a customised function for the Lukasiewicz triangular norm operator, leave-one-out cross-validation yields classification accuracies of 94.85, 76.54, 98.11, and 99.13% on the leukemia, central nervous system, lung, and ovarian datasets, respectively. The conventional fuzzy rough quick reduct and the proposed method are compared using classification accuracy, precision, recall, F-measure, scatter plots, receiver operating characteristic area, the McNemar test, the chi-squared test, the Matthews correlation coefficient, and the false discovery rate; the authors report that the proposed approach performs better than methods available in the literature.
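To make the role of the triangular norm and the implicator concrete, the following is a minimal sketch of the dependency degree that a fuzzy rough quick reduct search typically maximises. It uses the standard Lukasiewicz t-norm and implicator; the customised operator proposed in the Letter is not specified in the abstract and is not reproduced here. The function names, the range-normalised similarity measure, and the toy data are all assumptions made for illustration.

```python
def lukasiewicz_tnorm(a, b):
    """Standard Lukasiewicz t-norm: T(a, b) = max(0, a + b - 1)."""
    return max(0.0, a + b - 1.0)


def lukasiewicz_implicator(a, b):
    """Standard Lukasiewicz implicator: I(a, b) = min(1, 1 - a + b)."""
    return min(1.0, 1.0 - a + b)


def similarity(column):
    """Assumed fuzzy similarity for one attribute: 1 - |x - y| / range."""
    span = (max(column) - min(column)) or 1.0  # avoid division by zero
    return [[1.0 - abs(x - y) / span for y in column] for x in column]


def combine(r1, r2):
    """Combine the relations of two attributes element-wise with the t-norm."""
    n = len(r1)
    return [[lukasiewicz_tnorm(r1[i][j], r2[i][j]) for j in range(n)]
            for i in range(n)]


def dependency(relation, labels):
    """Fuzzy rough dependency degree of the decision on a fuzzy relation.

    For each sample x, the positive-region membership is the largest
    lower-approximation membership over the decision classes, where the
    lower approximation is inf_y I(R(x, y), A(y)).
    """
    n = len(labels)
    classes = set(labels)
    total = 0.0
    for x in range(n):
        total += max(
            min(
                lukasiewicz_implicator(relation[x][y],
                                       1.0 if labels[y] == c else 0.0)
                for y in range(n)
            )
            for c in classes
        )
    return total / n


# Toy data (hypothetical): one informative gene, one constant gene.
labels = [0, 0, 1, 1]
informative = similarity([0.0, 0.1, 0.9, 1.0])
constant = similarity([0.5, 0.5, 0.5, 0.5])
gamma_informative = dependency(informative, labels)
gamma_constant = dependency(constant, labels)
```

In a quick reduct style search, attributes would be added greedily, one at a time, keeping the one that raises the dependency degree most, until no further gain is obtained. In this toy example the informative gene scores a much higher dependency than the constant one, which carries no class information.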

http://iet.metastore.ingenta.com/content/journals/10.1049/htl.2018.5055