http://iet.metastore.ingenta.com
1887

SamEn-SVR: using sample entropy and support vector regression for bug number prediction

SamEn-SVR: using sample entropy and support vector regression for bug number prediction

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
IET Software — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

Monitoring and predicting the trend of bug number time series of a software system is crucial for both software project managers and software end-users. For software managers, accurate prediction of bug number of a software system will assist them in making timely decisions, such as effort investment and resource allocation. For software end-users, knowing possible bug number of their systems ahead will enable them to adopt timely actions in coping with the loss caused by possible system failures. This study proposes an approach called SamEn-SVR to combine sample entropy and support vector regression (SVR) to predict software bug number using time series analysis. The basic idea is to use template vectors with the smallest complexity as input vectors for SVR classifiers to ensure predictability of time series. By using Mozilla Firefox bug data, we conduct extensive experiments to compare the proposed approach and state-of-the-art techniques including auto-regressive integrated moving average (ARIMA), X12 enhanced ARIMA and polynomial regression to predict bug number time series. Experimental results demonstrate that the proposed SamEn-SVR approach outperforms state-of-the-art techniques in bug number prediction.

References

    1. 1)
      • S. Pressman . (2005)
        1. Pressman, S.: ‘Software engineering: a practitioner's approach’ (McGraw-Hill Education, Ohio USA, 2005, 6th edn.).
        .
    2. 2)
      • W. Zhang , S. Wang , Q. Wang .
        2. Zhang, W., Wang, S., Wang, Q.: ‘KSAP an approach to bug report assignment using KNN search and heterogeneous proximity’, Inf. Softw. Technol., 2016, 70, pp. 6884.
        . Inf. Softw. Technol. , 68 - 84
    3. 3)
      • H. Zhang .
        3. Zhang, H.: ‘An initial study of the growth of eclipse defects’. Fifth Int. Workshop on Mining Software Repositories (MSR 2008), 10–11 May 2008.
        . Fifth Int. Workshop on Mining Software Repositories (MSR 2008)
    4. 4)
      • A.E. Hassan .
        4. Hassan, A.E.: ‘The road ahead for mining software repositories’. Frontiers of Software Maintenance (FoSM'2008), 2008, pp. 4857.
        . Frontiers of Software Maintenance (FoSM'2008) , 48 - 57
    5. 5)
      • T. Zimmermann , P. Weigerber , S. Diehl .
        5. Zimmermann, T., Weigerber, P., Diehl, S., et al: ‘Mining version histories to guide software changes’, IEEE Trans. Softw. Eng., 2005, 31, (6), pp. 429445, doi: 10.1109/TSE.2005.72.
        . IEEE Trans. Softw. Eng. , 6 , 429 - 445
    6. 6)
      • I. Herraiz , J.M. González-Barahona , G. Robles .
        6. Herraiz, I., González-Barahona, J.M., Robles, G.: ‘Forecasting the number of changes in eclipse using time series analysis’. Proc. Fourth Int. Workshop on Mining Software Repositories (MSR 2007), 19–20 May 2007, p. 32.
        . Proc. Fourth Int. Workshop on Mining Software Repositories (MSR 2007) , 32
    7. 7)
      • F. Takens . (1981)
        7. Takens, F.: ‘Detecting strange attractors in turbulence’ in ‘Lecture Notes in Mathematics’, vol. 898 (Springer, Berlin, 1981), pp. 366381.
        .
    8. 8)
      • T. Sauer , J.A. Yorke , M. Casdagli .
        8. Sauer, T., Yorke, J.A., Casdagli, M.: ‘Embedology’, J. Stat. Phys., 1991, 65, p. 579.
        . J. Stat. Phys. , 579
    9. 9)
      • R.H. Shumway , D.S. Stoffer . (2006)
        9. Shumway, R.H., Stoffer, D.S.: ‘Time series analysis and its applications with R examples (springer texts in statics)’ (Springer, Heidelberg, Berlin, 2006).
        .
    10. 10)
      • G.E.P. Box , G. Jenkins . (1994)
        10. Box, G.E.P., Jenkins, G.: ‘Time series analysis: forecasting and control, Holden-day’ (Prentice-Hall, New York, NY, 1994, 3rd edn.).
        .
    11. 11)
      • 11. ‘X12-ARIMA’, available at http://www.census.gov/srd/www/x12a/.
        .
    12. 12)
      • D.F. Findley , C.C. Hood .
        12. Findley, D.F., Hood, C.C.: ‘X-12-ARIMA and its application to some Italian indicator series’. Seasonal adjustment procedures – experiences and perspectives, Istituto Nazionale di Statistica, Rome, 2000, 10, (20), pp. 231251.
        . Seasonal adjustment procedures – experiences and perspectives , 231 - 251
    13. 13)
      • D.F. Findley , B.C. Monsell , W.R. Bell .
        13. Findley, D.F., Monsell, B.C., Bell, W.R., et al: ‘New capabilities and methods of the X-12-ARIMA seasonal adjustment program’, J. Bus. Econ. Stat., 1998, 16, pp. 127176.
        . J. Bus. Econ. Stat. , 127 - 176
    14. 14)
      • M. Hazewinkel . (2001)
        14. Hazewinkel, M.: ‘Taylor formula’ in ‘Encyclopedia of Mathematics’ (Springer, Heidelberg, Berlin, 2001), ISBN 978-1-55608-010-4.
        .
    15. 15)
      • D.E. Rumelhart , G.E. Hinton , R.J. Williams .
        15. Rumelhart, D.E., Hinton, G.E., Williams, R.J.: ‘Learning internal representations by error propagation’. Parallel Distributed Processing, Exploitations in the Microstructure of Cognition, Cambridge, MA, 1986, vol. 1, pp. 318362.
        . Parallel Distributed Processing, Exploitations in the Microstructure of Cognition , 318 - 362
    16. 16)
      • C.D. Cantrell . (2000)
        16. Cantrell, C.D.: ‘Modern mathematical methods for physicists and engineers’ (Cambridge University Press, Cambridge, 2000).
        .
    17. 17)
      • J.S. Richman , J.R. Moorman .
        17. Richman, J.S., Moorman, J.R.: ‘Physiological time-series analysis using approximate entropy and sample entropy’, Am. J. Physiol. Heart Circ. Physiol., 2000, 278, (6), pp. 20392049.
        . Am. J. Physiol. Heart Circ. Physiol. , 6 , 2039 - 2049
    18. 18)
      • W. Zhang , T. Yoshida , X. Tang .
        18. Zhang, W., Yoshida, T., Tang, X.: ‘Text classification based on multi-word with support vector machine’, Knowl.-Based Syst., 2008, 21, (8), pp. 879886.
        . Knowl.-Based Syst. , 8 , 879 - 886
    19. 19)
      • 19. ‘Mozilla Firefox Project’, available at https://www.mozilla.org.
        .
    20. 20)
      • 20. ‘MSR 2010 challenge’, available at http://msr.uwaterloo.ca/msr2010/challenge/.
        .
    21. 21)
      • 21. ‘Eviews Trial Version’, available at http://www.eviews.com/home.html.
        .
    22. 22)
      • L.C. Briand , I. Wieczorek . (2002)
        22. Briand, L.C., Wieczorek, I.: ‘Resource estimation in software engineering’, in ‘Encyclopedia of software engineering’ (Wiley, Hoboken, 2002), pp. 11601196.
        .
    23. 23)
      • H.B. Mann , R. Whitney .
        23. Mann, H.B., Whitney, R.: ‘On a test of whether one of two random variables is stochastically larger than the other’, Ann. Math. Stat., 1947, 18, (1), pp. 5060.
        . Ann. Math. Stat. , 1 , 50 - 60
    24. 24)
      • C.C.H. Yuen .
        24. Yuen, C.C.H.: ‘On analyzing maintenance process data at the global and detail levels: a case study’. Proc. 6th Int. Conf. on Software Maintenance, 1988, pp. 248255.
        . Proc. 6th Int. Conf. on Software Maintenance , 248 - 255
    25. 25)
      • C.F. Kemerer , S. Slaughter .
        25. Kemerer, C.F., Slaughter, S.: ‘An empirical approach to studying software evolution’, IEEE Trans. Software Eng., 1999, 25, (4), pp. 493509.
        . IEEE Trans. Software Eng. , 4 , 493 - 509
    26. 26)
      • B. Kenmei , G. Antoniol , M. Penta .
        26. Kenmei, B., Antoniol, G., Penta, M.: ‘Trend analysis and issue prediction in large-scale open source systems’. Proc. 12th European Conf. on Software Maintenance and Reengineering, 2008, pp. 7382.
        . Proc. 12th European Conf. on Software Maintenance and Reengineering , 73 - 82
    27. 27)
      • F. Caprio , G. Casazza , M.D. Penta .
        27. Caprio, F., Casazza, G., Penta, M.D., et al: ‘Measuring and predicting the Linux kernel evolution’. Proc. Int. Workshop of Empirical Studies on Software Maintenance, Florence, Italy, 2001.
        . Proc. Int. Workshop of Empirical Studies on Software Maintenance
    28. 28)
      • J. Pati , K.K. Shukla .
        28. Pati, J., Shukla, K.K.: ‘A comparison of ARIMA, neural network and a hybrid technique for Debian bug number prediction’. Proc. 5th Int. Conf. on Computer and Communication Technology, 2014, pp. 4753.
        . Proc. 5th Int. Conf. on Computer and Communication Technology , 47 - 53
    29. 29)
      • W. Wu , W. Zhang , Y. Yang .
        29. Wu, W., Zhang, W., Yang, Y., et al: ‘Time series analysis for bug number prediction’. Proc. 2nd Int. Conf. on Software Engineering and Data Mining, 2010, pp. 589596.
        . Proc. 2nd Int. Conf. on Software Engineering and Data Mining , 589 - 596
    30. 30)
      • S. Huang , P. Chuang , C. Wub .
        30. Huang, S., Chuang, P., Wub, C., et al: ‘A chaos-based support vector regressions for exchange rate forecasting’, Expert Syst. Appl., 2010, 37, pp. 85908598.
        . Expert Syst. Appl. , 8590 - 8598
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-sen.2017.0168
Loading

Related content

content/journals/10.1049/iet-sen.2017.0168
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address