Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

access icon free Improving semantic compression specification in large relational database

The large-scale relational databases normally have a large size and a high degree of sparsity. This has made database compression very important to improve the performance and save storage space. Using standard compression techniques (syntactic) such as Gzip or Zip does not take advantage of the relational properties, as these techniques do not look at the nature of the data. Since semantic compression accounts for and exploits both the meanings and dynamic ranges of error for individual attributes (lossy compression); and existing data dependencies and correlations between attributes in the table (lossless compression), it is very effective for table-data compression. Inspired by semantic compression, this study proposes a novel independent lossless compression system through utilising data-mining model to find the frequent pattern with maximum gain (representative row) in order to draw attribute semantics, besides a modified version of an augmented vector quantisation coder to increase total throughput of the database compression. This algorithm enables more granular and suitable for every kind of massive data tables after synthetically considering compression ratio, space, and speed. The experimentation with several very large real-life datasets indicates the superiority of the system with respect to previously known lossless semantic techniques.

References

    1. 1)
    2. 2)
      • 17. Ng, W., Ravishanka, C.: ‘Relational database compression using augmented vector quantization’. Int. Conf. on Data Engineering, Taiwan, March 1995, pp. 540549.
    3. 3)
      • 6. Buchsbaum, A., Caldwell, D., Church, K., et al: ‘Engineering the compression of massive tables: an experimental approach’. Proc. 11th Annual ACM-SIAM Symp. on Discrete Algorithms, 2000, vol. 9, pp. 175184.
    4. 4)
    5. 5)
      • 4. Hoque, A., McGregor, D., Wilson, J.: ‘Database compression using an offline dictionary method’. Advances in Information Systems, 2002, pp. 1120.
    6. 6)
      • 25. Murugesan, C., Ravichandran, T.: ‘Real time database compression optimization using iterative length compression algorithm’. Int. Conf. on Computer Science and Information Technology, USA, 2013, pp. 99105.
    7. 7)
      • 11. Babu, S., Garofalakis, M., Rastogi, R.: ‘SPARTAN: a model based semantic compression system for massive data tables’. Int. Conf. on Management of Data, USA, 21–24 2001, pp. 283294.
    8. 8)
      • 22. Tian-lei, H., Chen, G., Li, X., et al: ‘Automatic relational database compression scheme design based on swarm evolution’, J. Zhejiang Univ. Sci., 2006, A7, (10), pp. 16421651.
    9. 9)
      • 2. Cannane, A., Williams, H.: ‘A compression scheme for large databases’. Proc. IEEE 11th Database Conf., Australasian, 2000, pp. 611.
    10. 10)
      • 10. Walker, A., Paul Benjamin, D.: ‘Semantic encoding and compression of database tables’. U.S. Patent 6,691,132, 10 February 2004.
    11. 11)
      • 19. Stonebraker, M., Abadi, D., Batkin, A.: ‘C-Store: a column-oriented DBMS’. Proc. 31st Int. Conf. on Very large Databases, 2005, pp. 553564.
    12. 12)
      • 14. Benjamin, D., Walker, A.: ‘Semantic encoding of relational databases in wireless networks’. Int. Conf. on Data Mining, Intrusion Detection, Information Assurance and Data Networks Security, USA, March 2005, pp. 255262.
    13. 13)
      • 26. Umarani, J., Prakash, S., Gupta, P.: ‘An indexing technique for biometric database’. IEEE Int. Conf. on Wavelet Analysis and Pattern Recognition (ICWAPR'08), 2008, vol. 2, pp. 758763.
    14. 14)
      • 1. Aghav, S.: ‘Database compression techniques for performance optimization’. Proc. IEEE Second Int. Conf. on Computer Engineering and Technology (ICCET), 2010, vol. 6, pp. 714717.
    15. 15)
      • 23. Atayero, A., Alatishe, A., Olugbara, O.: ‘Compression of high-dimensional data spaces using non-differential augmented vector quantization’, Int. J. Inf. Commun. Technol. Res., 2011, 1, (8), pp. 329336.
    16. 16)
      • 15. Huang, H.: ‘Lossless semantic compression for relational databases’. Master thesis, Simon Fraser University, November 2001.
    17. 17)
      • 3. Chen, Z.: ‘Building compressed database systems’. PhD dissertation, Cornell University, 2002.
    18. 18)
    19. 19)
      • 18. Abadi, D., Madden, S., Ferreira, M.: ‘Integrating compression and execution in column-oriented database systems’. Proc. ACM Int. Conf. on Management of Data, 2006, pp. 671682.
    20. 20)
      • 5. Karumbunathan, A.: ‘Using predictive models for compression in database systems’. Technical Report, CIT 367, Brown University, USA, 2013, pp. 112.
    21. 21)
      • 16. Chien-Le, G., Aisaka, K., Tsukamoto, M., et al: ‘Database compression with data mining methods’. Information organization and Databases, US, 2000, pp. 177190.
    22. 22)
      • 20. Muthukumar, M., Ravichandran, T.: ‘Analyzing compression performance for real time database systems’. Int. Conf. on Advanced Computer Engineering and Applications (ICACEA), 2012, pp. 1722.
    23. 23)
      • 7. Tamrakar, A., Nanda, V.: ‘A compression algorithm for optimization of storage consumption of non-Oracle database’, Int. J. Adv. Res. Comput. Sci. Electron. Eng. (IJARCSEE), 2012, 1, (5), pp. 3943.
    24. 24)
      • 12. Jagadish, H., Ng, R., Ooi, B., et al: ‘ItCompress: an iterative semantic compression algorithm’. Int. Conf. on Data Engineering, USA, 2004, pp. 646657.
    25. 25)
      • 8. Muthukumar, M., Ravichandran, T.: ‘Real time database compression optimization using iterative length compression algorithm’. Int. Conf. on Computer Science and Information Technology, 2013, pp. 99105.
    26. 26)
      • 13. Jagadish, H., Madar, J., Ng, R.: ‘Semantic compression and pattern extraction with fascicles’, J. Very Large Databases (VLDB), 1999, 99, pp. 710.
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-sen.2015.0054
Loading

Related content

content/journals/10.1049/iet-sen.2015.0054
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address