Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

access icon free Modified coherence-based dictionary learning method for speech enhancement

This paper presents a new method for speech enhancement based on a dictionary learning method. The proposed approach is based on using coherence measure in dictionary learning. Data required for better fitting to atoms in sparse representation of noise is provided by a noise estimation algorithm that causes noise dictionary to be trained with the same data size as speech signal. To decrease coherence between dictionaries after the training step, a new method is applied to yield incoherent dictionaries. In sparse representation of speech data, the highest energy atoms of noise dictionary are replaced with the lowest energy atoms, under certain conditions. A similar replacement can happen in sparse representation of noise data. Furthermore, in this paper, only one noise dictionary, chosen by a classification method, is used in speech enhancement step, resulting in a faster algorithm. Objective and subjective measures are used for evaluating the simulation results. According to experimental results, the proposed algorithm has been found superior in performance and computation overhead in comparison with the earlier methods in this context. Moreover, this method achieves significantly better results compared with baseline methods such as multi-band and geometric spectral subtraction.

References

    1. 1)
      • 3. Langarani, M.E., Veisi, H., Sameti, H.: ‘The effect of phase information in speech enhancement and speech recognition’ (ISSPA, 2012), pp. 14461447.
    2. 2)
      • 23. Rix, A., Beerends, J., Hollier, M., Hekstra, A.: ‘Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs’. In Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2001, pp. 749752.
    3. 3)
    4. 4)
      • 18. Hirsch, H.G., Pearce, D.: ‘The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions’. Proc. ISCA ITRWASR, 2000, pp. 181188.
    5. 5)
    6. 6)
    7. 7)
    8. 8)
      • 19. http://pianosociety.com..
    9. 9)
    10. 10)
    11. 11)
      • 17. Varga, A., Steeneken, H.J.M., Tomlinson, M., Jones, D.: ‘The Noisex-92 study on the effect of additive noise on automatic speech recognition’. Technical Report, DRA Speech Research Unit, Malvern, UK, 1992.
    12. 12)
    13. 13)
    14. 14)
    15. 15)
      • 16. Kamath, S., Loizou, P.C.: ‘A multi-band spectral subtraction method for enhancing speech corrupted by colored noise’. Student Research Abstracts of Proc. In Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2002.
    16. 16)
      • 4. Mehmetcik, E., Ciloglu, T.: ‘Speech enhancement by maintaining phase continuity’. Proc. Meetings on Acoustics, 2012, vol. 18.
    17. 17)
      • 14. http://www.dcs.shef.ac.uk/spandh/gridcorpus.
    18. 18)
      • 6. Lee, H., Battle, A., Raina, R., Ng, A.Y.: ‘Efficient sparse coding algorithms’. Advances in Neural Information Processing Systems, 2006.
    19. 19)
    20. 20)
    21. 21)
      • 1. Sigg, C.D., Dikk, T., Buhmann, J.M.: ‘Speech enhancement with sparse coding in learned dictionaries’. In Proc. IEEE Int. Conf. Acoustic, Speech, Signal Processing, 2010, pp. 47584761.
    22. 22)
    23. 23)
      • 22. Loizou, P.C.: ‘Speech enhancement: theory and practice’ (Taylor and Francis., 2007).
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-spr.2014.0148
Loading

Related content

content/journals/10.1049/iet-spr.2014.0148
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address