access icon free Two-microphone subband noise reduction scheme with a new noise subtraction parameter for speech quality enhancement

An improved subband noise reduction technique is proposed for two-microphone voice communication systems. The technique aims to enhance the speech quality by utilising a subband structure with different noise reduction schemes for different frequency bands. In the low-frequency band where dominant cues of speech spectral components are usually located and the noise signals from the two channels are mainly correlated, the spectral subtraction method, together with a new variable noise subtraction parameter, is employed so that the noise attenuation performance and speech distortion are controllable. In the high-frequency band where less-dominant frequency information of speech spectrum is located, the modified cross-spectral subtraction technique is utilised to remove the high-frequency decorrelated noise spectral components. Extensive comparisons among various noise reduction techniques based on computer simulations demonstrate that the proposed two-microphone subband noise reduction scheme achieves excellent noise attenuation performance while preserving the speech quality.

Inspec keywords: signal denoising; microphones; speech enhancement; correlation methods

Other keywords: speech spectrum; speech quality enhancement; microphone subband noise reduction; variable noise subtraction parameter; spectral subtraction method; subband noise reduction technique; modified cross-spectral subtraction technique; signal correlation; two-microphone voice communication system; noise attenuation; speech distortion

Subjects: Speech processing techniques; Audio equipment and systems; Speech and audio signal processing

References

    1. 1)
      • 27. Jeub, M., Herglotz, C., Nelke, C., Beaugeant, C., Vary, P.: ‘Noise reduction for dual-microphone mobile phones exploiting power level differences’. Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Kyoto, Japan, March 2012, pp. 16931696.
    2. 2)
      • 22. McCowan, I.A., Bourlard, H.: ‘Microphone array post-filter for diffuse noise field’. Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Orlando, USA, May 2002, pp. 905908.
    3. 3)
      • 28. Vaseghi, S.V.: ‘Advanced digital signal processing and noise reduction’ (Wiley, 2000, 2nd edn.), Chapter 11.
    4. 4)
      • 33. ‘Methods for subjective determination of transmission quality’, ITU-T Recommendation, 1996, p. 800.
    5. 5)
    6. 6)
      • 9. Li, C., Liu, W.J.: ‘A novel multi-band spectral subtraction method based on phase modification and magnitude compensation’. Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, May 2011, pp. 47604763.
    7. 7)
      • 29. ‘TIMIT (Texas Instruments (TI) and Massachusetts Institute of Technology (MIT)) speech database’, which is sponsored by DARPA (The Defense Advanced Research Projects Agency of the United States Department of Defense), http://www.ldc.upenn.edu/, accessed June 2013.
    8. 8)
      • 21. Zelinski, R.: ‘A microphone array with adaptive post-filtering for noise reduction in reverberant rooms’. Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), New York, USA, April 1988, pp. 25782581.
    9. 9)
      • 23. Dorbecker, M., Ernst, S.: ‘Combination of two-channel spectral subtraction and adaptive wiener post-filtering for noise reduction and dereverberation’. Proc. European Signal Processing Conference (EUSIPCO), Trieste, Italy, September 1996, pp. 995998.
    10. 10)
    11. 11)
    12. 12)
    13. 13)
      • 32. ‘Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs’, ITU-T Recommendation, 2001, p. 862.
    14. 14)
      • 26. Yousefian, N., Kokkinakis, K., Loizou, P.C.: ‘A coherence-based algorithm for noise reduction in dual-microphone applications’. Proc. European Signal Processing Conference (EUSIPCO), Aalborg, Denmark, August 2010, pp. 19041908.
    15. 15)
      • 14. Li, J., Akagi, M., Suzuki, Y.: ‘Extension of the two-microphone noise reduction method for binaural hearing aids’. Proc. Int. Conf. Audio, Language, and Image Processing (ICALIP), Shanghai, China, July 2008, pp. 97101.
    16. 16)
      • 30. ‘NOISEX-92: database of recording of various noises’, http://sipl.technion.ac.il/Info/Downloads_DataBases_NoiseX92_e.shtml, accessed June 2013.
    17. 17)
    18. 18)
      • 8. Kamath, S.D., Loizou, P.C.: ‘A multi-band spectral subtraction method for enhancing speech corrupted by colored noise’. Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Orlando, USA, May 2002, pp. IV4164.
    19. 19)
      • 18. Rahmani, M., Akbari, A., Ayad, B., Mazoochi, M., Moin, M.S.: ‘A modified coherent based method for dual microphone speech enhancement’. Proc. Int. Conf. Signal Processing and Communications (ICSPC), Dubai, United Arab Emirates, November 2007, pp. 225228.
    20. 20)
    21. 21)
    22. 22)
      • 19. Zamani, B., Rahmani, M., Akbari, A.: ‘Residual noise control for coherence based dual microphone speech enhancement’. Proc. Int. Conf. Computer and Electrical Engineering (ICCEE), Phuket, Thailand, March 2008, pp. 601605.
    23. 23)
    24. 24)
    25. 25)
    26. 26)
      • 17. Zhang, X., Jia, Y.: ‘A soft-decision based noise power cross power spectral density estimation for two-microphone speech enhancement systems’. Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Philadelphia, USA, April 2005, pp. 813816.
    27. 27)
    28. 28)
      • 11. Esch, T., Vary, P.: ‘Efficient musical noise suppression for speech enhancement systems’. Proc. Int. Conf. Acoustic, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, April 2009, pp. 44094412.
    29. 29)
    30. 30)
    31. 31)
      • 7. Berouti, M., Schwartz, R., Makhoul, J.: ‘Enhancement of speech corrupted by acoustic noise’. Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Washington, USA, April 1979, pp. 208211.
    32. 32)
      • 31. Enqing, D., Guizhong, L., Yatong, Z., Yu, C.: ‘Voice activity detection based on short-time energy and noise spectrum adaptation’. Proc. Int. Conf. Signal Processing (ICSP), Beijing, China, October 2002, pp. 464467.
    33. 33)
http://iet.metastore.ingenta.com/content/journals/10.1049/iet-spr.2013.0182
Loading

Related content

content/journals/10.1049/iet-spr.2013.0182
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading