© The Institution of Engineering and Technology
An algorithm for target speech enhancement based on degenerate unmixing and estimation technique (DUET) is described. Although the DUET can accomplish source separation only from two mixtures, the requirements of knowing the number of sources in advance and of estimating the attenuation and delay parameters for all sources prevent it from being used in real-world applications. Circumventing these requirements, the described algorithm is useful for speech enhancement where only one target speech should be extracted. Experimental results show that the algorithm provides much faster convergence of all the required parameters and noise suppression performances that are better than or comparable to the DUET with negligible distortion of the recovered speech.
References
-
-
1)
-
O. Yilmaz ,
S. Rickard
.
Blind separation of speech mixtures via time-frequency masking.
IEEE Trans. Signal Process.
,
7 ,
1830 -
1847
-
2)
-
H. Lane ,
B. Tranel
.
The Lombard sign and the role of hearing in speech.
J. Speech Hear. Res.
,
677 -
709
-
3)
-
S. Haykin
.
(2000)
Unsupervised adaptive filtering, Volume 1: Blind source separation.
-
4)
-
P.C. Loizou
.
(2007)
Speech enhancement: theory and practice.
-
5)
-
Rickard, S., Balan, R., Rosca, J.: `Real-time time-frequency based blind source separation', Int. Workshop on ICA and BSS, 2001, p. 651–656.
http://iet.metastore.ingenta.com/content/journals/10.1049/el.2010.3033
Related content
content/journals/10.1049/el.2010.3033
pub_keyword,iet_inspecKeyword,pub_concept
6
6