Bilinear model for speaker adaptation using tensor analysis

Electronics Letters

Thank you

Your recommendation has been sent to your librarian.

A novel speaker adaptation method based on two-way analysis of training speakers is described. A set of training models is expressed as a tensor and is decomposed into two factors using nonlinear iterative partial least squares, producing a bilinear model. The resulting model has bases of lower dimension and more free parameters than those of eigenvoice, enabling more elaborate modelling for a moderate amount of adaptation data. Results from the isolated-word recognition test show that the proposed model outperforms both eigenvoice and maximum likelihood linear regression (MLLR) for adaptation data longer than 15 s. Moreover, the proposed method can straightforwardly be extended to n-way analysis, e.g. for simultaneous adaptation of speaker, environment, etc.
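
The decomposition step named in the abstract can be illustrated with a minimal sketch of generic nonlinear iterative partial least squares (NIPALS) applied to a plain matrix, which yields a bilinear model X ≈ T Pᵀ. This is not the authors' implementation: the abstract does not specify how the training models are arranged into a tensor, so the toy matrix, the function name `nipals`, and all parameters below are assumptions made purely for illustration.

```python
import numpy as np


def nipals(X, n_components, tol=1e-10, max_iter=500):
    """Generic NIPALS: extract rank-one terms so that X ~= T @ P.T.

    Hypothetical helper for illustration; not taken from the paper.
    """
    X = np.asarray(X, dtype=float).copy()
    n_rows, n_cols = X.shape
    T = np.zeros((n_rows, n_components))  # scores (one column per component)
    P = np.zeros((n_cols, n_components))  # loadings (unit-norm columns)
    for k in range(n_components):
        # Start from the column of the residual with the largest energy.
        t = X[:, np.argmax((X ** 2).sum(axis=0))].copy()
        for _ in range(max_iter):
            p = X.T @ t / (t @ t)        # regress columns of X on the score
            p /= np.linalg.norm(p)       # normalise the loading
            t_new = X @ p                # regress rows of X on the loading
            converged = np.linalg.norm(t_new - t) <= tol * np.linalg.norm(t_new)
            t = t_new
            if converged:
                break
        T[:, k] = t
        P[:, k] = p
        X -= np.outer(t, p)              # deflate: remove the fitted rank-one term
    return T, P


# Toy data (an assumption, not speech models): an 8 x 6 matrix of exact rank 3
# with well-separated singular values so the iteration converges quickly.
rng = np.random.default_rng(0)
U, _ = np.linalg.qr(rng.standard_normal((8, 3)))
V, _ = np.linalg.qr(rng.standard_normal((6, 3)))
X = U @ np.diag([10.0, 5.0, 1.0]) @ V.T

T, P = nipals(X, n_components=3)
residual = np.linalg.norm(X - T @ P.T)
```

In this sketch the columns of `P` play the role of basis vectors and the rows of `T` the role of per-item weights; how the paper maps these two factors onto speaker models is not stated in the abstract.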

http://iet.metastore.ingenta.com/content/journals/10.1049/el.2010.2484