http://iet.metastore.ingenta.com
1887

Adaptation of hidden Markov model mean parameters using two-dimensional PCA with constraint on speaker weight

Adaptation of hidden Markov model mean parameters using two-dimensional PCA with constraint on speaker weight

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
Electronics Letters — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

A basis-based speaker adaptation technique is proposed, where basis vectors are derived using two-dimensional principal component analysis (2DPCA) and the speaker weight for the target speaker is constrained in the space of training speaker weights. During adaptation, the speaker weight that is derived in the maximum-likelihood framework is constrained by projecting the weight into the space of the weights of training speakers. In the experiments, the proposed approach shows performance improvement over the unconstrained 2DPCA-based approach.

References

    1. 1)
      • 1. Rabiner, L.R.: ‘A tutorial on hidden Markov models and selected applications in speech recognition’, Proc. IEEE, 1989, 77(2), pp. 257286.
    2. 2)
    3. 3)
      • 3. Jolliffe, I.T.: ‘Principal component analysis’ (Springer, New York, 2002, 2nd edn).
    4. 4)
    5. 5)
    6. 6)
      • 6. Dempster, A.P., Laird, N.M., Rubin, B.D.: ‘Maximum likelihood from incomplete data via the EM algorithm’, J. R. Stat. Soc. Ser. B, Stat. Methodol., 1977, 39, pp. 138.
    7. 7)
    8. 8)
      • 8. Paul, D.B., Baker, J.M.: ‘The design for the Wall Street Journal-based CSR corpus’. Proc. DARPA Speech and Natural Language Workshop, Newark, DE, USA, July 1992, pp. 357362.
    9. 9)
http://iet.metastore.ingenta.com/content/journals/10.1049/el.2014.0448
Loading

Related content

content/journals/10.1049/el.2014.0448
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address