Language Model Score Regularization for Speech Recognition

Author(s): Yike Zhang^{1, 2}; Pengyuan Zhang^{1, 2}; Yonghong Yan^{1, 2, 3}
- Affiliations: 1: Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China;
  2: University of Chinese Academy of Sciences, Beijing 100049, China;
  3: Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumchi 830011, China
Source: Volume 28, Issue 3, May 2019, p. 604–609
DOI: 10.1049/cje.2019.03.015, Print ISSN 1022-4653, Online ISSN 2075-5597

Inspired by the fact that back-off and interpolated smoothing algorithms have a significant effect on statistical language modeling, this paper proposes a sentence-level language model (LM) score regularization algorithm to improve the fault tolerance of LMs to recognition errors. The proposed algorithm is applicable to both count-based LMs and neural network LMs. Instead of predicting the occurrence of a word sequence under a fixed-order Markov assumption, we use a composite model consisting of models of different orders, with either n-gram or skip-gram features, to estimate the probability of the word sequence. To simplify implementation, we derive a connection between bidirectional neural networks and the proposed algorithm. Experiments were carried out on the Switchboard corpus. Results on N-best list rescoring show that the proposed algorithm achieves consistent word error rate reductions when applied to count-based LMs, feedforward neural network (FNN) LMs, and recurrent neural network (RNN) LMs.
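
The idea of scoring a hypothesis with a composite of different-order models, then using that score to rescore an N-best list, can be illustrated with a minimal Python sketch. This is not the paper's algorithm: the abstract does not give the regularization formula, so the sketch assumes a plain weighted sum of sentence-level log scores from unigram, bigram, and trigram count-based models with add-k smoothing. All function names, the toy training data, the weights, and the acoustic scores are hypothetical.

```python
import math
from collections import Counter


def train_ngram_counts(sentences, order):
    """Count n-grams and their (order-1)-word contexts, padding each sentence."""
    ngrams, contexts = Counter(), Counter()
    for words in sentences:
        padded = ["<s>"] * (order - 1) + list(words) + ["</s>"]
        for i in range(order - 1, len(padded)):
            ctx = tuple(padded[i - order + 1:i])
            ngrams[ctx + (padded[i],)] += 1
            contexts[ctx] += 1
    return ngrams, contexts


def sentence_logprob(words, model, order, vocab_size, k=1.0):
    """Add-k smoothed log probability of a sentence under one fixed-order model.

    Add-k smoothing is an assumption made for brevity; it is not taken from the paper.
    """
    ngrams, contexts = model
    padded = ["<s>"] * (order - 1) + list(words) + ["</s>"]
    total = 0.0
    for i in range(order - 1, len(padded)):
        ctx = tuple(padded[i - order + 1:i])
        num = ngrams[ctx + (padded[i],)] + k
        den = contexts[ctx] + k * vocab_size
        total += math.log(num / den)
    return total


def composite_score(words, models, weights, vocab_size):
    """Weighted sum of sentence-level log scores from models of different orders."""
    return sum(
        weights[order] * sentence_logprob(words, model, order, vocab_size)
        for order, model in models.items()
    )


def rescore_nbest(nbest, models, weights, vocab_size, lm_scale=1.0):
    """Sort hypotheses by acoustic score plus the scaled composite LM score."""
    scored = [
        (am + lm_scale * composite_score(hyp, models, weights, vocab_size), hyp)
        for hyp, am in nbest
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [hyp for _, hyp in scored]


# Toy data (hypothetical): three training sentences and a two-entry N-best list.
train = [["the", "cat", "sat"], ["the", "cat", "ran"], ["a", "dog", "sat"]]
vocab_size = len({w for s in train for w in s} | {"</s>"})
models = {n: train_ngram_counts(train, n) for n in (1, 2, 3)}
weights = {1: 0.2, 2: 0.3, 3: 0.5}  # assumed weights per model order
nbest = [(["cat", "the", "sat"], -10.0), (["the", "cat", "sat"], -10.0)]
reranked = rescore_nbest(nbest, models, weights, vocab_size)
print(reranked[0])
```

With equal acoustic scores, the ranking here is decided entirely by the composite LM score, so the hypothesis whose bigrams and trigrams were seen in training moves to the top. In a real system the per-order models could be replaced by FNN or RNN LMs, and the weights would be tuned on held-out data.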
