Voice activity detection using a periodicity measure


IEE Proceedings I (Communications, Speech and Vision)


The paper describes a voice activity detector (VAD) that can operate reliably in SNRs down to 0 dB and detect most speech at −5 dB. The detector applies a least-squares periodicity estimator to the input signal, and triggers when a significant amount of periodicity is found. It does not aim to find the exact talkspurt boundaries and, consequently, is most suited to speech-logging applications where it is easy to include a small margin to allow for any missed speech. The paper discusses the problem of false triggering on nonspeech periodic signals and shows how robustness to these signals can be achieved with suitable preprocessing and postprocessing.
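
The idea of triggering on a least-squares periodicity measure can be illustrated with a minimal sketch. This is not the paper's detector: it assumes one common form of least-squares periodicity estimate, in which the best-fitting signal of a given period is the phase-wise average of the frame, and the score is the fraction of frame energy that this periodic fit explains. The function names, the candidate period range (20 to 100 samples) and the threshold (0.4) are hypothetical values chosen for illustration, and the preprocessing and postprocessing that the paper uses to reject nonspeech periodic signals are not included.

```python
import numpy as np


def periodicity(frame, period):
    """Fraction of frame energy explained by the least-squares fit of a
    signal with the given period (in samples). Illustrative only."""
    k = len(frame) // period            # number of whole periods in the frame
    if k < 2:
        return 0.0                      # need at least two periods to compare
    x = np.asarray(frame[:k * period], dtype=float).reshape(k, period)
    total = float(np.sum(x ** 2))
    if total == 0.0:
        return 0.0                      # silent frame: no periodicity
    # The least-squares periodic fit is the mean over periods at each phase.
    template = x.mean(axis=0)
    return float(k * np.sum(template ** 2) / total)


def frame_is_periodic(frame, min_period=20, max_period=100, threshold=0.4):
    """Trigger when the best candidate period explains enough energy.
    The period range and threshold are assumed values, not from the paper."""
    best = max(periodicity(frame, p)
               for p in range(min_period, max_period + 1))
    return best >= threshold
```

In this sketch a clean periodic frame scores close to 1, while white noise scores roughly the candidate period divided by the frame length, so the frame should span several periods of the longest candidate. A pure tone would also trigger it, which is the false-triggering problem the abstract refers to.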


