Assessment of the impact of system phase response non-linearity on the speech signals quality

Аркадій Миколайович Продеус
К. П. Пилипенко
Александр Яковлевич Калюжный
С. Г. Бартенев

Abstract

It is shown that phase distortions of speech signals are acceptable to the human auditory system when the maximum difference between the group delay times at high and low frequencies is below 50 ms: with such a difference, interference between adjacent vowels and consonants is not perceived. Values of objective speech quality measures, namely the segmental signal-to-noise ratio (SSNR), log-spectral distortion (LSD), Bark spectral distortion (BSD) and perceptual evaluation of speech quality (PESQ), were found that correspond to the detected threshold of 50 ms. Bibl. 7, Fig. 6, Tab. 1.
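
As a rough illustration of two quantities named in the abstract, the sketch below estimates the group delay spread of a system from its sampled phase response and computes a segmental SNR between a reference and a processed signal. It is not the authors' procedure. The function names, the max-minus-min definition of the spread, the 256-sample frame length and the clamping of per-frame SNR to the range from -10 dB to 35 dB are all assumptions chosen for the example, as is the synthetic quadratic phase response.

```python
import math

def group_delay_ms(freqs_hz, phase_rad):
    """Group delay tau(f) = -(1 / (2*pi)) * d(phi)/df, in milliseconds.

    Estimated by finite differences between adjacent frequency samples.
    The phase must already be unwrapped.
    """
    delays = []
    for i in range(len(freqs_hz) - 1):
        dphi = phase_rad[i + 1] - phase_rad[i]
        df = freqs_hz[i + 1] - freqs_hz[i]
        delays.append(-dphi / (2.0 * math.pi * df) * 1000.0)
    return delays

def group_delay_spread_ms(freqs_hz, phase_rad):
    """Largest minus smallest group delay over the band (assumed definition)."""
    delays = group_delay_ms(freqs_hz, phase_rad)
    return max(delays) - min(delays)

def segmental_snr_db(reference, degraded, frame_len=256, lo=-10.0, hi=35.0):
    """Mean of per-frame SNR values in dB.

    Each frame SNR is clamped to [lo, hi]; the limits are an assumed,
    commonly used convention, not values taken from the article.
    Silent reference frames are skipped.
    """
    frame_values = []
    for start in range(0, len(reference) - frame_len + 1, frame_len):
        ref = reference[start:start + frame_len]
        deg = degraded[start:start + frame_len]
        signal = sum(r * r for r in ref)
        noise = sum((r - d) ** 2 for r, d in zip(ref, deg))
        if signal == 0.0:
            continue
        snr = hi if noise == 0.0 else 10.0 * math.log10(signal / noise)
        frame_values.append(min(max(snr, lo), hi))
    if not frame_values:
        raise ValueError("no usable frames")
    return sum(frame_values) / len(frame_values)

# Hypothetical system: group delay grows linearly from 0 to 40 ms over 0..4 kHz,
# which corresponds to the quadratic phase phi(f) = -pi * k * f**2.
k = 0.040 / 4000.0                      # seconds of delay per hertz
freqs = [10.0 * i for i in range(401)]  # 0..4000 Hz in 10 Hz steps
phase = [-math.pi * k * f * f for f in freqs]
spread = group_delay_spread_ms(freqs, phase)
acceptable = spread < 50.0              # threshold reported in the abstract

# Hypothetical signals: a sine and an attenuated copy of it.
reference = [math.sin(2.0 * math.pi * 5.0 * n / 256.0) for n in range(1024)]
degraded = [0.9 * r for r in reference]
ssnr = segmental_snr_db(reference, degraded)

print(f"group delay spread: {spread:.1f} ms, acceptable: {acceptable}")
print(f"segmental SNR: {ssnr:.1f} dB")
```

For a measured system, the phase samples would come from the unwrapped phase of its frequency response rather than from a closed-form expression, and a finer frequency grid gives a smoother group delay estimate.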

Article Details

How to Cite
Продеус, А. М., Пилипенко, К. П., Калюжный, А. Я., & Бартенев, С. Г. (2015). Assessment of the impact of system phase response non-linearity on the speech signals quality. Electronics and Communications, 20(2). https://doi.org/10.20535/2312-1807.2015.20.2.47733
Section
Theory of signals and systems

References

Martin R., Heute U., Antweiler C. (Eds.) (2008), Advances in Digital Speech Transmission. John Wiley & Sons Ltd, England, P. 572.

Blauert J. (1978), Group delay distortions in electroacoustical systems. J. Acoust. Soc. Am. Vol.63, No.5. Pp. 1478-1483.

Habets E.A.P. (2007), Single- and Multi-Microphone Speech Dereverberation using Spectral Enhancement. PhD dissertation, Eindhoven, P. 257.

Perceptual Evaluation of Speech Quality (PESQ) ITU-T Recommendations P.862, P.862.1, P.862.2. Version 2.0. October 2005.

Didovskiy V.S., Didovskaia M.V., Prodeus A.N. (2008), “Acoustic assessment of speech communication channels. Monograph,” K.: Imex-Ltd, P. 420. (Rus)

Oppenheim A., Schafer R. (2006), “Digital signal processing,” M.: Technosphera, P. 858. (Rus)

Smirnova N.S., Chistikov P.G. (2011), “Phonetic analysis program in statistics in Russian texts and its use for applications in the field of speech technology,” Proc. XXVII Intern. Conf. «Dialog», M., Pp. 632-644 (Rus)