Objective quality evaluation of speech band-limited signals
Main Article Content
Abstract
Dependence of objective quality evaluation of speech band-limited signals is experimentally obtained. As part of this task, a comparison of the considered indicators of the speech quality had been made. It is shown that computationally simple indicators, such as segmental SNR (SSNR) and log-spectral distortion (LSD), may not adequately respond to changes in bandwidth. More complex computationally perceptual indicators, such as bark spectral distortion (BSD) and perceptual evaluation of speech quality (PESQ), behave much more correct and, in the end, clarify the real needs of the human auditory system to speech perception.
Reference 14, figures 5.
Article Details
![Creative Commons License](http://i.creativecommons.org/l/by/4.0/88x31.png)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
References
Beerends J., Wijngaarden S., Buuren R. Extension of ITU-T Recommendation P.862 PESQ towards Measuring Speech Intelligibility with Vocoders. New Directions for Improving Audio Effectiveness // Meeting Proceedings RTO-MP-HFM-123, Paper 10, P.10-1–10-6. Neuilly-sur-Seine, France: RTO. [Online]. Available: http://www.rto.nato.int/abstracts.aps
Blauert J., ed. Communication acoustics. – Springer-Verlag Berlin Heidelberg, 2005. – 385 p.
Cote N. Integral and diagnostic intrusive prediction of speech - Springer-Verlag Berlin Heidelberg, 2011. – 267 p.
Ellis D. PLP and RASTA in Matlab // [Online]. Available: http://www.ee.columbia.edu/~dpwe/resources/matlab/rastamat/
Hermansky H. Perceptual Linear Prediction (PLP) analysis of speech // J. Acoust. Soc. America. – 1990. – Vol. 87. – P. 1738-1753.
Loizou P. Matlab Software. PESQ and other objective measures for evaluating quality of speech // [Online]. Available: http://ecs.utdallas.edu/loizou/speech/software.htm
Moller S. Quality of Telephone-Based Spoken Dialogue Systems – Springer Science + Business Me-dia, Inc., 2005. – 490 p.
Naylor P., Gaubitch N. Speech Dereverberation. – Springer, 2010. – 399 p.
Next-Generation (3G/4G) Voice Quality Testing with POLQA®. White Paper. – Rohde & Schwarz, 2012. – 22 p.
Perceptual Evaluation of Speech Quality (PESQ) ITU-T Recommendations P.862, P.862.1, P.862.2. Version 2.0 – October 2005.
Perceptual Objective Listening Quality Assessment (POLQA) ITU-T Recommendations P.863 – Janu-ary 2011.
Prodeus A. PESQ Matlab Driver // MathWorks, 2014. [Online]. Available: http://www.mathworks.com/matlabcentral/fileexchange/47333-pesq-matlab-driver
Raake A. Speech Quality of VoIP. Assessment and Prediction. – John Wiley, 2006. - 338 p.
Recommendation P.862. Amendment 2 (11/05), 2011. [Online]. Available: http://www.itu.int/rec/T-REC-P.862-200511-I!Amd2/en