Subjective Evaluation of the Speech Intelligibility on the Background of Noise and Reverberation
Main Article Content
Abstract
In this paper, the results of speech intelligibility subjective assessment of Ukrainian speech monosyllabic sound combinations against background noise and reverberation through articulation tests are presented. The evaluation was carried out with the help of specially developed software that allowed automating and thus greatly facilitating and accelerating the procedure of articulation tests. Special text and sound tables of the Ukrainian speech monosyllables of the consonant-vowel-consonant (CVC) type were developed for the tests. The recordings of pure signals were made in the muffled room of the Acoustics and Acoustoelectronics Department of the National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute". The recording was performed at a frequency of 22050 Hz and a bit depth of 16 bits. Speech monosyllables were read using a verbal environment. Listening was done for four situations: pure language; speech distorted by noise; speech distorted by reverberation; speech distorted by the combined effect of noise and reverberation. In the first case, speech monosyllables of 3 articulation tables were listened, each of which contained 50 monosyllables. In the second case, speech distorted by the additive noise with the signal-to-noise ratios minus 10 dB, 0 dB and plus 10 dB was listened. In this case, models of white, pink and brown noises were used, the masking property of which is rather well-studied. In the third case, the reverberant speech for reverberation times from 0.3 to 2.7 s was modeled by convolution of pure speech signals with impulse characteristics of various rooms, and in the fourth case the joint action of pink noise and reverberation was considered. It turned out that the masking ability of white noise exceeds one for brown noise for signal-to-noise ratios (SNR) less than minus 5 dB, which is not entirely consistent with previous preliminary predictive estimates. Signals listening had been made by two ways, namely through the headphones and by means of acoustic monitors. The SNR varied in the range ‑10…+10 dB, and the reverberation time varied in the range 0.3…2.7 s. In addition, it turned out that listening to speech distorted by noise through acoustic monitors could lead to a significant increase in the intelligibility of speech, compared to the case of listening through headphones. With a signal-to-noise ratio of minus 10 dB, the values of speech intelligibility scores increased from 0.1-0.3 to 0.85-0.93. Similar results were obtained for the reverberation action: the speech intelligibility increased from 0.65 to 0.94 for the reverberation time of 2.7 s. This speech intelligibility growth can be explained, firstly, by the action of early reflections. Secondly, usage of two loudspeakers as sound sources and also binaural listening can be also considered as reasons of speech intelligibility increase. Nevertheless, these reasons for increasing the intelligibility of speech are hardly the only ones. Therefore, it is necessary to further study the causes of the revealed significant increase in speech intelligibility. Among such reasons can be both features of the developed automated system of articulation tests, and features of the psychophysical state of listeners.
Ref. 13, fig. 5.
Article Details

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
References
V. S. Didkovskyi, M. V. Didkovskaya, and A. N. Prodeus, Akusticheskaja ekspertiza kanalov rechevoj kommunikacii. Monografija [Acoustic examination of the channels of speech communication. Monograph], Kyiv: Imex-LTD, 2008. ISBN 978-966-8861-85-7
A. N. Prodeus, V. S. Didkovskyi, and M. V. Didkovskaya, Akusticheskaya ekspertiza i korrekciya kommunikacionnyh kanalov. [Acoustic examination and correction of communication channels. Monograph], Saarbrucken: LAP LAMBERT Academic Publishing, OmniScriptum GmbH & Co. KG, 2017. ISBN: 978-3-330-04591-0
O. Arkhypova, V. Zhuravlov, and V. Kumeiko, "Artykuliatsiini tablytsi sliv ukrainskoi movy [Articulation tables of words of the Ukrainian language]," vol. 19, no. 2, pp. 13-17, 2009. URL: http://ela.kpi.ua/handle/123456789/9689
A. E. Arhipov, and E. A. Arhipova, "Analiz i obrabotka dannyh artikulyacionnyh ispytanij [Analysis and processing of articulatory test data]," Scientific and Practical Journal "Information Protection", no. 4, pp. 34-42, 2012. URL: http://aprodeus.narod.ru/Teaching/ZAI/Art_2012_Arhipova_Analis_artikul_tabl.pdf
V. G. Mihajlov, "Diagnosticheskie artikulyacionnye tablicy [Diagnostic articulation tables]," Acoustic Journal, vol. 48, no. 5, pp. 705-712, 2002. URL: http://www.akzh.ru/pdf/2002_5_705-712.pdf
"ANG-2200. Acoustic Noise Generator. Owner’s Guide," Research Electronics, Intl. 455 Security Place Algood, TN 38506, USA. [Online]. Available: https://reiusa.net/wp-content/uploads/2017/11/ANG_Manual_revG.pdf
A. M. Prodeus, A. V. Vityk, and D. Yu. Didenko, "Sub’iektyvne otsiniuvannia yakosti ta rozbirlyvosti movlennievykh syhnaliv, spotvorenykh syntezovanymy shumamy [Subjective evaluation of quality and intelligibility of speech distorted by synthesized noise]," Microsystems, Electronics and Acoustics, vol. 22, no. 6, pp. 45-63, 2017. DOI: 10.20535/2523-4455.2017.22.6.101929
A. Prodeus, K. Bukhta, P. Morozko, O. Serhiienko, I. Kotvytskyi, and I. Shherbenko, "14th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering (TCSET)," in Automated System for Subjective Evaluation of the Ukrainian Speech Intelligibility, Lviv-Slavske, Ukraine, February 20-24, 2018 (pre-print).
J. Bradley, H. Sato, M. Picard, "On the importance of early reflections for speech in rooms," Journal of Acoustical Society of America, vol. 113, no. 6, pp. 3233-3244, June 2003. DOI: 10.1121/1.1570439
J. S. Bradley, "Speech intelligibility studies in classrooms," Journal of the Acoustical Society of America, vol. 80, no. 3, pp. 846-854, 1986. DOI: 10.1121/1.393908
W. Yang and J. S. Bradley, "Effects of room acoustics on the intelligibility of speech in classrooms for young children," Journal of the Acoustical Society of America, vol. 125, no. 2, pp. 922-933, 2009. DOI: 10.1121/1.3058900
I. ArweIler, J. Buchholz, and T. Dau, "Proceedings of 2nd International Symposium on Auditory and Audiological Research (ISAAR 2009)," in Speech intelligibility enhancement by early reflections, Elsinore, Denmark, August 2009. URL: http://orbit.dtu.dk/en/publications/speech-intelligibility-enhancement-by-early-reflections(368caba2-dca2-461f-bc45-d8615f1dd675).html
Yu. K. Makarov, and A. A. Horev, "K ocenke effektivnosti zashity akusticheskoj (rechevoj) informacii [To assess the effectiveness of protection of acoustic (speech) information]," Information Security, no. 4, 2005. URL: http://www.vrsystems.ru/stati/k_ocenke_effektivnosti_zashiti_akusticheskoi_(rechevoi)_informacii.htm