DOI: https://doi.org/10.20535/2523-4455.mea.209928
Зображення обкладинки

Апаратно-програмний комплекс «Штучна голова». Частина 2. Оцінювання розбірливості мови в аудиторіях

Olexandr Oleksandrovych Dvornyk, Daria Ievheniivna Motorniuk, Maryna Vitaliivna Didkovska, Arkadii Mykolaiovych Prodeus

Анотація


Виконано експериментальні дослідження можливості використання розробленого апаратно-програмного комплексу «Штучна голова» для двоканального оцінювання розбірливості мови, спотвореної реверберацією. На першому етапі такого оцінювання здійснюють запис відгуку приміщення на тестовий сигнал у вигляді mls-послідовності. На другому етапі оцінюють імпульсну характеристику приміщення шляхом обчислення взаємно-кореляційної функції відгуку із тестовим сигналом. На третьому етапі розраховують модуляційні коефіцієнти за формулою Шредера, використовуючи оцінку імпульсної характеристики приміщення. На четвертому, останньому етапі, оцінюють розбірливість мови модуляційним або формантно-модуляційним методом. Результати проведених досліджень узгоджуються із результатами попередніх досліджень, де показано, що розбірливість мови в середині приміщення є меншою за таку біля стінки приміщення.

Ключові слова


розбірливість мови; реверберація; тестовий сигнал; імпульсна характеристика; модуляційний метод; формантно-модуляційний метод

Повний текст:

PDF

Перелік посилань для Cited-By Linking


O. O. Dvornyk, D. I. Motorniuk, M. V. Didkovska, and A. M. Prodeus, “Artificial Software Complex ‘Artificial Head’. Part 1 Adjusting the Frequency Response of the Path,” Microsystems, Electron. Acoust., vol. 25, no. 1, pp. 56–64, Jul. 2020, DOI: 10.20535/2523-4455.mea.198431.

J. Blauert, Ed., The Technology of Binaural Listening. Berlin, Heidelberg: Springer Berlin Heidelberg, 2013, ISBN: 978-3-642-37761-7.

“HEAD acoustics - Telecom Division - Binaural Recording Systems - Artificial Head Measurement System HMS II.3 - Overview.” [Online]. Available: https://www.head-acoustics.com/eng/telecom_hms_II_3.htm. [Accessed: 23-Mar-2020].

P. L. SØndergaard and P. Majdak, “The auditory modeling toolbox,” in The Technology of Binaural Listening, Springer Berlin Heidelberg, 2013, pp. 33–56.

M. Jeub, M. Schafer, and P. Vary, “A binaural room impulse response database for the evaluation of dereverberation algorithms,” in 2009 16th International Conference on Digital Signal Processing, 2009, pp. 1–5, DOI: 10.1109/ICDSP.2009.5201259.

“IKS: Aachen Impulse Response Database.” [Online]. Available: https://www.iks.rwth-aachen.de/en/research/tools-downloads/databases/aachen-impulse-response-database/. [Accessed: 23-Mar-2020].

Y. S. Kostyuchok, L. S. Martynovych, D. Y. Motorniuk, V. O. Nechytailo, A. V. Hrapachevskiy, and A. M. Prodeus, “Acoustic certification of classrooms,” Electron. Commun., vol. 21, no. 2, pp. 63–70, Nov. 2016, DOI: 10.20535/2312-1807.2016.21.2.82217.

“SOUND FORGE Audio Studio 14 – Домашняя студия редактирования звука.” [Online]. Available: https://www.magix.com/ru/muzyka/sound-forge/sound-forge-audio-studio/. [Accessed: 23-Mar-2020].

A. Prodeus, V. Didkovskyi, M. Didkovska, I. Kotvytskyi, and D. Motorniuk, “Automated Subjective Assessment of Speech Intelligibility Under Diotic and Dichotic Listening,” in Data-Centric Business and Applications, Springer, Cham, 2020, pp. 1–16, URL: http://link.springer.com/10.1007/978-3-030-43070-2_1. DOI: 10.1007/978-3-030-43070-2_1

E. A. P. Habets, N. D. Gaubitch, and P. A. Naylor, “Temporal selective dereverberation of noisy speech using one microphone,” in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, pp. 4577–4580, DOI: 10.1109/ICASSP.2008.4518675.

G. A. Soulodre, N. Popplewell, and J. S. Bradley, “Combined effects of early reflections and background noise on speech intelligibility,” J. Sound Vib., vol. 135, no. 1, pp. 123–133, Nov. 1989, DOI: 10.1016/0022-460X(89)90759-1.

J. S. Bradley, H. Sato, and M. Picard, “On the importance of early reflections for speech in rooms,” J. Acoust. Soc. Am., vol. 113, no. 6, p. 3233, 2003, DOI: 10.1121/1.1570439.

I. Arweiler, J. Buchholz, and T. Dau, “Speech intelligibility enhancement by early reflections | Proceedings of the International Symposium on Auditory and Audiological Research,” in Speech processing and perception under adverse conditions, 2009, pp. 289–298, URL: https://proceedings.isaar.eu/index.php/isaarproc/article/view/2009-29.

H. Sato and J. S. Bradley, “Evaluation of acoustical conditions for speech communication in working elementary school classrooms,” J. Acoust. Soc. Am., vol. 123, no. 4, pp. 2064–2077, 2008, DOI: 10.1121/1.2839283.

V. S. Didkovskyi, M. V. Didkovskaya, and A. M. Prodeus, Komp’yuterna obrobka akustychnykh syhnaliv. Navchalʹnyy posibnyk. [Computer processing of acoustic signals. Tutorial.]. Kyiv: Imecs-LTD, 2010.

A. N. Tikhonov, “O nekorrektnykh zadachakh lineynoy algebry i ustoychivom metode ikh resheniya [On ill-posed problems of linear algebra and a stable method for solving them],” DAN USSR, vol. 163, no. 3, pp. 591–594, 1965.

H. J. M. Steeneken and T. Houtgast, “A physical method for measuring speech‐transmission quality,” J. Acoust. Soc. Am., vol. 67, no. 1, pp. 318–326, 1980, DOI: 10.1121/1.384464.

H. J. . Steeneken and T. Houtgast, “Validation of the revised STIr method,” Speech Commun., vol. 38, no. 3–4, pp. 413–425, Nov. 2002, DOI: 10.1016/S0167-6393(02)00010-9.

A. M. Prodeus, L. B. Dronzhevskaya, V. A. Klimkov, and D. A. Shagitova, “Formantnyy i formantno-modulyatsionnyy metody otsenki razborchivosti rechi. Chast’ 1. Unifikatsiya algoritmov,” Electron. Commun., vol. 15, no. 6 (part 2), pp. 117–124, 2010.

A. N. Prodeus, L. B. Dronzhevskaya, V. A. Klimkov, and D. A. Shagitova, “Formantnyy i formantno-modulyatsionnyy metody otsenki razborchivosti rechi. Chast’ 2. Tochnost’ i skorost’ izmereniy [Formant and formant-modulation methods for assessing speech intelligibility. Part 2. Accuracy and speed of measurements.],” Electron. Commun., vol. 16, no. 6, pp. 16–24, 2011.

M. R. Schroeder, “Modulation Transfer Functions: Definition and Measurement,” Acta Acust. united with Acust., vol. 49, no. 3, pp. 79-182(4), 1981, URL: https://www.ingentaconnect.com/content/dav/aaua/1981/00000049/00000003/art00004?crawler=true&mimetype=application/pdf.

N. B. Pokrovskiy, Raschet i izmerenie razborchivosti rechi [Prediction and measurement of speech intelligibility]. Moskow, USSR: Svyazizdat, 1962.

M. A. Sapozhkov, Rechevoy signal v kibernetike i svyazi [Speech signal in cybernetics and communication]. Moscow: Svyazizdat, 1963.

A. M. Prodeus, V. S. Didkovskyi, and M. V. Didkovskaya, Akusticheskaya ekspertiza i korrektsiya kommunikatsionnykh kanalov [Acoustic examination and correction of communication channels]. LAP LAMBERT Academic Publishing, 2017, ISBN: 978-3-330-04591-0.

V. E. Gantmakher, N. E. Bystrov, and D. V. Chebotarev, Shumopodobnyye signaly. Analiz, sintez, obrabotka [Noise-like signals. Analysis, synthesis, processing]. St. Peterburg: Nauka i tekhnika, 2005.

M. R. Schroeder, “New Method of Measuring Reverberation Time,” J. Acoust. Soc. Am., vol. 37, no. 6, pp. 1187–1188, Jun. 1965, DOI: 10.1121/1.1939454.

U. Zölzer, Ed., DAFX: Digital Acoustic Effects, 2nd ed. John Wiley & Sons, 2011, ISBN: 978-0-470-66599-2.


Перелік посилань


  1. O. O. Dvornyk, D. I. Motorniuk, M. V. Didkovska, and A. M. Prodeus, “Artificial Software Complex ‘Artificial Head’. Part 1 Adjusting the Frequency Response of the Path,” Microsystems, Electron. Acoust., vol. 25, no. 1, pp. 56–64, Jul. 2020, DOI: 10.20535/2523-4455.mea.198431.
  2. J. Blauert, Ed., The Technology of Binaural Listening. Berlin, Heidelberg: Springer Berlin Heidelberg, 2013, ISBN: 978-3-642-37761-7.
  3. “HEAD acoustics - Telecom Division - Binaural Recording Systems - Artificial Head Measurement System HMS II.3 - Overview.” [Online]. Available: https://www.head-acoustics.com/eng/telecom_hms_II_3.htm. [Accessed: 23-Mar-2020].
  4. P. L. SØndergaard and P. Majdak, “The auditory modeling toolbox,” in The Technology of Binaural Listening, Springer Berlin Heidelberg, 2013, pp. 33–56.
  5. M. Jeub, M. Schafer, and P. Vary, “A binaural room impulse response database for the evaluation of dereverberation algorithms,” in 2009 16th International Conference on Digital Signal Processing, 2009, pp. 1–5, DOI: 10.1109/ICDSP.2009.5201259.
  6. “IKS: Aachen Impulse Response Database.” [Online]. Available: https://www.iks.rwth-aachen.de/en/research/tools-downloads/databases/aachen-impulse-response-database/. [Accessed: 23-Mar-2020].
  7. Y. S. Kostyuchok, L. S. Martynovych, D. Y. Motorniuk, V. O. Nechytailo, A. V. Hrapachevskiy, and A. M. Prodeus, “Acoustic certification of classrooms,” Electron. Commun., vol. 21, no. 2, pp. 63–70, Nov. 2016, DOI: 10.20535/2312-1807.2016.21.2.82217.
  8. “SOUND FORGE Audio Studio 14 – Домашняя студия редактирования звука.” [Online]. Available: https://www.magix.com/ru/muzyka/sound-forge/sound-forge-audio-studio/. [Accessed: 23-Mar-2020].
  9. A. Prodeus, V. Didkovskyi, M. Didkovska, I. Kotvytskyi, and D. Motorniuk, “Automated Subjective Assessment of Speech Intelligibility Under Diotic and Dichotic Listening,” in Data-Centric Business and Applications, Springer, Cham, 2020, pp. 1–16, URL: http://link.springer.com/10.1007/978-3-030-43070-2_1. DOI: 10.1007/978-3-030-43070-2_1
  10. E. A. P. Habets, N. D. Gaubitch, and P. A. Naylor, “Temporal selective dereverberation of noisy speech using one microphone,” in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, pp. 4577–4580, DOI: 10.1109/ICASSP.2008.4518675.
  11. G. A. Soulodre, N. Popplewell, and J. S. Bradley, “Combined effects of early reflections and background noise on speech intelligibility,” J. Sound Vib., vol. 135, no. 1, pp. 123–133, Nov. 1989, DOI: 10.1016/0022-460X(89)90759-1.
  12. J. S. Bradley, H. Sato, and M. Picard, “On the importance of early reflections for speech in rooms,” J. Acoust. Soc. Am., vol. 113, no. 6, p. 3233, 2003, DOI: 10.1121/1.1570439.
  13. I. Arweiler, J. Buchholz, and T. Dau, “Speech intelligibility enhancement by early reflections | Proceedings of the International Symposium on Auditory and Audiological Research,” in Speech processing and perception under adverse conditions, 2009, pp. 289–298, URL: https://proceedings.isaar.eu/index.php/isaarproc/article/view/2009-29.
  14. H. Sato and J. S. Bradley, “Evaluation of acoustical conditions for speech communication in working elementary school classrooms,” J. Acoust. Soc. Am., vol. 123, no. 4, pp. 2064–2077, 2008, DOI: 10.1121/1.2839283.
  15. V. S. Didkovskyi, M. V. Didkovskaya, and A. M. Prodeus, Komp’yuterna obrobka akustychnykh syhnaliv. Navchalʹnyy posibnyk. [Computer processing of acoustic signals. Tutorial.]. Kyiv: Imecs-LTD, 2010.
  16. A. N. Tikhonov, “O nekorrektnykh zadachakh lineynoy algebry i ustoychivom metode ikh resheniya [On ill-posed problems of linear algebra and a stable method for solving them],” DAN USSR, vol. 163, no. 3, pp. 591–594, 1965.
  17. H. J. M. Steeneken and T. Houtgast, “A physical method for measuring speech‐transmission quality,” J. Acoust. Soc. Am., vol. 67, no. 1, pp. 318–326, 1980, DOI: 10.1121/1.384464.
  18. H. J. . Steeneken and T. Houtgast, “Validation of the revised STIr method,” Speech Commun., vol. 38, no. 3–4, pp. 413–425, Nov. 2002, DOI: 10.1016/S0167-6393(02)00010-9.
  19. A. M. Prodeus, L. B. Dronzhevskaya, V. A. Klimkov, and D. A. Shagitova, “Formantnyy i formantno-modulyatsionnyy metody otsenki razborchivosti rechi. Chast’ 1. Unifikatsiya algoritmov,” Electron. Commun., vol. 15, no. 6 (part 2), pp. 117–124, 2010.
  20. A. N. Prodeus, L. B. Dronzhevskaya, V. A. Klimkov, and D. A. Shagitova, “Formantnyy i formantno-modulyatsionnyy metody otsenki razborchivosti rechi. Chast’ 2. Tochnost’ i skorost’ izmereniy [Formant and formant-modulation methods for assessing speech intelligibility. Part 2. Accuracy and speed of measurements.],” Electron. Commun., vol. 16, no. 6, pp. 16–24, 2011.
  21. M. R. Schroeder, “Modulation Transfer Functions: Definition and Measurement,” Acta Acust. united with Acust., vol. 49, no. 3, pp. 79-182(4), 1981, URL: https://www.ingentaconnect.com/content/dav/aaua/1981/00000049/00000003/art00004?crawler=true&mimetype=application/pdf.
  22. N. B. Pokrovskiy, Raschet i izmerenie razborchivosti rechi [Prediction and measurement of speech intelligibility]. Moskow, USSR: Svyazizdat, 1962.
  23. M. A. Sapozhkov, Rechevoy signal v kibernetike i svyazi [Speech signal in cybernetics and communication]. Moscow: Svyazizdat, 1963.
  24. A. M. Prodeus, V. S. Didkovskyi, and M. V. Didkovskaya, Akusticheskaya ekspertiza i korrektsiya kommunikatsionnykh kanalov [Acoustic examination and correction of communication channels]. LAP LAMBERT Academic Publishing, 2017, ISBN: 978-3-330-04591-0.
  25. V. E. Gantmakher, N. E. Bystrov, and D. V. Chebotarev, Shumopodobnyye signaly. Analiz, sintez, obrabotka [Noise-like signals. Analysis, synthesis, processing]. St. Peterburg: Nauka i tekhnika, 2005.
  26. M. R. Schroeder, “New Method of Measuring Reverberation Time,” J. Acoust. Soc. Am., vol. 37, no. 6, pp. 1187–1188, Jun. 1965, DOI: 10.1121/1.1939454.
  27. U. Zölzer, Ed., DAFX: Digital Acoustic Effects, 2nd ed. John Wiley & Sons, 2011, ISBN: 978-0-470-66599-2.






Copyright (c) 2020 Дворник О. О., Моторнюк Д. Є., Дідковська М. В., Продеус А. М.

Creative Commons License
Ця робота ліцензована Creative Commons Attribution 4.0 International License.

ISSN: 2523-4447
e-ISSN: 2523-4455