Binaural Capability of Locating Sound Sources of Information Signals

Main Article Content

Mariia Volodymyrivna Vdovenko
https://orcid.org/0000-0003-1666-3389
Svetlana Andriivna Luniova
https://orcid.org/0000-0003-0683-1211

Abstract

The purpose of the work done was to estimate the human binaural capability of locating stereo sound sources of information signals, especially speech, choir singing, and symphonic music as compared to pulse signals. The measurements were conducted in an average-sized hall where sound was emitted by a stereo system consisting of two loudspeakers with the base width of 4.5 m.

Based on the results of analysis of existing methods of binaural sound perception modelling, the estimation was performed based on correlational processing of signals recorded using a head dummy placed in various points of the room.

For the recording points selected, the maximum time delays between signals arriving at a listener’s left and right ears were calculated, and the interaural cross-correlation functions were obtained. The general functions, especially the peak shift, correlation interval, and peak sharpness (including the conditions when it splits into two individual ones), were analysed. The correlation factor values and levels were calculated. As a result, the conclusions regarding the capability of locating an imaginary sound source were made based on the cross-correlation function factors values, which simplified the application of this method in practice.

Based on the results of experiments conducted and on the subjective sensations of perception, the authors have come to the conclusion that a stereophony area has sizes much wider than those assumed earlier from the signal time delay of 1 ms at a receiving stereo pair (representing a shift of an imaginary source towards the ear perceiving the signal earlier).

At the same time, it was found that signals with a speech component are much harder to locate than pulse signals. While music signals, especially symphonic music, are close to pulse signals in terms of human locating capabilities.

The found patterns have allowed us to introduce adjustments to the stereophony area calculations. Based on the research results, we suggest defining the stereophony zone border by the correlation factor value of 0.5. Given that, the interaural cross-correlation function properties and subjective perception provide for acceptable speech legibility and music transparency. The key conclusion is that this area is quite narrow for speech signals and is actually limited to the time delay of signals arriving at left and right ears – around 1 ms. Music signals have a wider stereophony area defined by the time delay between perception binaural pair components of around 10 ms. Therefore, sizes of a stereophonic sound area ought to be defined with regard to an information signal’s type.

Ref. 14, fig. 3, tabl. 3.

Article Details

How to Cite
[1]
M. V. Vdovenko and S. A. Luniova, “Binaural Capability of Locating Sound Sources of Information Signals”, Мікросист., Електрон. та Акуст., vol. 23, no. 6, pp. 58–65, Dec. 2018.
Section
Acoustical devices and systems

References

L. A. Jeffress, “A place theory of sound localization.,” J. Comp. Physiol. Psychol., vol. 41, no. 1, pp. 35–39, 1948, DOI: 10.1037/h0061495.

A. Kohlrausch, J. Braasch, D. Kolossa, and J. Blauert, “An Introduction to Binaural Processing,” in The Technology of Binaural Listening, J. Blauert, Ed. Springer-Verlag Berlin Heidelberg, 2013, pp. 1–32.

T. May, S. van de Par, and A. Kohlrausch, “Binaural Localization and Detection of Speakers in Complex Acoustic Scenes,” in The Technology of Binaural Listening, J. Blauert, Ed. Springer-Verlag Berlin Heidelberg, 2013, pp. 397–425.

E. Vincent, R. Gribonval, and C. Fevotte, “Performance measurement in blind audio source separation,” IEEE Trans. Audio, Speech Lang. Process., vol. 14, no. 4, pp. 1462–1469, 2006, DOI: 10.1109/TSA.2005.858005.

M. L. Jepsen, S. D. Ewert, and T. Dau, “A computational model of human auditory signal processing and perception,” J. Acoust. Soc. Am., vol. 124, no. 1, pp. 422–438, 2008, DOI: 10.1121/1.2924135.

E. C. Cherry and B. M. A. Sayers, “‘Human “Cross-Correlator”’—A Technique for Measuring Certain Parameters of Speech Perception,” J. Acoust. Soc. Am., vol. 28, no. 5, pp. 889–895, 1956, DOI: 10.1121/1.1908506.

V.V. Furduyev Akusticheskiye osnovy veshchaniya. Moscow, USSR: Gossudarstvennoye izdatel'stvo literatury po voprosam svyazi i radio, 1980, p.318.

J. Blauert and W. Cobben, “Some Consideration of Binaural Cross Correlation Analysis,” Acta Acust. united with Acust., vol. 39, no. 2, pp. 96–104, 1978.

R. M. Stern and H. S. Colburn, “Theory of binaural interaction based on auditory‐nerve data. IV. A model for subjective lateral position,” J. Acoust. Soc. Am., vol. 64, no. 1, pp. 127–140, 1978, DOI: 10.1121/1.381978.

H. Haas, “The Influence of a Single Echo on the Audibility of Speech,” J. Audio Eng. Soc., vol. 20, no. 2, pp. 146–159, 1972.

Y. A. Kovalgin, Stereofonia [Stereophony]. Moscow, USSR: Radio i svyaz, 1989.

Y. S. Vakhitov, Teoreticheskiye osnovy elektroakustiki i elektroakusticheskaya apparatura [Theoretical fundamentals of electroacoustics and electroacoustic equipment]. Moscow, USSR: Iskusstvo, 1982.

R. Otnes and L. Enokson, Prikladnoy analiz vremennykh ryadov [Applied time series analysis]. Moscow, USSR, 1982.

J. Blauert, Prostranstvennyy slukh [Spatial hearing]. Moscow, USSR: Enrgiya, 1979.