Binaural Capability of Locating Sound Sources of Information Signals
Main Article Content
Abstract
The purpose of the work done was to estimate the human binaural capability of locating stereo sound sources of information signals, especially speech, choir singing, and symphonic music as compared to pulse signals. The measurements were conducted in an average-sized hall where sound was emitted by a stereo system consisting of two loudspeakers with the base width of 4.5 m.
Based on the results of analysis of existing methods of binaural sound perception modelling, the estimation was performed based on correlational processing of signals recorded using a head dummy placed in various points of the room.
For the recording points selected, the maximum time delays between signals arriving at a listener’s left and right ears were calculated, and the interaural cross-correlation functions were obtained. The general functions, especially the peak shift, correlation interval, and peak sharpness (including the conditions when it splits into two individual ones), were analysed. The correlation factor values and levels were calculated. As a result, the conclusions regarding the capability of locating an imaginary sound source were made based on the cross-correlation function factors values, which simplified the application of this method in practice.
Based on the results of experiments conducted and on the subjective sensations of perception, the authors have come to the conclusion that a stereophony area has sizes much wider than those assumed earlier from the signal time delay of 1 ms at a receiving stereo pair (representing a shift of an imaginary source towards the ear perceiving the signal earlier).
At the same time, it was found that signals with a speech component are much harder to locate than pulse signals. While music signals, especially symphonic music, are close to pulse signals in terms of human locating capabilities.
The found patterns have allowed us to introduce adjustments to the stereophony area calculations. Based on the research results, we suggest defining the stereophony zone border by the correlation factor value of 0.5. Given that, the interaural cross-correlation function properties and subjective perception provide for acceptable speech legibility and music transparency. The key conclusion is that this area is quite narrow for speech signals and is actually limited to the time delay of signals arriving at left and right ears – around 1 ms. Music signals have a wider stereophony area defined by the time delay between perception binaural pair components of around 10 ms. Therefore, sizes of a stereophonic sound area ought to be defined with regard to an information signal’s type.
Ref. 14, fig. 3, tabl. 3.
Article Details

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
References
L. A. Jeffress, “A place theory of sound localization.,” J. Comp. Physiol. Psychol., vol. 41, no. 1, pp. 35–39, 1948, DOI: 10.1037/h0061495.
A. Kohlrausch, J. Braasch, D. Kolossa, and J. Blauert, “An Introduction to Binaural Processing,” in The Technology of Binaural Listening, J. Blauert, Ed. Springer-Verlag Berlin Heidelberg, 2013, pp. 1–32.
T. May, S. van de Par, and A. Kohlrausch, “Binaural Localization and Detection of Speakers in Complex Acoustic Scenes,” in The Technology of Binaural Listening, J. Blauert, Ed. Springer-Verlag Berlin Heidelberg, 2013, pp. 397–425.
E. Vincent, R. Gribonval, and C. Fevotte, “Performance measurement in blind audio source separation,” IEEE Trans. Audio, Speech Lang. Process., vol. 14, no. 4, pp. 1462–1469, 2006, DOI: 10.1109/TSA.2005.858005.
M. L. Jepsen, S. D. Ewert, and T. Dau, “A computational model of human auditory signal processing and perception,” J. Acoust. Soc. Am., vol. 124, no. 1, pp. 422–438, 2008, DOI: 10.1121/1.2924135.
E. C. Cherry and B. M. A. Sayers, “‘Human “Cross-Correlator”’—A Technique for Measuring Certain Parameters of Speech Perception,” J. Acoust. Soc. Am., vol. 28, no. 5, pp. 889–895, 1956, DOI: 10.1121/1.1908506.
V.V. Furduyev Akusticheskiye osnovy veshchaniya. Moscow, USSR: Gossudarstvennoye izdatel'stvo literatury po voprosam svyazi i radio, 1980, p.318.
J. Blauert and W. Cobben, “Some Consideration of Binaural Cross Correlation Analysis,” Acta Acust. united with Acust., vol. 39, no. 2, pp. 96–104, 1978.
R. M. Stern and H. S. Colburn, “Theory of binaural interaction based on auditory‐nerve data. IV. A model for subjective lateral position,” J. Acoust. Soc. Am., vol. 64, no. 1, pp. 127–140, 1978, DOI: 10.1121/1.381978.
H. Haas, “The Influence of a Single Echo on the Audibility of Speech,” J. Audio Eng. Soc., vol. 20, no. 2, pp. 146–159, 1972.
Y. A. Kovalgin, Stereofonia [Stereophony]. Moscow, USSR: Radio i svyaz, 1989.
Y. S. Vakhitov, Teoreticheskiye osnovy elektroakustiki i elektroakusticheskaya apparatura [Theoretical fundamentals of electroacoustics and electroacoustic equipment]. Moscow, USSR: Iskusstvo, 1982.
R. Otnes and L. Enokson, Prikladnoy analiz vremennykh ryadov [Applied time series analysis]. Moscow, USSR, 1982.
J. Blauert, Prostranstvennyy slukh [Spatial hearing]. Moscow, USSR: Enrgiya, 1979.