Algorithm for automatic classification of speech segments on based on autocorrelation and energy characteristics
Main Article Content
Abstract
The article is devoted to the speech segmentation algorithm by vocal features, based on specifics of autocorrelation function and energy distribution over frequency domain. The algorithm’s classification characteristics are high enough and independent of definite speech base, what demonstrates the proposed algorithm advantage compared to algorithm made for processing of voice with definite characteristics. The operational results with various male and female utterances are considered.
Article Details
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
References
C. Lemyre, M. Jelinek, and R. Lefebvre, “New approach to voiced onset detection in speech signal and its application for frame error concealment”, in 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, Las Vegas, NV, USA, 2008, pp. 4757–4760. DOI:10.1109/ICASSP.2008.4518720
A. Prodeus, “Digital speech coding:modeling vocoders in Matlab”, Electronics and Communications, no. 1, pp. 56–64, 2006.
M. Kulesza, G. Szwoch, and A. Czyżewsk, “HighQuality Speech Coding using Combined Parametricand Perceptual Modules”, PWASET, no. 13, pp. 244–249, 2006.
T. Van Pham, “Wavelet analisys for robust speechprocessing and applications”, 2007, p. 171.
V. Zhuikov and A. Kharchenko, “Algorithm for classifying speech signal segments”, Electronics and Communications, no. 2-3, pp. 130–137, 2009.
Orlov A.I., Applied statistics, Moscow: Izdatel’stvo «Ekzamen», 2004.
G. Szwoch, M. Kulesza, and A. Czyzewski, “Transient detection algorithms for speech coding applications”, The Journal of the Acoustical Society of America, vol. 120, no. 5_Supplement, pp. 3217–3217, Nov. 2006. DOI:10.1121/1.4788162
Y. Kalintsev, Speech intelligibility in digital vocoders, Moscow: Radio and Communications, 1991.
J. Flanagan, Analysis, synthesis and speech perception trans, Moscow: Communication, 1968.
J. Kaiser, “On a simple algorithm to calculate the ’energy’ of a signal”, in International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM, USA, 1990, pp. 381–384. DOI:10.1109/ICASSP.1990.115702
P. Maragos and A. Potamianos, “Higher order differential energy operators”, IEEE Signal Processing Letters, vol. 2, no. 8, pp. 152–154, Aug. 1995 DOI:10.1109/97.404130
J. Markel and A. Gray, Linear speech prediction, Moscow: Svyaz, 1980.
V. Hardle, J. Krekyacharyan, D. Picard, and A. Tsybakov, Wavelets,approximation and statistical applications