METHOD OF NEURAL NETWORK ANALYSIS OF VOICE SIGNAL

Authors

DOI:

https://doi.org/10.28925/2663-4023.2020.7.3142

Keywords:

recognition of emotions; authentication; voice signal; neural network; recognition method

Abstract

The article is devoted to increase of efficiency of the means of analysis of biometric characteristics of subjects, interacting with information-control systems of various purpose. It is shown that from the standpoint of the possibility of using as a sensor the reading of the biometric parameters of the standard peripheral equipment of the computer, the widespread use in the information-control systems of voice messages, their high informativeness, the complexity of falsification of biometric information, as well as the possibility of carrying out hidden monitoring, the prospects have analysis tools voice signal. The necessity of improvement of methodology of neural network analysis of voice signal for recognition of emotions and person is grounded. Possibility of increase of efficiency of neural network means of analysis due to adaptation of parameters of neural network model to the conditions of use is determined. The principles of determination of valid neural network models and the most effective type of neural network model used for voice signal analysis have been formed. A coding procedure for the input signal is developed, which allows to use in the neural network a compact representation of the most informative features of a voice signal. A procedure for encoding a neural network output signal has also been developed to increase the efficiency of its learning. The method of neural network analysis of the voice signal is developed, which due to the offered principles of adaptation and procedures of coding of input and output parameters, allows to introduce into neural means a neural network whose architecture is adapted to the expected conditions of use. The effectiveness of the proposed method has been proven experimentally. Experimental studies have shown that the use of the developed method allows to ensure the accuracy of recognition of emotions of the identified speaker, which is approximately 0.94, which corresponds to the best modern decisions in this field. It is suggested to correlate the ways of further research with the development of solutions that would allow to analyze the voice signal of arbitrary duration under the conditions of noise of different kind.

Downloads

Download data is not yet available.

References

Gnatyuk S. Critical Aviation Information Systems Cybersecurity, Meeting Security Challenges Through Data Analytics and Decision Support, NATO Science for Peace and Security Series, D: Information and Communication Security. - IOS Press Ebooks, Vol.47, №3, рр. 308-316, 2016.

Tereikovskyi I. and Korchenko А. «Modeli etalonov lingvisticheskikh peremennykh dlya obnaruzheniya sniffing-atak», Protection of information, Vol. 19, №3, pp. 228-242, 2017. (In Russian).

Ivanov А. I. Neyrosetevyye algoritmy biometricheskoy identifikatsii lichnosti. Moskow, 144 p., 2004. (In Russian).

Gnatyuk S., Sydorenko V., Aleksander M. Unified data model for defining state critical information infrastructure in civil aviation, Proceedings of the 2018 IEEE 9th International Conference on Dependable Systems, Services and Technologies (DESSERT), Kyiv, Ukraine, May 24-27, 2018, pp. 37-42.

Tereikovska L., Tereikovskyi I., Mussiraliyeva S., Akhmed G. Recognition of emotions by facial Geometry using a capsule neural network. International Journal of Civil Engineering and Technology (IJCIET). Vol. 10, Issue 04, April 2019, pp. 270-279.

Akhmetov B., Tereykovsky I., Doszhanova A., Tereykovskaya L. Determination of input parameters of the neural network model, intended for phoneme recognition of a voice signal in the systems of distance learning. International Journal of Electronics and Telecommunications. Vol 64, No 4 (2018), 425-432.

ISO/IEC 19794-5:2011/AMD 2:2015 [ISO/IEC 19794-5:2011/AMD 2:2015] Information technology - Biometric data interchange formats - Part 5: Face image data - Amendment 2: XML encoding and clarification of defects. [Електронний ресурс]. Доступно: https://www.iso.org/standard/61589.html.

ISO/IEC 24713-1:2008 Information technology - Biometric profiles for interoperability and data interchange - Part 1: Overview of biometric systems and biometric profiles. [Електронний ресурс]. Доступно: https://www.iso.org/standard/38823.html.

Lebedeva N.N. and Karimova E.D. «Akusticheskiye kharakteristiki rechevogo signala kak pokazatel' funktsional'nogo sostoyaniya cheloveka». Advances in Physiological Sciences, №1, pp. 57-95, 2014. (In Russian).

Ajinkya N. Jadhav, Nagaraj V. Dharwadkarю. A Speaker Recognition System Using Gaussian Mixture Model, EM Algorithm and K-Means Clustering, International Journal of Modern Education and Computer Science, Vol.10, No.11, pp. 19-28, 2018.

Altincay H. (2003). Speaker identification by combining multiple classifiers using Dempster–Shafer theory of evidence. Speech Communication, v.41, N4, 531–547.

Toliupa S., Tereikovskiy I., Dychka I., Tereikovska L., Trush A.. The Method of Using Production Rules in Neural Network Recognition of Emotions by Facial Geometry. 3rd International Conference on Advanced Information and Communications Technologies (AICT). 2019, 2-6 July 2019, Lviv, Ukraine, pp. 323 – 327.

Campbell W., Sturim D., Reynolds D. (2006). Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett., v.13, N5, pp. 308–311.

Tereykovska L., Tereykovskiy I., Aytkhozhaeva E., Tynymbayev S., Imanbayev A. Encoding of neural network model exit signal, that is devoted for distinction of graphical images in biometric authenticate systems (2017). // News of the national academy of sciences of the republic of kazakhstan series of geology and technical sciences. Volume 6, Number 426 (2017), pp. 217 – 224.

Akhmetov, B., Tereikovskyi, I., Tereikovska, L., Adranova, A. Neural Network User Authentication by Geometry of the Auricle // Recent Developments in Data Science and Intelligent Analysis of Information Proceedings of the XVIII International Conference on Data Science and Intelligent Analysis of Information, June 4–7, 2018, Kyiv, Ukraine, pp.11-19.

Tereikovskyi I, Chernyshev D., Tereikovska L.A., Mussiraliyeva S, Akhmed G. The Procedure for the Determination of Structural Parameters of a Convolutional Neural Network to Fingerprint Recognition. Journal of Theoretical and Applied Information Technology, No 8, Volume 97, 2019, pp. 2381-2392.

Oksiiuk O., Tereikovska L. and Tereikovskiy I. Determination of Expected Output Signals of the Neural Network Model Intended for Image Recognition. 4th International Scientific-Practical Conference Problems of Infocommunications Science and Technology (PIC S&T) 2017, pp. 596-599.

Tereikovskyi, I., Mussiraliyeva, S., Kosyuk, Y., Bolatbek, M., Tereikovska, L. An experimental investigation of infrasound influence hard drives of a computer system // International Journal of Civil Engineering and Technology. 2018. Volume 9, Issue 6, June 2018, pp. 1558–1566.

Dychka I., Chernyshev D., Tereikovskyi I., Tereikovska L., Pogorelov V. (2020) Malware Detection Using Artificial Neural Networks. Advances in Computer Science for Engineering and Education II. ICCSEEA 2019. Advances in Intelligent Systems and Computing, vol 938. Springer, pp. 3-12.

Tereikovska L. «Metod neyromerezhevoho analizu klaviaturnoho pocherku». Computer-integrated technologies: education, science, production, 2019, №37, pp. 53-59. (In Ukrainian).

Mihaylenko V. М. and Tereikovska L A. «Paradigmy neyrosetevogo raspoznavaniya emotsiy». Management of complex systems development. 2019. №39. pp. 179-186. (In Russian).

Downloads


Abstract views: 491

Published

2020-03-26

How to Cite

Tereikovska, L. (2020). METHOD OF NEURAL NETWORK ANALYSIS OF VOICE SIGNAL . Electronic Professional Scientific Journal «Cybersecurity: Education, Science, Technique», 3(7), 31–42. https://doi.org/10.28925/2663-4023.2020.7.3142