AN OVERVIEW OF THE RECOGNITION ALGORITHM OF A HUMAN VOICE

Авторы

  • Orken Mamyrbayev Institute of Information and Computing Technologies, Almaty, Kazakhstan
  • A. Karelova Al-Farabi Kazakh National University, Almaty, Kazakhstan

Ключевые слова:

algorithm; Gaussian mixture; identification; recognition; classification.

Аннотация

Speech recognition has various applications, including human-machine interaction, sorting phone
calls by gender classification, categorizing videos with tags, and so on. Currently, machine learning is a popular field
that is widely used in various fields and applications, taking advantage of the latest developments in digital
technologies and the advantages of data storage capabilities from electronic media. In this article, we will focus on
voice gender recognition for a class of text-dependent systems using the Dynamic time distortion (DTW) algorithm
and for a class of text-independent systems, the Gaussian mixture model. With this method, it is possible to
distinguish a person's voice with the highest accuracy, since the components of Gaussian mixtures can simulate the
personality of the voice. The article presents the results of testing the algorithm, and concludes that the Gaussian
mixture model is applicable to solving the problem of identifying a person by voice.

Загрузки

Опубликован

2021-02-08

Как цитировать

Mamyrbayev, O., & Karelova, A. (2021). AN OVERVIEW OF THE RECOGNITION ALGORITHM OF A HUMAN VOICE. Известия НАН РК. Серия физико-математическая, (1), 32–38. извлечено от https://journals.nauka-nanrk.kz/physics-mathematics/article/view/265