Speech-based gender recognition using linear prediction and mel-frequency cepstral coefficients

Gender discrimination and awareness are essentially practiced in social, education, workplace, and economic sectors across the globe. A person manifests this attribute naturally in gait, body gesture, facial, including speech. For that reason, automatic gender recognition (AGR) has become an interes...

Full description

Bibliographic Details
Published in:Indonesian Journal of Electrical Engineering and Computer Science
Main Author: Ali Y.M.; Noorsal E.; Mokhtar N.F.; Saad S.Z.M.; Abdullah M.H.; Chin L.C.
Format: Article
Language:English
Published: Institute of Advanced Engineering and Science 2022
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85139389165&doi=10.11591%2fijeecs.v28.i2.pp753-761&partnerID=40&md5=4ab9d7ba2e20d66755d8c80ffaf711b7
Description
Summary:Gender discrimination and awareness are essentially practiced in social, education, workplace, and economic sectors across the globe. A person manifests this attribute naturally in gait, body gesture, facial, including speech. For that reason, automatic gender recognition (AGR) has become an interesting sub-topic in speech recognition systems that can be found in many speech technology applications. However, retrieving salient gender-related information from a speech signal is a challenging problem since speech contains abundant information apart from gender. The paper intends to compare the performance of human vocal tract-based model i.e., linear prediction coefficients (LPC) and human auditory-based model i.e., Mel-frequency cepstral coefficients (MFCC) which are popularly used in other speech recognition tasks by experimentation of optimal feature parameters and classifier’s parameters. The audio data used in this study was obtained from 93 speakers uttering selected words with different vowels. The two feature vectors were tested using two classification algorithms namely, discriminant analysis (DA) and artificial neural network (ANN). Although the experimental results were promising using both feature parameters, the best overall accuracy rate of 97.07% was recorded using MFCC-ANN techniques with almost equal performance for male and female classes. © 2022 Institute of Advanced Engineering and Science. All rights reserved.
ISSN:25024752
DOI:10.11591/ijeecs.v28.i2.pp753-761