Audio classification based on machine learning: understanding animal behavior through sound

2023-9-06
Yaz, Furkan
Machine learning-based products that try to make our lives easier are increasing day by day. Thanks to the machine learning models running behind them, these products can be seen or heard and provide information about the context they are in. In this study, three machine learning methods that can hear and understand cat sounds were developed to serve this purpose. These models are Artificial Neural Network, Convolutional Neural Network, and CatBoost. A data library of two thousand sounds was created to understand 6 different cat behaviors. The process proceeded in three basic steps: pre-processing, feature extraction, and classification. One of the most widely used feature extraction algorithms, Mel-Frequency Cepstral Coefficients (MFCCs) has been preferred for Audio Feature Extraction. Accuracy was used as the evaluation metric. A minimum classification success rate of 95% was achieved in all models and the most successful model was determined as Convolutional Neural Network. The reason for this is considered as the presence of convolutional and pooling layers in the model architecture.
Citation Formats
F. Yaz, “Audio classification based on machine learning: understanding animal behavior through sound,” M.S. - Master of Science, Middle East Technical University, 2023.