Instrument Classification Using Different Machine Learning and Deep Learning Methods

Authors

  • Yuqing Su

DOI:

https://doi.org/10.54097/hset.v34i.5435

Keywords:

Musical instrument classification, MFCCs, kNN, GMM, SVM, ANN, CNN, RNN

Abstract

Instruments are categorized into the 5 groups in the Sachs-Hornbostel system: idiophones, membranophones, aerophones, chordophones, and electrophones. It might be easy to tell the Sachs-Hornbostel group that an instrument belongs to. However, distinguishing single instrument sound can be hard in monophonic or polyphonic music pieces and it is an important subject for musicians. Using computer science models can help musicians to analyze songs easily and fasten the speed of finding the instrument that are wanted by music producers or composers. This work aims to compare different models on particular instruments (monophonic sound) recognition which is an important problem in the field of music information retrieval. Jupyter Notebook is included for easy reproducibility. Among the six models chosen in this research: k-nearest neighbors(kNN), Support Vector Machines(SVM), Gaussian Mixture Modeling(GMM), Artificial Neural Networks(ANN), Convolutional Neural Networks(CNN) and Recurrent Neural Networks(RNN), CNN is the most accurate model and SVM is the fastest model while CNN has the prospect of being improved because it can be adjusted manually.

Downloads

Download data is not yet available.

References

Marques, J., & Moreno, P. J. (1999). A study of musical instrument classification using gaussian mixture models and support vector machines. Cambridge Research Laboratory Technical Report Series CRL, 4, 143.

Solanki, A., & Pandey, S. (2019). Music instrument recognition using deep convolutional neural networks. International Journal of Information Technology, 1-10.

Philharmonia Limited Registered Charity. 2022. Sound samples [dataset]. https://philharmonia.co.uk/resources/sound-samples/

Bernier, J. J., & Stafford, R. E. (1972). The relationship of musical instrument preference to timbre discrimination. Journal of Research in Music Education, 20(2), 283-285.

Solanki, A., & Pandey, S. (2019). Music instrument recognition using deep convolutional neural networks. International Journal of Information Technology, 1-10.

Liu, J., & Xie, L. (2010, May). SVM-based automatic classification of musical instruments. In 2010 International Conference on Intelligent Computation Technology and Automation (Vol. 3, pp. 669-673). IEEE.

Li, T. L., & Chan, A. B. (2011, January). Genre classification and the invariance of MFCC features to key and tempo. In International Conference on MultiMedia Modeling (pp. 317-327). Springer, Berlin, Heidelberg.

Harrison, O. 2019. Machine learning basics with the K-nearest neighbors algorithm. https://towardsdatascience.com/machine-learning-basics-with-the-k-nearest-neighbors-algorithm-6a6e71d01761

Lu, L., Zhang, H. J., & Li, S. Z. (2003). Content-based audio classification and segmentation by using support vector machines. Multimedia systems, 8(6), 482-492.

Kumar, A. 2022. Gaussian mixture models: What are they & when to use? https://vitalflux.com/gaussian-mixture-models-what-are-they-when-to-use/#:~:text=Gaussian%20mixture%20models%20(GMMs)%20are,marketing%20and%20so%20much%20more!

Shreevathsa, P. K., Harshith, M., & Rao, A. (2020, January). Music instrument recognition using machine learning algorithms. In 2020 International Conference on Computation, Automation and Knowledge Management (ICCAKM) (pp. 161-166). IEEE.

Saha, S. 2018. A comprehensive guide to Convolutional Neural Networks - the eli5 way. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53

Gupta, A. 2022. Difference between ANN, CNN and RNN. https://www.geeksforgeeks.org/difference-between-ann-cnn-and-rnn/#:~:text=ANN%20is%20considered%20to%20be,compatibility%20when%20compared%20to%20CNN.&text=Facial%20recognition%20and%20Computer%20vision

Downloads

Published

28-02-2023

How to Cite

Su, Y. (2023). Instrument Classification Using Different Machine Learning and Deep Learning Methods. Highlights in Science, Engineering and Technology, 34, 136-142. https://doi.org/10.54097/hset.v34i.5435