IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i18p2274-d636683.html
   My bibliography  Save this article

3D-DCDAE: Unsupervised Music Latent Representations Learning Method Based on a Deep 3D Convolutional Denoising Autoencoder for Music Genre Classification

Author

Listed:
  • Lvyang Qiu

    (Department of Multimedia Engineering, Dongguk University-Seoul, Seoul 04620, Korea)

  • Shuyu Li

    (Department of Multimedia Engineering, Dongguk University-Seoul, Seoul 04620, Korea)

  • Yunsick Sung

    (Department of Multimedia Engineering, Dongguk University-Seoul, Seoul 04620, Korea)

Abstract

With unlabeled music data widely available, it is necessary to build an unsupervised latent music representation extractor to improve the performance of classification models. This paper proposes an unsupervised latent music representation learning method based on a deep 3D convolutional denoising autoencoder (3D-DCDAE) for music genre classification, which aims to learn common representations from a large amount of unlabeled data to improve the performance of music genre classification. Specifically, unlabeled MIDI files are applied to 3D-DCDAE to extract latent representations by denoising and reconstructing input data. Next, a decoder is utilized to assist the 3D-DCDAE in training. After 3D-DCDAE training, the decoder is replaced by a multilayer perceptron (MLP) classifier for music genre classification. Through the unsupervised latent representations learning method, unlabeled data can be applied to classification tasks so that the problem of limiting classification performance due to insufficient labeled data can be solved. In addition, the unsupervised 3D-DCDAE can consider the musicological structure to expand the understanding of the music field and improve performance in music genre classification. In the experiments, which utilized the Lakh MIDI dataset, a large amount of unlabeled data was utilized to train the 3D-DCDAE, obtaining a denoising and reconstruction accuracy of approximately 98%. A small amount of labeled data was utilized for training a classification model consisting of the trained 3D-DCDAE and the MLP classifier, which achieved a classification accuracy of approximately 88%. The experimental results show that the model achieves state-of-the-art performance and significantly outperforms other methods for music genre classification with only a small amount of labeled data.

Suggested Citation

  • Lvyang Qiu & Shuyu Li & Yunsick Sung, 2021. "3D-DCDAE: Unsupervised Music Latent Representations Learning Method Based on a Deep 3D Convolutional Denoising Autoencoder for Music Genre Classification," Mathematics, MDPI, vol. 9(18), pages 1-17, September.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:18:p:2274-:d:636683
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/18/2274/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/18/2274/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Pieter-Tjerk de Boer & Dirk Kroese & Shie Mannor & Reuven Rubinstein, 2005. "A Tutorial on the Cross-Entropy Method," Annals of Operations Research, Springer, vol. 134(1), pages 19-67, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhe Jiang & Shuyu Li & Yunsick Sung, 2022. "Enhanced Evaluation Method of Musical Instrument Digital Interface Data based on Random Masking and Seq2Seq Model," Mathematics, MDPI, vol. 10(15), pages 1-17, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mattrand, C. & Bourinet, J.-M., 2014. "The cross-entropy method for reliability assessment of cracked structures subjected to random Markovian loads," Reliability Engineering and System Safety, Elsevier, vol. 123(C), pages 171-182.
    2. R. Y. Rubinstein, 2005. "A Stochastic Minimum Cross-Entropy Method for Combinatorial Optimization and Rare-event Estimation," Methodology and Computing in Applied Probability, Springer, vol. 7(1), pages 5-50, March.
    3. Kin-Ping Hui, 2011. "Cooperative Cross-Entropy method for generating entangled networks," Annals of Operations Research, Springer, vol. 189(1), pages 205-214, September.
    4. Mathieu Balesdent & Jérôme Morio & Loïc Brevault, 2016. "Rare Event Probability Estimation in the Presence of Epistemic Uncertainty on Input Probability Distribution Parameters," Methodology and Computing in Applied Probability, Springer, vol. 18(1), pages 197-216, March.
    5. Tran, Cong Quoc & Keyvan-Ekbatani, Mehdi & Ngoduy, Dong & Watling, David, 2021. "Stochasticity and environmental cost inclusion for electric vehicles fast-charging facility deployment," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 154(C).
    6. Xi Chen & Enlu Zhou, 2015. "Population model-based optimization," Journal of Global Optimization, Springer, vol. 63(1), pages 125-148, September.
    7. Zhou, Yuekuan & Zheng, Siqian, 2020. "Climate adaptive optimal design of an aerogel glazing system with the integration of a heuristic teaching-learning-based algorithm in machine learning-based optimization," Renewable Energy, Elsevier, vol. 153(C), pages 375-391.
    8. Akimoto, Youhei & Auger, Anne & Hansen, Nikolaus, 2022. "An ODE method to prove the geometric convergence of adaptive stochastic algorithms," Stochastic Processes and their Applications, Elsevier, vol. 145(C), pages 269-307.
    9. Anastasia Spiliopoulou & Ioannis Papamichail & Markos Papageorgiou & Yannis Tyrinopoulos & John Chrysoulakis, 2017. "Macroscopic traffic flow model calibration using different optimization algorithms," Operational Research, Springer, vol. 17(1), pages 145-164, April.
    10. Zhang, Yali & Shang, Pengjian, 2019. "Multivariate multiscale distribution entropy of financial time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 515(C), pages 72-80.
    11. Chen, Siliang & Chen, Kang & Zhu, Xu & Jin, Xinqiao & Du, Zhimin, 2022. "Deep learning-based image recognition method for on-demand defrosting control to save energy in commercial energy systems," Applied Energy, Elsevier, vol. 324(C).
    12. A. Gouda & T. Szántai, 2008. "Rare event probabilities in stochastic networks," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 16(4), pages 441-461, December.
    13. Liang Huang & Juanjuan Zhu & Mulan Qiu & Xiaoxiang Li & Shasha Zhu, 2022. "CA-BASNet: A Building Extraction Network in High Spatial Resolution Remote Sensing Images," Sustainability, MDPI, vol. 14(18), pages 1-15, September.
    14. Reuven Y. Rubinstein, 2006. "How Many Needles are in a Haystack, or How to Solve #P-Complete Counting Problems Fast," Methodology and Computing in Applied Probability, Springer, vol. 8(1), pages 5-51, March.
    15. Ad Ridder & Bruno Tuffin, 2012. "Probabilistic Bounded Relative Error Property for Learning Rare Event Simulation Techniques," Tinbergen Institute Discussion Papers 12-103/III, Tinbergen Institute.
    16. Mehni, Moien Barkhori & Mehni, Mohammad Barkhori, 2023. "Reliability analysis with cross-entropy based adaptive Markov chain importance sampling and control variates," Reliability Engineering and System Safety, Elsevier, vol. 231(C).
    17. Wu, Xin & Nie, Lei & Xu, Meng, 2017. "Robust fuzzy quality function deployment based on the mean-end-chain concept: Service station evaluation problem for rail catering services," European Journal of Operational Research, Elsevier, vol. 263(3), pages 974-995.
    18. Xianmin Wang & Xinlong Zhang & Jia Bi & Xudong Zhang & Shiqiang Deng & Zhiwei Liu & Lizhe Wang & Haixiang Guo, 2022. "Landslide Susceptibility Evaluation Based on Potential Disaster Identification and Ensemble Learning," IJERPH, MDPI, vol. 19(21), pages 1-26, October.
    19. Thirunavukkarasu, M. & Sawle, Yashwant & Lala, Himadri, 2023. "A comprehensive review on optimization of hybrid renewable energy systems using various optimization techniques," Renewable and Sustainable Energy Reviews, Elsevier, vol. 176(C).
    20. Rishang Long & Jianhua Zhang, 2016. "Risk Assessment Method of UHV AC/DC Power System under Serious Disasters," Energies, MDPI, vol. 10(1), pages 1-13, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:18:p:2274-:d:636683. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.