Classification of Chinese Popular Songs Using a Fusion Scheme of GMM Model Estimate and Formant Feature Analysis

Abstract:

In this paper, a fusion scheme that combines Gaussian mixture model (GMM) calculations with formant feature analysis, called GMM-Formant, is proposed for the classification of Chinese popular songs. Automatic classification of popular music is generally performed with two main categories of techniques: model-based and feature-based approaches. Among model-based classification techniques, the GMM is widely used for its simplicity; in feature-based music recognition, the formant parameter is an important acoustic feature for evaluation. The proposed GMM-Formant method uses linear interpolation to combine GMM likelihood estimates with formant evaluation results, adjusting the likelihood score derived from the GMM calculations according to the formant feature evaluation outcome. By considering both model-based and feature-based techniques for song classification, GMM-Formant yields a more reliable classification result and therefore maintains satisfactory recognition accuracy. Experimental results obtained from a data set of numerous Chinese popular songs show the superiority of the proposed GMM-Formant.

Keywords: Song classification; Gaussian mixture model; Formant feature; GMM-Formant.
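The preview gives no fusion details beyond linear interpolation, so the following Python sketch is only a minimal illustration of such a score-level fusion: a class-conditional GMM log-likelihood is interpolated with a formant-based score using an assumed weight ALPHA. The feature extraction, the formant scorer, and the value of ALPHA are all assumptions for illustration, not the paper's actual settings.

import numpy as np
from sklearn.mixture import GaussianMixture

# Assumed interpolation weight on the model-based (GMM) score; the paper's
# actual weighting is not given in this preview.
ALPHA = 0.7

def gmm_formant_score(gmm: GaussianMixture,
                      features: np.ndarray,
                      formant_score: float) -> float:
    """Linearly interpolate a GMM likelihood estimate with a formant score.

    features      -- (n_frames, n_dims) acoustic feature vectors of one song
    formant_score -- feature-based evaluation result, assumed already
                     normalised to the scale of the average log-likelihood
    """
    gmm_score = gmm.score(features)  # mean per-frame log-likelihood
    return ALPHA * gmm_score + (1.0 - ALPHA) * formant_score

def classify(features: np.ndarray,
             formant_scores: dict,
             class_gmms: dict) -> str:
    """Return the class label whose fused GMM-Formant score is highest."""
    fused = {label: gmm_formant_score(gmm, features, formant_scores[label])
             for label, gmm in class_gmms.items()}
    return max(fused, key=fused.get)

In this reading, an ALPHA close to 1 trusts the model-based GMM estimate, while smaller values shift weight toward the feature-based formant evaluation.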

Info:

Pages: 1006-1009

Online since: December 2013

Copyright: © 2014 Trans Tech Publications Ltd. All Rights Reserved
