Improved Nonnegative Matrix Factorization Based Feature Selection for High Dimensional Data Analysis

Abstract:

Feature selection has become a focus of research in application areas with high-dimensional data. Nonnegative matrix factorization (NMF) is an effective method for dimensionality reduction, but because it is a feature extraction method it cannot select an optimal feature subset. In this paper, a two-step method based on an improved NMF is proposed. In the first step, the bases of each category in the dataset are obtained by NMF; added constraints guarantee that these bases are sparse and largely distinct from one another, which benefits classification, and an auxiliary function is used to prove that the algorithm converges. In the second step, the classic ReliefF algorithm weights each feature using all the basis vectors and selects the optimal feature subset. The experimental results show that the proposed method selects a representative and relevant feature subset that is effective in improving the performance of the classifier.
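
The procedure above is described only at a high level. As a rough illustration of the two-step idea, the sketch below substitutes scikit-learn's standard (unconstrained) NMF for the paper's sparsity-constrained variant, and a simplified Relief weighting (a single nearest hit and miss per basis vector, rather than full ReliefF with k neighbors); all function names, parameter values, and the toy data are illustrative assumptions, not the authors' implementation.

import numpy as np
from sklearn.decomposition import NMF

def class_bases(X, y, n_components=5, seed=0):
    # Step 1: factorize each category's samples separately; the rows of H
    # (model.components_) are that category's basis vectors in feature space.
    bases, labels = [], []
    for c in np.unique(y):
        model = NMF(n_components=n_components, init="nndsvda",
                    max_iter=500, random_state=seed)
        model.fit(X[y == c])             # X[y == c] ~ W @ H, all nonnegative
        bases.append(model.components_)  # H: (n_components, n_features)
        labels.append(np.full(n_components, c))
    return np.vstack(bases), np.concatenate(labels)

def relief_weights(B, yb):
    # Step 2 (simplified Relief): raise a feature's weight when it separates
    # a basis vector from its nearest other-class basis (the "miss") and
    # lower it when it separates the basis from its nearest same-class
    # basis (the "hit").
    n, d = B.shape
    w = np.zeros(d)
    for i in range(n):
        dist = np.abs(B - B[i]).sum(axis=1)   # L1 distance to every basis
        dist[i] = np.inf                      # exclude self from the search
        hit = np.argmin(np.where(yb == yb[i], dist, np.inf))
        miss = np.argmin(np.where(yb != yb[i], dist, np.inf))
        w += np.abs(B[i] - B[miss]) - np.abs(B[i] - B[hit])
    return w / n

# Usage on toy nonnegative data: keep the k highest-weighted features.
X = np.abs(np.random.randn(60, 40))
y = np.repeat([0, 1, 2], 20)
B, yb = class_bases(X, y)
w = relief_weights(B, yb)
selected = np.argsort(w)[::-1][:10]           # indices of the top-10 features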

Info:

Pages: 2344-2348
Online since: August 2013
Copyright: © 2013 Trans Tech Publications Ltd. All Rights Reserved
