Classification Algorithm Based on Categorical Data Analysis

Article Preview

Abstract:

Through compares three methods of Standard Deviation mathematical expectation and variance, a classification algorithm based on the Standard Deviation which in the training set is proposed in this paper. The algorithm first mapped the discrete attribute values to the corresponding values, and calculates Standard Deviation, mathematical expectation and Variance of each attribute in each category. The Standard Deviation, mathematical expectation and Variance of each attribute in each category used as coordinates. When there are new datas need to determine the category, we just need to use the attributes of the new data as coordinates, and calculate its distance to each category, and then the data type is the shortest distance category. Comparison of three methods, the Standard Deviation is the most stable and most accurate. This algorithm has advantages in dealing with the noisy date.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1292-1295

Citation:

Online since:

July 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Alan Agresti. Attribute data analysis[M], Beijing: Higher Education Press (In Chinese).

Google Scholar

[2] Xiujun Gong, Jianping Sun, Zhongzhi Shi, Active Bayesian network classifiers[J] Computer research and development, July 2002 (5) : 574-579 (In Chinese).

Google Scholar

[3] Lihua Luan, Genlin Ji. Decision tree classification technology research [J]. Computer engineering, 2004, 21 (9) : 94-96. (In Chinese).

Google Scholar

[4] Yusheng Cheng, Chunmei Zhang, Xuegang Hu. An incremental inductive learning algorithm [J]. Journal of chongqing university (natural science edition), 2004, 27(4): 44 -48. (In Chinese).

Google Scholar

[5] Zhang Lin, Zhang Jianli. Based on the class attribute mathematical expectation classification algorithm [J], anqing normal university (natural science edition), 2003, 1: 31-34. (In Chinese).

Google Scholar