Self-Adaptive Weighting Text Association Categorization Algorithm Research

Article Preview

Abstract:

In text association classification research, feature distribution of the training sample collection impacts greatly on the classification results, even with a same classification algorithm classification results will have obvious differences using different sample collections. In order to solve the problem, the stability of association classification is improved by the weighing method in the paper, the design realizes the association classification algorithms (WARC) based on rule weight. In the WARC algorithm, this paper proposes the concept of classification rule intensity and gives the concrete formula. Using rule intensity defines the rule adjustment factors that adjust uneven classification rules. Experimental results show the accuracy of text classification can be improved obviously by self-adaptive weighting.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 171-172)

Pages:

246-251

Citation:

Online since:

December 2010

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases[C]/Proc of ACM SIGMOD International Conference on Management of Data. New York: ACM Press, 1993: 207-216.

DOI: 10.1145/170036.170072

Google Scholar

[2] Agrawal R, Srikant R. Fast algorithms for mining association rules[C]/Proc of the 20th International Conference on Very Large Data Bases. San Francisco: Morgan Kaufmann Publishers, 1994: 478-499.

Google Scholar

[3] S-H. Lin, C-S. Shih, M.C. Chen, J-M Ho, M-T. Ko and Y-M. Huang. Extracting Classification Knowledge of Internet Documents with Mining Term Associations: A Semantic Approach. In Proceedings of ACM/SIGIR'98, pages 241-249, (1998).

DOI: 10.1145/290941.291001

Google Scholar

[4] O.R. Zafane and M.L. Antonie. Classifying Text Documents by Associating Terms with Text Categories. In Thirteenth Australasian Database Conference (ADC'02), Melbourne, Australia, January 2002: 215-222.

Google Scholar

[5] W Li, J. Han and J. pei. CMAR: Accurate and efficient classification based on multiple classification rules. In IEEE International Conference on Data Mining (ICDM'O1) San Jose, California, November 29-December (2001).

DOI: 10.1109/icdm.2001.989541

Google Scholar

[6] J. Li, G. Dong and K. Ramamohanarao. Making use of the most expressive jumping emerging patterns for classification. In Proceedings of the Fourth Pacific-Asia Conference on Knowledge Discovery and Data Mining, Kyoto, Japan, pages 220-232, (2000).

DOI: 10.1007/3-540-45571-x_29

Google Scholar

[7] G. Dong. X. Zhang, L. Wong and J. Li. CAEP: Classification by aggregating emerging patterns. In Proceedings of the Second International Conference on Discovery Science, Tokyo, Japan, pales 30-42, (1999).

DOI: 10.1007/3-540-46846-3_4

Google Scholar

[8] B. Liu, W. Hsu and Y. Ma. Mining Association Rules with Multiple Minimum Supports. In Proceedings of KDD-99, (1999).

Google Scholar

[9] B. Liu, Y. Ma and C.K. Wong . Improving an Association Rule Based Classifier. PKDD 2000: 504-509 Acknowledgement *Supported by the National Natural Science Foundation of China (Nos: 60773218) * About the author: Liangjun Li(1967-),Male, Anshan Normal University, professor, Ph D. of Northeastern University.

Google Scholar