Chinese Sentiment Classifier Machine Learning Based on Optimized Information Gain Feature Selection

Jin Tao Shi; Hui Liang Liu; Yuan Xu; Jun Feng Yan; Jian Feng Xu

doi:10.4028/www.scientific.net/AMR.988.511

Paper Titles

Numerical Simulation of Erosion Wear of Liquid-Solid Two-Phase Flow in Sliding Sleeve of Horizontal Well
p.483

3-Dimensional Localization System Based on Extension of Beacon Nodes and Segmentation of Coordinate Space
p.489

Infiltration Characteristics of Topsoil in Reclamation Farmland Filled with Yellow River Sediment
p.498

Analytical Study of Cylindrical P-Wave Propagation across Jointed Rock Masses
p.502

Chinese Sentiment Classifier Machine Learning Based on Optimized Information Gain Feature Selection
p.511

Optimization of Signal Intersection with the Combination of VISSIM and SYNCHRO
p.517

The Research on Correction Method of Capacitance Signal Drift for Drop Analysis System
p.521

Use of the Principal Component Analysis (PCA) to Reduce Data Complexity in Qualitative Research: An Electro-Electronics Case Study
p.526

A Neutral Framework for Feature Definition and a Generic Algorithm for Feature Recognition
p.530

HomeAdvanced Materials ResearchAdvanced Materials Research Vol. 988Chinese Sentiment Classifier Machine Learning...

Chinese Sentiment Classifier Machine Learning Based on Optimized Information Gain Feature Selection

Abstract:

Machine learning is important solution in the research of Chinese text sentiment categorization , the text feature selection is critical to the classification performance. However, the classical feature selection methods have better effect on the global categories, but it misses many representative feature words of each category. This paper presents an improved information gain method that integrates word frequency and degree of feature word sentiment into traditional information gain methods. Experiments show that classifier improved by this method has better classification .

You might also be interested in these eBooks

Material, Mechanical and Manufacturing Engineering II

View Preview

Info:

Periodical:

Advanced Materials Research (Volume 988)

Pages:

511-516

DOI:

https://doi.org/10.4028/www.scientific.net/AMR.988.511

Citation:

Cite this paper

Online since:

July 2014

Authors:

Jin Tao Shi, Hui Liang Liu, Yuan Xu, Jun Feng Yan, Jian Feng Xu*

Keywords:

Chinese, Classifier, Feature, Machine Learning, Sentiment

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

* - Corresponding Author

References

[1] Zhao Zhiwei. Chinese Text Orientation Analysis[D]. Anhui University, (2012).

Google Scholar

[2] Zhao Yanyan, Qin Bing, Liu Ting. Sentiment Analysis[J]. Journal of Software, 2010，21(8)：pp.1834-1848.

Google Scholar

[3] Bo Pang, Lillian Lee, Shivakumar Vaithyanathan. Sentiment Classfication using Machine Learning Techniques, the 2002 Conference on Empirical Methods in Natural Language Processing, 2002, pp.79-86.

DOI: 10.3115/1118693.1118704

Google Scholar

[4] Xu Linhong, Lin Hongfei, YangZhihao. Text Orientation Identification Based on Semantic Comprehension. [J]. Journal of Chinese Information Processing, 2007, 21(1), pp.96-100.

Google Scholar

[5] Tang Huifeng, Tan Songbo, Chen Xueqi. Research on Sentiment Classification of Chinese Reviews Based on Supervised Machine Learning Techniques[J]. Journal of Chinese Information Processing, 2007，21(6)：pp.88-94.

Google Scholar

[6] JiangHong. Text Representation and Algorithms for Chinese Text Classification[D]．Zhejiang Normal University, (2007).

Google Scholar

[7] Zhang Yun-tao, Gong Ling, Wang Yong-cheng. An improved TF-IDF approach for text classification[J]. Journal of Zhejiang University SCIENCE. 2005 6A(1): pp.49-55.

DOI: 10.1631/jzus.2005.a49

Google Scholar

[8] Yiming Yang, Jan O. Pedersen. A Comparative Study on Feature Selection in Text Categorization[A]. Proceedings of the 14th International Conference on Machine learning[C]. Nashville: Morgan Kaufmann, 1997: pp.412-420.

Google Scholar

[9] Jiawei Han, Micheline Kamber. Data Mining: Concepts and Techniques[M]. Translate by Fan Ming, Meng Xiaofeng. China Machine Press, (2011).

Google Scholar

[10] Lv Hao, Lin Jun, Zeng Xiaoxian. Research and Application of Improved Naïve Bayesian Classification Algorithm[J]. Journal of Hunan University, 2012, 12. pp.1-4.

Google Scholar

[11] Qian Xiaodong, Wang Zheng-ou. Text Categorization Method Based on Improved KNN[J]. Information Science, 2005(4).

Google Scholar

[12] Eduardo Jose Bayro-Corrochano, Nancy Arana-Daniel. Clifford Support Vector Machines for Classification, Regression and Recurrence[J]. IEEE Transactions on Neural Networks, 2010, 21(11).

DOI: 10.1109/tnn.2010.2060352

Google Scholar

[13] Liu Qinghe, Liang Zhengyou. Optimized approach of feature selection based on information gain. Computer Engineering and Applications，2011，47（12）：pp.130-132.

Google Scholar

[14] He Fengying. Orientation analysis for Chinese blog text based on semantic comprehension. Journal of Computer Applications. 2011, 31(8): pp.2130-2136.

Google Scholar