Using Semantic Correlation of HowNet for Short Text Classification

Abstract:

A method that uses the HowNet ontology for short text classification is proposed. First, high-frequency domain words are selected as the feature words. Then the feature words are extended to concepts through HowNet, which enriches the features semantically and compensates for feature sparsity. Finally, the semantic correlation values between words are obtained by calculating the distance between their concepts in the node tree. Experimental results show that both classification efficiency and precision are improved.
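The distance-based correlation step can be illustrated with a minimal sketch. The abstract does not give the paper's formula, so everything here is an assumption: the toy concept tree `PARENT` is hypothetical and not taken from HowNet, and the mapping `alpha / (distance + alpha)` with the constant `alpha` is just one simple way to turn a tree distance into a score between 0 and 1.

```python
# Toy concept tree (hypothetical, not real HowNet data): child -> parent.
PARENT = {
    "entity": None,
    "animal": "entity",
    "plant": "entity",
    "dog": "animal",
    "cat": "animal",
    "tree": "plant",
}


def path_to_root(node):
    """Return the nodes from `node` up to the root, inclusive."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def tree_distance(a, b):
    """Count the edges between a and b via their lowest common ancestor."""
    steps_from_a = {n: i for i, n in enumerate(path_to_root(a))}
    for steps_from_b, n in enumerate(path_to_root(b)):
        if n in steps_from_a:
            return steps_from_a[n] + steps_from_b
    raise ValueError("nodes are not in the same tree")


def correlation(a, b, alpha=1.6):
    """Map tree distance to a score in (0, 1]; alpha is an assumed constant."""
    return alpha / (tree_distance(a, b) + alpha)
```

In this sketch, concepts that share a close ancestor (such as `"dog"` and `"cat"`) score higher than concepts that meet only at the root (such as `"dog"` and `"tree"`), which is the behaviour a distance-based correlation is meant to capture.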

Pages: 1931-1934

Online since: February 2014

Copyright: © 2014 Trans Tech Publications Ltd. All Rights Reserved
