Improvement of KEA Based on Lexical Chain

Article Preview

Abstract:

Keyphrases are very useful and significant for information retrieval, automatic summarizing, text clustering, etc. KEA is a traditional and classical algorithm in keyphrase automatic extraction. But it is mainly based on the statistical information without considering the semantic information. In this paper, We propose a method which combine semantic information with KEA by constructing lexical chain that based on Regets thesaurus. In our method, the semantic similarity between terms is used to construct the lexical chain, and then we use the length of the chain as a feature to build the extraction model. The experiment result shows that the performance of the system has a big improvement compare with the KEA.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 756-759)

Pages:

2999-3004

Citation:

Online since:

September 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Y. Chen, F.S. Tai, and K.L. Chan. Machine learning techniques for business blog search and mining, Expert Systems with Applications, 2008, 35( 3) : 581-590.

DOI: 10.1016/j.eswa.2007.07.015

Google Scholar

[2] H.F. Zhu, F. Bao. Continuous keyword search on multiple text streams, Proceeding of the IEEE International Conference on Communications . Glasgow , England, 2007: 1336-1341.

Google Scholar

[3] M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. Witten, The WEKA Data Mining Software: An Update, SIGKDD Explorations, vol. 11, (2009).

DOI: 10.1145/1656274.1656278

Google Scholar

[4] W.D. Yang, B.L. Shi. Schema aware keyword search over XML streams, Proc. IEEE International Conference on Computer and Information Technology. Fuku shima, Japan, 2007: 29-34.

DOI: 10.1109/cit.2007.119

Google Scholar

[5] Y. Xu, J.T. Li, B. Wang, C.M. Sun, S. Zhang. A study on feature select ion algorithm in text categorization technology, Journal of Computer Research and Development , 2008, 45( 4) : 596-602( in Chinese).

Google Scholar

[6] H.S. Chang, C.C. Hsu. Using topic keyword clusters for automatic document clustering, Proc. 3rd International Conference on Information Technology and Applications. Sydney, Australia, 2005: 419- 424.

DOI: 10.1109/icita.2005.303

Google Scholar