A Method of Domain Compound Concept Extraction Based on Multilevel Filter

Article Preview

Abstract:

Identification and extraction in domain compound concept is the basis of the domain text information processing. This paper builds a multilevel filter extraction model by fusing the thought of statistics and language rule. Firstly, the extraction model screening out domain atomic concept set by using method of improved TF-IDF. We secondly build a space combination rule, screening out initial domain compound concept set. Ultimately we screening out finally domain compound concept set by using POS rules template matching via POS analysis. Experiments show that this method can effectively identify and extract domain compound concept and F-measure has reached 83.9%.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 989-994)

Pages:

2292-2296

Citation:

Online since:

July 2014

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Jipeng Sun, a concept extraction of method oriented to text, J. Computer application and software, 2009, 26(9).

Google Scholar

[2] Shamsfard M, Barforoush AA, Learning ontologies from natural language texts, J. Int'l Journal Human-Computer Studies, 2004, 60(1): 17- 63.

DOI: 10.1016/j.ijhcs.2003.08.001

Google Scholar

[3] Sproat R, Shih C. A statistical method for finding word boundaries in Chinese text, J. Computer Processing of Chinese and Oriental Languagies, 1990, 4: 336-351.

Google Scholar

[4] Wang, B. Y. and S. M. Zhang. A novel feature selection algorithm for text classification based on TFIDF-weight and KL-divergence. Proceedings of the 11th Joint International Computer Conference. 2005: 438-441.

DOI: 10.1142/9789812701534_0099

Google Scholar

[5] Zhangfeng, Xuyun, Houyan, Chinese term extraction system based on mutual information,J. Computer application and research, 1001- 3695(2005)05-0072-02.

Google Scholar

[6] Zifang Lin, Xiufeng Jiang, New words recognition based on word internal model, J. Computer and modernization , 1006-2475(2010)11-0162-03.

Google Scholar