A Method for Chinese Noun Phrase Recognition Base on Word Co-Occurrence Directed Graph

Article Preview

Abstract:

This paper proposes a recognition method for Chinese Noun Phrase based on word co-occurrence directed graph. An input document is firstly scanned in which noun word string is retrieved. Atomic word table and word co-occurrence directed graph is then generated according to the word strings. A search is performed on the graph to find the longest paths with priority weight satisfying certain criteria. The word strings corresponding to the paths are considered as noun phrases. As dimensionality reduction is applied, the scale of the word co-occurrence directed graph is reduced significantly, and thus the efficiency of the algorithm is improved. Experimental results demonstrate that the precision of noun phrase recognition reaches 95.4%.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 756-759)

Pages:

4412-4418

Citation:

Online since:

September 2013

Authors:

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Mei YANG. The construction of modern Chinese compound word. Nanjing: Nanjing Normal University, 2006. (In Chinese).

Google Scholar

[2] Chun-fa YUAN, Chang-ning HUANG. Research of Chinese Morpheme and Word Formation Rules Base on Morpheme Database, Journal of Linguistics Application. No. 3, pp.83-88, 1998. (In Chinese).

Google Scholar

[3] Jian-chao CHEN, Qi-lun ZHENG, Qing-yang LI, Gui-duo YAN. Chinese combined-word detection based on directed net of word-sequence frequency, Application Research of Computers. Vol. 26, No. 10, pp.3746-3749, 2009. (In Chinese).

Google Scholar

[4] Jun ZHAO. Chinese BaseNP Recognition and Structure Analysis. Beijing : Tsinghua University, (1998).

Google Scholar

[5] Rui-xia ZHANG, Lei ZHANG. The Model for Chinese BaseNP Analysis Based on Knowledge Graphs, Journal of Chinese Information Processing. Vol. 18, No. 3, pp.47-53, 2004. (In Chinese).

Google Scholar

[6] Fang XU, Qing-cheng ZONG, Xia WANG. Chinese Base NP Chunking by Error-driven Combination Classifiers, Journal of Chinese Information Processing. Vol. 21, No. 1, pp.115-119, 2007. (In Chinese).

Google Scholar

[7] Lei ZHOU, Qian-ming ZHU. Research on Recognition Method of Unknown Chinese Words Based on Statistic and Regulation, Computer Engineering. Vol. 33, No. 8, pp.196-198, 2007. (In Chinese).

Google Scholar

[8] Rong LI, Jia-heng ZHENG, Mei-ying GUO. Application Study of Hidden Markov Model Based on Genetic Algorithm in Noun Phrase Identification, Computer Science. Vol. 36, No. 10, pp.244-246, 2009. (In Chinese).

Google Scholar

[9] Church K W. A stochastic parts program and noun phrase parser for unrest ricted text, Proceedings of the 2nd Conference on Applied Natural Language Processing, Texas, USA, 1988, pp.136-143.

DOI: 10.3115/974235.974260

Google Scholar

[10] Erik F, Sang Tjong Kim. Noun Phrase Recognition by System Combination, Proceedings of ANLP2NAACL 2000, Seattle, WA, USA, 2000, pp.50-55.

Google Scholar

[11] Taku Kudo and Yuji Matsumoto. Chunking with Support Vector Machines, Proceedings of the 2nd Meeting of the North American Chapter of the Association for Computational Linguistics, USA, (2001).

DOI: 10.3115/1073336.1073361

Google Scholar

[12] Heaps, H. Information Retrieval: Computational and theoretical aspects. New York: Academic Press, (1978).

Google Scholar