Research of Text Topic Automatic Extraction Method Based on Rough Set Theory

Article Preview

Abstract:

On the base of researching currently popular text topic extraction technologies, a new text topic automatic abstracting method is proposed based on rough set theory and rough similarity. Firstly it separated a text into words and sentences to complete information segmentation, and then constructed a similarity matrix by computing the rough similarity between different words to realize the text clustering, finally extracted representative sentences from each class to generate the text topic. The experiment shows that the method is feasible and effective.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 268-270)

Pages:

1127-1131

Citation:

Online since:

July 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2011 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Lide. Large-scale Chinese text processing. Shanghai: Fudan University Press, 1997: 137-139.

Google Scholar

[2] Wang zygomatic, Hou-Kuan Huang, Tian Shengfeng. Text classification implementation technology. Guangxi Normal University (Natural Science), 2003 , 21 (1): 173-179.

Google Scholar

[3] Ronen Feldman and Ido Dagan. DKT-Knowledge Discovery in Textual Databases. In Proceedings of the 1st Annual Conference on Knowledge Discovery and Data Mining, 1995. 112-117.

Google Scholar

[4] Luhn H P. The automatic creation of literature abstracts. IBM Journal of Research and Development, 1958, 2(2): 159-165.

Google Scholar

[5] Hahn U, Inderjeet Mani. The Challenges of Automatic Summarization. IEEE: computer. 2000: 79-101.

Google Scholar

[6] Salton G, Allen J, Buckley C, Singhal A. Automatic analysis, theme generation and summarization of machine-readable texts. 1994, 264(3): 1421-1426.

DOI: 10.1126/science.264.5164.1421

Google Scholar

[7] Z. Pawlak. Rough sets: theoretical aspects and applications[M]. Kluwer Publishers, 1991: 31-42.

Google Scholar

[8] Kuncheva, L., Bezdek, J.C., Duin, R.P.W. Decision templates for multiple classifier fusion: an experimental comparison. Pattern Recognition, 34 (2001) 299-314.

DOI: 10.1016/s0031-3203(99)00223-x

Google Scholar