Design of Hot Issue Founding System Based on Tibetan Web

Abstract:

This paper designs a hot issue finding system based on news pages from Tibetan websites. The system automatically discovers the issues circulating on the network in any chosen period of time, judges them against the characteristics of hot issues, and presents the issues that meet those conditions to the user. Because the corpus is large and hot issue detection demands near-real-time response, the system first groups the corpus by day and clusters each day's corpus with agglomerative clustering. It then applies Single-pass clustering to the corpus of the selected period to obtain an issue list, and ranks the issues by a heat calculation. Tests on a Tibetan corpus from 2011 show that the system achieves good results and has high practical value.
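The Single-pass clustering and heat-ranking steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the cosine similarity measure, the `threshold` value, the use of pre-tokenized documents, and the definition of heat as the number of documents in an issue are all assumptions made for illustration, and the daily agglomerative clustering stage is omitted.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def single_pass(docs, threshold=0.3):
    """Scan documents once; each joins its most similar cluster or opens a new one.

    `threshold` is an assumed value, not one reported in the paper.
    """
    clusters = []  # each cluster: {"centroid": Counter, "members": [doc indices]}
    for i, tokens in enumerate(docs):
        vec = Counter(tokens)
        best, best_sim = None, 0.0
        for cluster in clusters:
            sim = cosine(vec, cluster["centroid"])
            if sim > best_sim:
                best, best_sim = cluster, sim
        if best is not None and best_sim >= threshold:
            # Summed counts act as the centroid; cosine ignores the scale.
            best["centroid"].update(vec)
            best["members"].append(i)
        else:
            clusters.append({"centroid": Counter(vec), "members": [i]})
    return clusters


def rank_by_heat(clusters):
    """Order issues by an assumed heat score: the number of member documents."""
    return sorted(clusters, key=lambda c: len(c["members"]), reverse=True)


# Hypothetical, already-segmented documents (word segmentation is out of scope here).
docs = [
    ["flood", "river", "rescue"],
    ["market", "price", "rise"],
    ["flood", "rescue", "team"],
    ["river", "flood", "warning"],
]
ranked = rank_by_heat(single_pass(docs))
for issue in ranked:
    print(len(issue["members"]), issue["members"])
```

Because Single-pass clustering reads each document only once, its result depends on the order of the documents and on the threshold. A heat score in a real system would likely combine more signals than cluster size.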

Info:

Pages:

1839-1844

Online since:

September 2013

Copyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved
