Paper Title:
Clustering Chinese Web Search Results Based on Association Calculation
  Abstract

Clustering web search results is a kind of solution which help user to find the interested topic by grouping the search results. This paper presents an improved method for clustering search results focused on Chinese web pages. The main contributions of this paper are the following: First, in this paper, a method which identifies the complete semantic information phrase by comparing the attributes of base clusters in the suffix tree document model and the overlap of their document sets is presented. Second, by analyzing the content and structure of title and snippet of Chinese web search results, one way of sentence segmentation is designed and implemented to constructing suffix tree. Third, In order to better respond to the associate degree of terms, a novel method is proposed which compute the distance in sentence-grain of terms' co-occurrences. Finally, the experiment illustrates that the new clustering method provides an efficient and effective way for user browsing and locating sought information.

  Info
Periodical
Edited by
Qi Luo
Pages
1418-1423
DOI
10.4028/www.scientific.net/AMM.55-57.1418
Citation
Y. Zhao, Y. J. Du, Q. Q. Peng, "Clustering Chinese Web Search Results Based on Association Calculation", Applied Mechanics and Materials, Vols. 55-57, pp. 1418-1423, 2011
Online since
May 2011
Export
Price
$32.00
Share

In order to see related information, you need to Login.

In order to see related information, you need to Login.

Authors: Jing Li Zhou, Xue Jun Nie, Lei Hua Qin, Jian Feng Zhu
Abstract:This paper proposes a novel fuzzy similarity measure based on the relationships between terms and categories. A term-category matrix is...
2620
Authors: Yin Sheng Zhang, Hui Lin Shan, Jia Qiang Li, Jie Zhou
Chapter 8: Nanomaterials and Nanomanufacturing
Abstract:The traditional K-means clustering algorithm prematurely plunges into a local optimum because of sensitive selection of the initial cluster...
1977
Authors: Yi Ding, Xian Fu
Chapter 5: Information Processing and Computational Science
Abstract:Text clustering typically involves clustering in a high dimensional space, which appears difficult with regard to virtually all practical...
939
Authors: Chun Xia Jin, Hai Yan Zhou, Qiu Chan Bai
Chapter 6: Algorithm Design
Abstract:To solve the problem of sparse keywords and similarity drift in short text segments, this paper proposes short text clustering algorithm with...
1716
Authors: Yang Pan, An Hua Chen, Ling Li Jiang
Chapter 3: Mechanical Transmission, Vibration and Noise
Abstract:According to the selection difficulties of initial clustering center of k-means clustering algorithm, this paper proposes a method that is to...
250