Paper Title:
An Improved Text Similarity Calculation Algorithm Based on VSM
  Abstract

Text similarity calculation is a key technology in text clustering, Web intelligent retrieval, natural language processing, and related fields. Because the traditional text similarity calculation algorithm does not consider the effect of the feature words shared between texts, it can sometimes produce inaccurate results. To solve this problem, this paper presents an improved text similarity calculation algorithm. Considering that the number of shared feature words reflects the similarity of two texts to some extent, the improved algorithm adds a coverage measure parameter, which effectively reduces the interference from texts with lower similarity. Simulation and experimental results verify the correctness and effectiveness of the improved algorithm.
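
The abstract does not state the formula, so the following is only a minimal sketch of the general idea: a VSM cosine similarity scaled by a coverage term derived from shared feature words. The function names, the use of raw term frequencies as weights, the definition of coverage as shared distinct terms over all distinct terms, and the multiplicative combination are all assumptions made for illustration, not the authors' method.

```python
import math
from collections import Counter


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Classic VSM cosine similarity between two term-weight vectors."""
    shared = set(a) & set(b)
    dot = sum(a[t] * b[t] for t in shared)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def coverage(a: Counter, b: Counter) -> float:
    """ASSUMED coverage measure: shared distinct terms / all distinct terms."""
    union = set(a) | set(b)
    if not union:
        return 0.0
    return len(set(a) & set(b)) / len(union)


def coverage_weighted_similarity(a: Counter, b: Counter) -> float:
    """ASSUMED combination: cosine similarity scaled by the coverage term."""
    return cosine_similarity(a, b) * coverage(a, b)


# Hypothetical example texts, tokenized by whitespace.
doc1 = Counter("text similarity vector space model".split())
doc2 = Counter("text clustering".split())

plain = cosine_similarity(doc1, doc2)
weighted = coverage_weighted_similarity(doc1, doc2)
```

In this sketch the two texts share only one of six distinct terms, so the coverage term pulls the score well below the plain cosine value, which illustrates how a coverage parameter can damp texts with few feature words in common.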

  Info
Periodical
Advanced Materials Research (Volumes 225-226)
Edited by
Helen Zhang, Gang Shen and David Jin
Pages
1105-1108
DOI
10.4028/www.scientific.net/AMR.225-226.1105
Citation
L. Li, A. H. Zhu, T. Su, "An Improved Text Similarity Calculation Algorithm Based on VSM", Advanced Materials Research, Vols. 225-226, pp. 1105-1108, 2011
Online since
April 2011


Authors: Wei Feng Wang, Jun Tao Yuan, An Lin Zhang, Meng Li
Chapter 5: Road and Bridge Engineering
Abstract: For present-day bridges, cable tension testing is a vitally important job in the course of construction. The tension condition of cables plays an...
1117
Authors: Xue Feng Wu, Yu Fan
Chapter 6: Mechatronics
Abstract: A new algorithm for the circle parameters of an irregular image boundary is presented, which is based on the “Curve-Approximate Method”...
639
Authors: Li Yan Jiang, Ya Ping Zhong, Qing Jian Wu
Chapter 5: Algorithm Design and Applications
Abstract: Sports injuries are common in training and hinder athletes from further improving their results. There are many factors in sports...
1545
Authors: Rui Ren
Chapter 10: Intelligence Algorithm, Optimization Algorithm and their Applications
Abstract: A wireless sensor network is added to traditional GPS to realize double location in this paper. The widely used distributed distance measure...
1561