An Improved Method of Short Text Feature Extraction Based on Words Co-Occurrence
Abstract:
In Chinese text clustering, short text differs markedly from traditional long text, principally in its low word frequencies. As a result, traditional text feature extraction and weight calculation methods are not directly suitable for short text clustering. To solve the problem of clustering drift in short text segments, this paper proposes a feature extraction method that improves weight calculation based on word co-occurrence. Experiments show that the method achieves better performance in Chinese short-text clustering than the traditional TF-IDF method.
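The abstract does not give the weighting formula, so the following is only a minimal sketch of the general idea, not the authors' method. It computes baseline TF-IDF weights and then applies a hypothetical co-occurrence adjustment. The function names, the `alpha` parameter, and the boost formula are all assumptions made for illustration, and the input is assumed to be already segmented into tokens.

```python
import math
from collections import Counter
from itertools import combinations


def tf_idf(docs):
    """Baseline TF-IDF weights for a list of tokenized documents."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return weights


def cooccurrence_counts(docs):
    """Count how many documents each unordered pair of distinct terms shares."""
    pairs = Counter()
    for doc in docs:
        for a, b in combinations(sorted(set(doc)), 2):
            pairs[(a, b)] += 1
    return pairs


def cooccurrence_weights(docs, alpha=0.5):
    """Hypothetical adjustment (an assumption, not the paper's formula):
    scale each TF-IDF weight by 1 + alpha * co, where co is the term's
    average co-occurrence rate with the other terms of the same document.
    """
    base = tf_idf(docs)
    pairs = cooccurrence_counts(docs)
    n = len(docs)
    adjusted = []
    for doc, w in zip(docs, base):
        terms = sorted(set(doc))
        out = {}
        for t in terms:
            others = [u for u in terms if u != t]
            if others:
                total = sum(pairs[tuple(sorted((t, u)))] for u in others)
                co = total / (len(others) * n)
            else:
                co = 0.0
            out[t] = w[t] * (1 + alpha * co)
        adjusted.append(out)
    return adjusted


if __name__ == "__main__":
    # Toy, pre-segmented short texts (placeholder tokens).
    docs = [
        ["phone", "screen", "battery"],
        ["phone", "battery", "life"],
        ["weather", "sunny"],
    ]
    for doc_weights in cooccurrence_weights(docs):
        print(doc_weights)
```

In this sketch, terms that repeatedly appear alongside the same neighbours receive a larger weight than plain TF-IDF assigns, which is one way to compensate for the low term frequencies typical of short text.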
Info:
Pages: 842-845
Online since: February 2014
Copyright: © 2014 Trans Tech Publications Ltd. All Rights Reserved