p.1259
p.1265
p.1271
p.1275
p.1283
p.1289
p.1295
p.1301
p.1307
Combining Burst Detection for Hot Topic Extraction
Abstract:
As traditional text representations are not suitable for online dynamic streams, this paper presents a hot topic extraction technique that can be used for tracking news topics over time. The model combines individual word burst into the document-word vector representation, which can emphasize the temporally features of text streams. An energy ratio threshold based burst detection approach is proposed and TF-PDF is then combined to weigh the terms. Experiment results demonstrate that this model is effective in topic extraction for news stream and it can better improve the clustering performance.
Info:
Periodical:
Pages:
1283-1288
Citation:
Online since:
July 2011
Authors:
Keywords:
Price:
Сopyright:
© 2011 Trans Tech Publications Ltd. All Rights Reserved
Share:
Citation: