p.2094
p.2099
p.2103
p.2107
p.2111
p.2115
p.2121
p.2125
p.2129
E-Mail Filtration and Classification Based on Variable Weights of the Bayesian Algorithm
Abstract:
The co-occurrence word emphasize the word and word internal relations, so its use can improve shortage from the hypothetical of Bayesian algorithm. To build Token Dictionary, Information Gain algorithm is used to choose Tokens, and Synonyms Dictionary is used to acquire more Tokens. By large amounts of training, the matching scores of Token are counted, according to the matching rate the Tokens that is valuable are selected, and the Token Dictionary is established. The proposed method is used to E-mail classification experiment, the results show that the accuracy of spam filter has a well improvement.
Info:
Periodical:
Pages:
2111-2114
Citation:
Online since:
February 2014
Authors:
Keywords:
Price:
Сopyright:
© 2014 Trans Tech Publications Ltd. All Rights Reserved
Share:
Citation: