An Expanded Feature Extraction of E-Mail Header for Spam Recognition

Article Preview

Abstract:

Currently a spam filtering method is extracting attributes from e-mail header and using machine learning methods to classify the sample sets. But as time goes on, spammers transform different ways to send spam, which result in a great change of spam's header. So the attributes defined in the past could not deal with this change sufficiently. This paper extracted attributes from all possible forged header fields to expand the feature sets, then used the rough set theory to classify the sample sets. Experiment validated more attributes including in feature sets may lead to greater performance, in terms of higher recall and precision, lower fake recognition than other algorithms.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 846-847)

Pages:

1672-1675

Citation:

Online since:

November 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2014 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

* - Corresponding Author

[1] Carpinter J, Hunt R . Tightening the net: A review of current and next generation spam filtering tools [J]. Computers & Security, 2006, 25 (8): 566-578.

DOI: 10.1016/j.cose.2006.06.001

Google Scholar

[2] Md. Saiful Islam, Abdullah Al Mahmud and Md. Rafiqul Islam. Machine learning approaches for modeling spammer behavior [J]. Information Retrieval Technology, 2010, 251 to 260.

DOI: 10.1007/978-3-642-17187-1_24

Google Scholar

[3] Deng Wei-bin, Wang Guo-yin, Hong Zhi-yong. Weighted naive bayes spam filtering method based on rough set [J]. Comuputer Science, 2011, 2: 218-221.

Google Scholar

[4] Chih-Hung Wu, Chiung-Hui Tsai. A time-robust spam classified based on back-propagation neural networks and behavior-based features [C]. Proceedings of the Sixth International Conference on Machine Learning and Cybernetics, 2007, 19-22.

DOI: 10.1109/icmlc.2007.4370519

Google Scholar

[5] Wang Guo-yin. Rough set theoretic and knowledge acquisition [M]. Xi'an JiaoTong University Press, (2001).

Google Scholar

[6] Chih-Hung Wu. Behavior-based spam detection using a hybrid method of rule-based techniques and neural networks [J]. Expert Systems With Applications, 2009, 36 (3): 4321 ~ 4330.

DOI: 10.1016/j.eswa.2008.03.002

Google Scholar

[7] Wang Jue and Miao Duo - qian. Analysis on attribute reduction strategiesof Rough set [J]. Journal of Computer Science and Technology, 1998, 13 (2): 189 ~ 192.

Google Scholar

[8] Blanzieri Enrico, Bryl Anton. A survey of learning-based techniques of email spam filtering [J]. Artificial Intelligence Review. 2008, 29 (1): 63 ~ 92.

DOI: 10.1007/s10462-009-9109-6

Google Scholar