Projection Position-Based Sequential Pattern Mining Algorithm

Abstract:

The PrefixSpan algorithm constructs a large number of projected databases while mining sequential patterns, especially on dense datasets and long sequential patterns, which degrades its performance. The Projection position-based Sequential Pattern Mining (PSPM) algorithm addresses this resource problem by reducing both running time and storage space. To avoid generating a large number of projected databases, and to cut unnecessary storage space and scanning time relative to other improved algorithms, PSPM uses projected positions to locate the projected sequences when mining locally frequent items, and deletes the non-frequent items. Experimental results demonstrate that PSPM outperforms PrefixSpan (with pseudo-projection) in runtime performance.
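The idea of locating projected sequences by position, rather than copying them into new projected databases, can be illustrated with a minimal sketch. This is not the authors' PSPM implementation, whose details are not given in the abstract. It is a simplified prefix-growth miner that assumes each sequence is a list of single items; the function name `mine_by_position` and its parameters are hypothetical.

```python
from collections import defaultdict


def mine_by_position(sequences, min_support):
    """Return {pattern tuple: support} for sequences of single items.

    Illustrative sketch only: each projection is stored as a list of
    (sequence index, start offset) pairs instead of copied suffixes.
    """
    results = {}

    def grow(prefix, positions):
        # Count locally frequent items, once per projected sequence.
        counts = defaultdict(int)
        for sid, start in positions:
            seq = sequences[sid]
            seen = set()
            for i in range(start, len(seq)):
                if seq[i] not in seen:
                    seen.add(seq[i])
                    counts[seq[i]] += 1

        for item, support in sorted(counts.items()):
            if support < min_support:
                continue  # locally non-frequent items are skipped
            pattern = prefix + (item,)
            results[pattern] = support

            # Build the next projection from positions only.
            next_positions = []
            for sid, start in positions:
                try:
                    idx = sequences[sid].index(item, start)
                except ValueError:
                    continue  # item does not occur in this suffix
                next_positions.append((sid, idx + 1))
            grow(pattern, next_positions)

    grow((), [(sid, 0) for sid in range(len(sequences))])
    return results
```

In this sketch the original sequences are never copied; each recursion level keeps only integer offsets into them, which is the kind of saving in storage and scanning that the abstract attributes to position-based projection.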

Info:

Pages:

1298-1302

Online since:

December 2012

Copyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved
