An Algorithm of LDA Topic Reduction Based on Rough Set

Article Preview

Abstract:

The topic model LDA can uncover latent topics in text mining, through the probability distribution of words in the text to get the distribution of topics. This approach ignores the correlation between the topics, so in some actual domain it is not enough to reflect the real situations of the topics. We can divide the topics into two classes: key topics and unimportant topics. The key topics can reflect the word attributes well, and other topics can be look as subordinate. Considering the relations of the topics, a topic reduction algorithm is proposed to retain the key topics and delete the redundant topics based on rough set. Because the LDA topics exists uncertainty distribution and rough set can deal with uncertain data well, so the algorithm based on rough set can improve the accuracy of topics analysis.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1593-1596

Citation:

Online since:

October 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Eidelman V, Boyd-Graber J, Resnik P. Topic models for dynamic translation model adaptation. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers-Volume 2. Association for Computational Linguistics, pp.115-119, (2012).

DOI: 10.3115/v1/p14-1110

Google Scholar

[2] Blei D M. Probabilistic topic models. Communications of the ACM, Vol. 55, No. 4, pp.77-84, (2012).

Google Scholar

[3] Yao Y, Yao B. Covering based rough set approximations. Information Sciences, pp.91-107, (2012).

DOI: 10.1016/j.ins.2012.02.065

Google Scholar

[4] Blei D M, Ng A Y, Jordan M I. Latent dirichlet allocation. the Journal of machine Learning research, No. 3, pp.993-1022, (2003).

Google Scholar

[5] Zhang J, Li T, Ruan D, et al. Rough sets based matrix approaches with dynamic attribute variation in set-valued information systems. International Journal of Approximate Reasoning, Vol. 53, No. 4, pp.620-635, (2012).

DOI: 10.1016/j.ijar.2012.01.001

Google Scholar

[6] Huang C C, Tseng T L B, Jiang F, et al. Rough set theory: a novel approach for extraction of robust decision rules based on incremental attributes. Annals of Operations Research, pp.1-27, (2013).

DOI: 10.1007/s10479-013-1352-1

Google Scholar

[7] Grzymala-Busse J W, Hippe Z S. Mining Incomplete Data—A Rough Set Approach[M]/Emerging Paradigms in Machine Learning. Springer Berlin Heidelberg, pp.49-74, (2013).

DOI: 10.1007/978-3-642-28699-5_4

Google Scholar

[8] Meng Z, Shi Z. Extended rough set-based attribute reduction in inconsistent incomplete decision systems. Information Sciences, No. 204, pp.44-69, (2012).

DOI: 10.1016/j.ins.2012.04.004

Google Scholar