A Microblog Classification Scheme Based on Partial Indexing

Article Preview

Abstract:

As a new media platform, the social network like microblog has aroused wide concern in recent years. People become more and more interested in information from microblog. How to retrieval the huge amount of newly created contents in real time is the key issue of microblog, so real-time search is a scalable indexing service that microblog must provide. Analysis on Sina microbolg, a microblog classification and indexing process has been put forward on basis of partial indexing mechanism. Instead of indexing the whole dataset, a partial index is to index the records that may be queried with high probability. The classification scheme can effectively save the real-time search costs and obtain microblog data with high quality.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

1927-1930

Citation:

Online since:

August 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Li Yungming,Hsiao Hanwen. Recommender service for social network based application. Proc of the 11th International Conference on Electronic Commerce. New York: ACM Press,2009; 378—381.

Google Scholar

[2] Chun Chen, Feng Li, Beng Chin Ooi, Sai Wu: TI: an efficient indexing mechanism for real-time search on tweets. SIGMOD Conference 2011: 649-660.

DOI: 10.1145/1989323.1989391

Google Scholar

[3] B. J. Jansen, G. Campbell, and M. Gregg. Real time search user behavior. In CHI, pages 3961–3966, (2010).

Google Scholar

[4] Busch, M. Gade, K. ; Larson, B. et al. Earlybird: Real-Time Search at Twitter. 2012 IEEE 28th International Conference on Data Engineering (ICDE), Page(s): 1360 – 1369, (2012).

DOI: 10.1109/icde.2012.149

Google Scholar

[5] http: /blog. sina. com. cn.

Google Scholar