Research of Inverted Index Method Based on Block Organizing Technology

Abstract:

To further improve the overall efficiency of retrieval systems, this paper proposes an inverted index method based on block organizing technology. The study proceeds in three steps. First, a retrieval performance model of the inverted index is built from data statistics. Second, the organizational strategy of the inverted file block index is analyzed. Finally, the retrieval performance model is verified through simulation experiments. The results show that inverted file block organization achieves higher algorithmic efficiency when the search algorithm runs fewer cycles, and that it significantly reduces the execution time of the search algorithm, which supports the feasibility of the inverted file block index method.
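The abstract does not give the details of the block organization, so the following is only a minimal sketch of the general idea of a block-organized posting list, not the paper's method. It assumes fixed-size blocks, each summarized by its largest document ID, so that a lookup inspects one block instead of scanning the whole list. All names (`BLOCK_SIZE`, `build_index`, `to_blocks`, `contains`, `intersect`) are hypothetical and chosen for illustration.

```python
from bisect import bisect_left
from collections import defaultdict

BLOCK_SIZE = 4  # hypothetical block size, chosen only for illustration


def build_index(docs):
    """Map each term to a sorted list of document IDs (its posting list)."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


def to_blocks(posting_list, block_size=BLOCK_SIZE):
    """Split a sorted posting list into fixed-size blocks.

    Returns (block_maxima, blocks), where block_maxima[i] is the largest
    document ID stored in blocks[i].
    """
    blocks = [posting_list[i:i + block_size]
              for i in range(0, len(posting_list), block_size)]
    return [block[-1] for block in blocks], blocks


def contains(blocked, doc_id):
    """Check membership by locating the single candidate block, then searching it."""
    block_maxima, blocks = blocked
    i = bisect_left(block_maxima, doc_id)  # first block whose maximum >= doc_id
    if i == len(blocks):
        return False  # doc_id is larger than every stored ID
    block = blocks[i]
    j = bisect_left(block, doc_id)
    return j < len(block) and block[j] == doc_id


def intersect(short_list, blocked_long):
    """AND query: keep IDs from the short list that occur in the blocked long list."""
    return [doc_id for doc_id in short_list if contains(blocked_long, doc_id)]
```

In this sketch, a two-term AND query walks the shorter posting list and probes the blocked form of the longer one, so the number of loop iterations depends on the shorter list and the block count rather than on the full length of the longer list.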

Info:

Periodical:

Advanced Materials Research (Volumes 468-471)

Edited by:

Wenzhe Chen, Pinqiang Dai, Yonglu Chen, Dingning Chen and Zhengyi Jiang

Pages:

2836-2841

DOI:

10.4028/www.scientific.net/AMR.468-471.2836

Citation:

X. B. Yang "Research of Inverted Index Method Based on Block Organizing Technology", Advanced Materials Research, Vols. 468-471, pp. 2836-2841, 2012

Online since:

February 2012
