Automatic Construction of Collocation Dictionary Based on Text Mining

Article Preview

Abstract:

A collocation dictionary is a useful component to many natural language and spoken language processing application, such as grammar checking, text-speech conversion and machine translation. Currently the collocation dictionary is constructed by human. Firstly, it may not be updated frequently and many lexicon entries may be not available. Secondly, to construct such a dictionary may need lots of human resources. In this paper, a data-mining approach for constructing a collocation dictionary is surveyed. The main purpose is to enable cheap and quick acquisition of a collocation dictionary from a large-scale text corpus. Experimental results show the approach is effective and suitability.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 532-533)

Pages:

1243-1247

Citation:

Online since:

June 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] K. Church and P. Hanks, Word Association Norms, Mutual Information, and Lexicography, Computational Linguistics, vol 16, pp.22-29, (1990).

DOI: 10.3115/981623.981633

Google Scholar

[2] R. Agrawal and R. Srikant, Mining Sequential Patterns, Proc. of 11th International Conference On Data Engineering. IEEE Computer Society, Taipei, Taiwan, (1995).

Google Scholar

[3] J. Han, M. Kamber and J. Pei, Data Mining: Concepts and Techniques, 2nd ed., Morgan Kaufmann, (2005).

Google Scholar

[4] M. Rajman and R. Besancon, Text Mining: Natural Language techniques and Text Mining applications, Proc. of the 7th IFIP Working Conference on Database Semantics (DS-7), Leysin, Switzerland, (1997).

Google Scholar

[5] R. Agrawal and R. Srikant, Fast Algorithms for Mining Association Rules, Proc. of 20th VLDB conference, Santiago, Chile, (1994).

Google Scholar