Automatic Construction of Collocation Dictionary Based on Text Mining

Hui Zhang; Yong Qi Wang

doi:10.4028/www.scientific.net/AMR.532-533.1243

Abstract:

A collocation dictionary is a useful component of many natural language and spoken language processing applications, such as grammar checking, text-to-speech conversion and machine translation. Currently, collocation dictionaries are constructed by hand, which has two drawbacks. First, such a dictionary may not be updated frequently, so many lexicon entries may be unavailable. Second, constructing it may require substantial human resources. In this paper, a data-mining approach for constructing a collocation dictionary is surveyed. The main purpose is to enable cheap and quick acquisition of a collocation dictionary from a large-scale text corpus. Experimental results show that the approach is effective and suitable.
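The abstract does not spell out the algorithm, but the paper's keywords name Mutual Information (MI) as one ingredient. As a rough illustration only, and not the authors' method, the sketch below scores adjacent word pairs in a toy corpus by pointwise mutual information. The function name `pmi_collocations`, the `min_count` threshold, the whitespace tokenizer, and the sample sentences are all assumptions made for this example.

```python
import math
from collections import Counter


def pmi_collocations(sentences, min_count=2):
    """Rank adjacent word pairs by pointwise mutual information (PMI).

    Hypothetical helper for illustration; it is not taken from the paper.
    """
    unigrams = Counter()
    bigrams = Counter()
    for sentence in sentences:
        tokens = sentence.lower().split()  # naive whitespace tokenizer (assumption)
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))  # adjacent pairs within a sentence

    n_words = sum(unigrams.values())
    n_pairs = sum(bigrams.values())

    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue  # rare pairs give unreliable PMI estimates
        p_pair = count / n_pairs
        p_w1 = unigrams[w1] / n_words
        p_w2 = unigrams[w2] / n_words
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))

    # Highest-scoring pairs first
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy corpus (invented for this example)
corpus = [
    "strong tea is served hot",
    "she likes strong tea",
    "he drinks strong tea with milk",
    "the coffee was hot",
]

ranked = pmi_collocations(corpus, min_count=2)
for pair, score in ranked:
    print(pair, round(score, 2))
```

On a real corpus the frequency threshold matters, because PMI tends to overrate pairs that occur only once or twice. A practical pipeline would also need proper tokenization and, most likely, part-of-speech filtering.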

Info:

Periodical: Advanced Materials Research (Volumes 532-533)

Pages: 1243-1247

DOI: https://doi.org/10.4028/www.scientific.net/AMR.532-533.1243

Online since: June 2012

Authors: Hui Zhang, Yong Qi Wang

Keywords: Association Rule Mining, Collocation Dictionary, Mutual Information (MI), Text Mining
