Finding Appropriate Lexical Diversity Measurements for Small-Size Corpus

Abstract:

This study applies four lexical diversity measures to sets of word chunks of monotonically increasing size. A computational experiment, combining corpus processing with a statistical test, was conducted to identify the most effective measure for evaluating a small corpus of 350 to 550 words. The results show that D-estimate is the most appropriate of the four measures considered. D-estimate also gives more stable results than the other measures when the number of words varies between texts.
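The abstract does not say how the measures were computed, so the following is only a minimal sketch of the kind of experiment described. It assumes the commonly used vocd-style procedure for D, in which the curve TTR = (D/N)(sqrt(1 + 2N/D) - 1) is fitted to the mean type-token ratio of random subsamples. The subsample sizes (35 to 50 tokens), the 100 trials, the grid search, the chunk step of 50 words, and all function names are illustrative assumptions, not the author's settings.

```python
import math
import random


def ttr(tokens):
    """Type-token ratio: distinct words divided by total words."""
    return len(set(tokens)) / len(tokens)


def model_ttr(n, d):
    """TTR predicted by the vocd-style curve for sample size n and parameter d."""
    return (d / n) * (math.sqrt(1.0 + 2.0 * n / d) - 1.0)


def estimate_d(tokens, sizes=range(35, 51), trials=100, seed=0):
    """Fit d to the mean TTR of random subsamples (assumed settings)."""
    rng = random.Random(seed)
    # Mean empirical TTR at each subsample size, sampling without replacement.
    observed = []
    for n in sizes:
        mean = sum(ttr(rng.sample(tokens, n)) for _ in range(trials)) / trials
        observed.append((n, mean))

    def error(d):
        # Sum of squared differences between the model curve and the data.
        return sum((model_ttr(n, d) - t) ** 2 for n, t in observed)

    # Coarse grid search over d in [1.0, 200.0]; adequate for a sketch.
    candidates = [x / 10.0 for x in range(10, 2001)]
    return min(candidates, key=error)


def chunk_scores(tokens, start=350, stop=550, step=50):
    """Score the first k tokens for each k in an increasing sequence of sizes."""
    upper = min(stop, len(tokens))
    return [(k, ttr(tokens[:k]), estimate_d(tokens[:k]))
            for k in range(start, upper + 1, step)]
```

Calling `chunk_scores(tokens)` on a tokenized text of at least 350 words returns one `(size, TTR, D)` triple per chunk size, which makes it possible to compare how much each measure drifts as the chunk grows.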

Info:

Periodical:

Applied Mechanics and Materials, Vols. 121-126

Edited by:

Dongye Sun, Wen-Pei Sung and Ran Chen

Pages:

1244-1248

DOI:

10.4028/www.scientific.net/AMM.121-126.1244

Citation:

W. H. Choi, "Finding Appropriate Lexical Diversity Measurements for Small-Size Corpus", Applied Mechanics and Materials, Vols. 121-126, pp. 1244-1248, 2012

Online since:

October 2011

Authors:

W. H. Choi
