Accuracy Comparison of Data Imputation Estimation Methods between the Unconstrained Structural Equation Modeling and K-Nearest Neighbors

Article Preview

Abstract:

This study aimed to the accuracy comparison of data imputation estimation methods between the unconstrained structural equation modeling (Uncon-SEM) and k-nearest neighbors (K-NN). The measurement accuracy of the model based on the mean magnitude of relative error (MMRE). The model is developed by using the online database from University of California, Irvine (UCI) which is a data set on waveform generators. Indicators 21 (1,200 sets) methods were as follows: 1) Data set was divided into two groups (experimental group of 1,000 sets and test group of 200 sets); 2) The experimental group was analyzed by three main factors (F1, F2, F3); 3) Uncon-SEM method: It created a SEM with three main factors, then the remaining factors to be created new the relationships with the unconstrained approach and created new SEM. The test data was substituted in the equation to find the MMRE which was 34.29% (accuracy was 65.71%); 4) K-NN method: It selected the main factor was the relationship of the missing data (F2). Measure the Euclidean distance between the test group and experimental group and selected 5 (K=5) of data sets were nearest to the missing data for the estimate by mean. The MMRE which was 57.00% (accuracy was 43.00%). Thus, comparing estimates of missing data showed that using the Uncon-SEM method were more accuracy, and MMRE declined about 22.71% than K-NN method.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 403-408)

Pages:

3671-3675

Citation:

Online since:

November 2011

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] S. Prakancharoen: The estimated time to develop application software oriented network using structural equation modeling. Information Technology Journal. Year 4, Vol. 7, Bangkok: King Mongkut's University of Technology North Bangkok, (2008).

DOI: 10.7763/ijiet.2016.v6.738

Google Scholar

[2] N. Phothi and S. Prakancharoen: Accuracy Comparison of Imputation Methods Using Structural Equation Modeling Between With Discriminant Analysis and Without Discriminant Analysis. Conference on Science and Technology No. 8. Pathum Thani: Thammasat University Rangsit Campus, (2010).

DOI: 10.12982/nlsc.2023.010

Google Scholar

[3] L.C. Rufus: Solutions for Missing Data in Structural Equation Modeling. Research & Practice in Assessment. Vol. 1, Issue 1, March (2006).

Google Scholar

[4] P. Meesad and K. Hengpraprohm: Combination of KNN-Based Feature Selection and KNN-Based Missing-Value Imputation of Microarray Data. The 3rd Intetnational Conference on Innovative Computing Information and Control (ICICIC'08). IEEE computer, (2008).

DOI: 10.1109/icicic.2008.635

Google Scholar

[5] B. Thomas: K-Nearest Neighbors Algorithm: Prediction and Classification. Department of Economics Southern Methodist University Dallas, TX 75275 February (2008).

Google Scholar

[6] B. Leo, H.F. Jerome, O. Adam and S. Jonathan: Classification and Regression Trees. Wadsworth International Group. California, (1984).

Google Scholar

[7] K. Vanitbancha: Multivariate Data Analysis. Vol. 2. Bangkok: Chulalongkorn University Book Center, (2007).

Google Scholar

[8] K.G. Jöreskog and F. Yang: Nonlinear structural equation models: The Kenny-Judd model with interaction effects. In G. Marcoulides & R. Schumacker (Eds. ), Advanced structural equation modeling. Mahwah, NJ: Lawrence Erlbaum Associates. (1996).

DOI: 10.1080/10705510701758448

Google Scholar

[9] H.W. Marsh, Z. Wen and K.T. Hau: Structural equation models of latent interactions: Evaluation of alternative estimation strategies and indicator construction. Psychological Methods, 9, (2004), p.275–300.

DOI: 10.1037/1082-989x.9.3.275

Google Scholar

[10] O. Troyanskaya, M. Cantor, G. Sherlock, P. Brown, T. Hastie, R. Tibshirani, D. Botstein and R.B. Altman: Missing values estimation methods for DNA microarrays. Bioinformatics, Vol. 17, (2001), pp.520-525.

DOI: 10.1093/bioinformatics/17.6.520

Google Scholar