Accuracy Comparison of Data Imputation Estimation Methods between the Unconstrained Structural Equation Modeling and K-Nearest Neighbors

Narong Phothi; Somchai Prakancharoen

doi:10.4028/www.scientific.net/AMR.403-408.3671

Paper Titles

Research on Evaluation Model of Engineering Equipment Supportability in Operating Phase
p.3652

Comparison between Middle-Face Model and 3D Model and Experiment of Gas-Assisted Injection Molding
p.3656

A New Method of Container Port Hinterland Division
p.3661

The Production Scheduling Problem of Aluminum Casting Based on Theory of Constraints
p.3666

Accuracy Comparison of Data Imputation Estimation Methods between the Unconstrained Structural Equation Modeling and K-Nearest Neighbors
p.3671

Multi-Objective Chaos Memetic Algorithm for DTLZ Problems
p.3676

Adaboost Ensemble Data Classification Based on Diversity of Classifiers
p.3682

Combining Algorithms for Recommendation System on Twitter
p.3688

PM₁₀ Prediction Model by Support Vector Regression Based on Particle Swarm Optimization
p.3693

HomeAdvanced Materials ResearchAdvanced Materials Research Vols. 403-408Accuracy Comparison of Data Imputation Estimation...

Accuracy Comparison of Data Imputation Estimation Methods between the Unconstrained Structural Equation Modeling and K-Nearest Neighbors

Abstract:

This study aimed to the accuracy comparison of data imputation estimation methods between the unconstrained structural equation modeling (Uncon-SEM) and k-nearest neighbors (K-NN). The measurement accuracy of the model based on the mean magnitude of relative error (MMRE). The model is developed by using the online database from University of California, Irvine (UCI) which is a data set on waveform generators. Indicators 21 (1,200 sets) methods were as follows: 1) Data set was divided into two groups (experimental group of 1,000 sets and test group of 200 sets); 2) The experimental group was analyzed by three main factors (F1, F2, F3); 3) Uncon-SEM method: It created a SEM with three main factors, then the remaining factors to be created new the relationships with the unconstrained approach and created new SEM. The test data was substituted in the equation to find the MMRE which was 34.29% (accuracy was 65.71%); 4) K-NN method: It selected the main factor was the relationship of the missing data (F2). Measure the Euclidean distance between the test group and experimental group and selected 5 (K=5) of data sets were nearest to the missing data for the estimate by mean. The MMRE which was 57.00% (accuracy was 43.00%). Thus, comparing estimates of missing data showed that using the Uncon-SEM method were more accuracy, and MMRE declined about 22.71% than K-NN method.

You might also be interested in these eBooks

View Preview

Info:

Periodical:

Advanced Materials Research (Volumes 403-408)

Pages:

3671-3675

DOI:

https://doi.org/10.4028/www.scientific.net/AMR.403-408.3671

Citation:

Cite this paper

Online since:

November 2011

Authors:

Narong Phothi, Somchai Prakancharoen

Keywords:

Data Imputation Estimation, K-Nearest Neighbor (KNN), Structural Equation Modeling (SEM), Unconstrained Structural Equation Modeling

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

References

[1] S. Prakancharoen: The estimated time to develop application software oriented network using structural equation modeling. Information Technology Journal. Year 4, Vol. 7, Bangkok: King Mongkut's University of Technology North Bangkok, (2008).

DOI: 10.7763/ijiet.2016.v6.738

Google Scholar

[2] N. Phothi and S. Prakancharoen: Accuracy Comparison of Imputation Methods Using Structural Equation Modeling Between With Discriminant Analysis and Without Discriminant Analysis. Conference on Science and Technology No. 8. Pathum Thani: Thammasat University Rangsit Campus, (2010).

DOI: 10.12982/nlsc.2023.010

Google Scholar

[3] L.C. Rufus: Solutions for Missing Data in Structural Equation Modeling. Research & Practice in Assessment. Vol. 1, Issue 1, March (2006).

Google Scholar

[4] P. Meesad and K. Hengpraprohm: Combination of KNN-Based Feature Selection and KNN-Based Missing-Value Imputation of Microarray Data. The 3rd Intetnational Conference on Innovative Computing Information and Control (ICICIC'08). IEEE computer, (2008).

DOI: 10.1109/icicic.2008.635

Google Scholar

[5] B. Thomas: K-Nearest Neighbors Algorithm: Prediction and Classification. Department of Economics Southern Methodist University Dallas, TX 75275 February (2008).

Google Scholar

[6] B. Leo, H.F. Jerome, O. Adam and S. Jonathan: Classification and Regression Trees. Wadsworth International Group. California, (1984).

Google Scholar

[7] K. Vanitbancha: Multivariate Data Analysis. Vol. 2. Bangkok: Chulalongkorn University Book Center, (2007).

Google Scholar

[8] K.G. Jöreskog and F. Yang: Nonlinear structural equation models: The Kenny-Judd model with interaction effects. In G. Marcoulides & R. Schumacker (Eds. ), Advanced structural equation modeling. Mahwah, NJ: Lawrence Erlbaum Associates. (1996).

DOI: 10.1080/10705510701758448

Google Scholar

[9] H.W. Marsh, Z. Wen and K.T. Hau: Structural equation models of latent interactions: Evaluation of alternative estimation strategies and indicator construction. Psychological Methods, 9, (2004), p.275–300.

DOI: 10.1037/1082-989x.9.3.275

Google Scholar

[10] O. Troyanskaya, M. Cantor, G. Sherlock, P. Brown, T. Hastie, R. Tibshirani, D. Botstein and R.B. Altman: Missing values estimation methods for DNA microarrays. Bioinformatics, Vol. 17, (2001), pp.520-525.

DOI: 10.1093/bioinformatics/17.6.520

Google Scholar