Filling the Missing Data of Air Pollutant Concentration Using Single Imputation Methods

M.N. Noor; A.S. Yahaya; N.A. Ramli; Mohd Mustafa Al Bakri Abdullah

doi:10.4028/www.scientific.net/AMM.754-755.923

Paper Titles

Improvement on the Catalytic Performance Using Dual Lipases System in the Synthesis of Ferulate Esters
p.902

Effect of Sintering Temperature on Different Ca Content in Mg-Ca Composite Using Powder Metallurgy Technique
p.907

Experimental Investigation of In Situ Soot Oxidation Using Electromagnetic Waves
p.912

Characterization of Difference Size of IDE Pattern for Formaldehyde Detection Sensor
p.917

Filling the Missing Data of Air Pollutant Concentration Using Single Imputation Methods
p.923

Machining Performance Study of a New Palm Oil Based Bio-Product Industrial Wax
p.935

Spinach Ferredoxin (Fdx) as an Organic Material to Improve Optical Band Gap of Chitosan (Cs) Biofilm
p.939

The Effects of Branched-Tail Structure of Surfactant on the Phase Behaviour of Alkylglucoside/Water/n-Octane Ternary System
p.944

Adsorptive Removal of Ni²⁺ from Aqueous Solution Using Rice Husk-Based Activated Carbon
p.950

HomeApplied Mechanics and MaterialsApplied Mechanics and Materials Vols. 754-755Filling the Missing Data of Air Pollutant...

Filling the Missing Data of Air Pollutant Concentration Using Single Imputation Methods

Abstract:

Hourly measured PM₁₀concentration at eight monitoring stations within peninsular Malaysia in 2006 was used to conduct the simulated missing data. The gap lengths of the simulated missing values are limited to 12 hours since the actual trend of missingness is considered short. Two percentages of simulated missing gaps were generated that are 5 % and 15 %. A number of single imputation methods (linear interpolation (LI), nearest neighbour interpolation (NN), mean above below (MAB), daily mean (DM), mean 12-hour (12M), mean 6-hour (6M), row mean (RM) and previous year (PY)) were calculated to fill in the simulated missing data. In addition, multiple imputation (MI) was also conducted to compare between the single imputation methods. The performances were evaluated using four statistical criteria namely mean absolute error, root mean squared error, prediction accuracy and index of agreement. The results show that 6M perform comparably well to LI. Thus, this show that the effect of smaller averaging time gives better prediction. Other single imputation methods predict the missing data well except for PY. RM and MI performs moderately with the increasing performance in higher fraction of missing gaps whereas LR makes the worst methods for both simulated missing data percentages.

You might also be interested in these eBooks

Advanced Materials Engineering and Technology III

View Preview

Info:

Periodical:

Applied Mechanics and Materials (Volumes 754-755)

Pages:

923-932

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.754-755.923

Citation:

Cite this paper

Online since:

April 2015

Authors:

M.N. Noor, A.S. Yahaya, N.A. Ramli, Mohd Mustafa Al Bakri Abdullah

Keywords:

Missing Data, Multiple Imputation, Performance Indicators, PM₁₀, Single Imputation

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

References

[1] Bello, A.L.: Imputation techniques in regression analysis: Looking closely at their implementation. Computational Statistics & Data Analysis 20, pp.45-57 (1995).