A Outlier Identification and Correction Method Based on Wavelet Transform

Article Preview

Abstract:

There are many outliers in air pollution time series data for various reasons. It has a serious impact on the data analysis and use. There are three main ways to identify anomalies but they each have definite limitations, especially when identifying and correcting the first category and the second category of outlier at the same time. In order to solve this problem, this paper presents a new way to identify anomalies based on wavelet transform and identify outlier by the use of the wavelet transform modulus maxima , then pass the amendment of the outlier through inverse transform the wavelet transform coefficient. Evidence shows that this method can be used to identify and correct the two types of outlier simultaneously and the results are obvious.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 113-116)

Pages:

1485-1489

Citation:

Online since:

June 2010

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2010 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Bickel DR. Robust Estimators of the Mode and Skewness of continuous data. Computational Statistics and Data Analysis. 2008, 39(2): 153-163.

DOI: 10.1016/s0167-9473(01)00057-3

Google Scholar

[2] He Zeng-you, Xu Xiao-fei, Deng Sheng-chun. Discovering Cluster-based Local Outliers [J]. Pattern Recognition Letters. 2003, 24(9-10) : 1641-1650.

DOI: 10.1016/s0167-8655(03)00003-5

Google Scholar

[3] Edwin M. Knorr, Raymond T. Ng. Algorithms for Mining Distance-based Outliers in Large Datasets [A]. Proceedings of the 24th International Conference on Very Large Data Bases[C]. New York, Morgan Kaufmann, 1998: 392-403.

Google Scholar

[4] Edwin M. Knorr, Raymond T. Ng. Finding in Tensional Knowledge of Distance-based Outliers.

Google Scholar

[5] Markus M. Breunig, Hans-Peter Kriegel, Raymond T. Ng, etc. LOF: Identifying Density-based Local Outliers[A]. Proceedings of the ACM SIGMOD International Conference on Management of Data[C]. Dallas, Texas: ACM Press, 2009: 93-104.

DOI: 10.1145/335191.335388

Google Scholar

[6] Jiang M. F, Teng S. S, Su C.M. Two-phase Clustering Process for Outliers Detection[J]. Pattern Recognition Letters. 2001, 22(6-7): 691-700.

DOI: 10.1016/s0167-8655(00)00131-8

Google Scholar

[7] CHEN Bang-wen, SHI Lin-fen, YANG Hong-jin. A Model and a Method for Evaluating the Quality of Statistical Data and Identifying Outliers from the Data. Systems Engineering. 2001, 19(3): 85-89.

Google Scholar