A Systematic Approach on Data Pre-processing In Data Mining
Keywords:
KDD, Data mining, association rules, Pre-processing algorithmsAbstract
Data pre-processing is an important and critical step in the data mining process and it has a huge impact on the success of a data mining Soil classification. Data pre-processing is a first step of the Knowledge discovery in databases (KDD) process that reduces the complexity of the data and offers better analysis and ANN training. Based on the collected data from the field as well soil testing laboratory, data analysis is performed more accurately and efficiently. Data pre-processing is challenging and tedious task as it involves extensive manual effort and time in developing the data operation scripts. There are a number of different tools and methods used for pre-processing, including: sampling, which selects a representative subset from a large population of data; transformation, which manipulates raw data to produce a single input; denoising, which removes noise from data; normalization, which organizes data for more efficient access; and feature extraction, which pulls out specified data that is significant in some particular context. Pre-processing technique for soil data sets are also useful for classification in data mining.
References
Data Pre-processing & Mining Algorithm, Knowledge & Data Mining & Pre-processing, 3rdedition, Han & Kamber.
Mohd Helmy Abd Wahab, Mohd Norzali Haji Mohd, Hafizul Fahri Hanafi, Mohamad Farhan(1998) Mohamad Mohsin
Agrawal, Rakesh and Ramakrishnan Srikant, “Fast Algorithms for Mining & Preprocessing Assosiation Rules”, Proceedings of the 20th VLDB Conference, Santiago, Chile (1994).
Salleb, Ansaf and Christel Vrain, “An Application of Assosiation Knowledge Discovery and Data Mining (PKDD) 2000, LNAI 1910, pp. 613-618, Springer Verlag (2000).
Agarwal,R and Psaila G, Active Data Mining. In Proceedings on Knowledge Discovery and Data Mining (KDD-95), 1995, 3-8 Menl
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2013 COMPUSOFT: An International Journal of Advanced Computer Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.
©2023. COMPUSOFT: AN INTERNATIONAL OF ADVANCED COMPUTER TECHNOLOGY by COMPUSOFT PUBLICATION is licensed under a Creative Commons Attribution 4.0 International License. Based on a work at COMPUSOFT: AN INTERNATIONAL OF ADVANCED COMPUTER TECHNOLOGY. Permissions beyond the scope of this license may be available at Creative Commons Attribution 4.0 International Public License.