A Systematic Approach on Data Pre-processing In Data Mining

Authors

  • Baskar SS Research scholar, Department of Computer Science, St. Joseph’s College, Trichirappalli, India.
  • Arockiam L Associate Professor, Department of Computer Science, St. Joseph’s College, Trichirappalli, India
  • Charles S Assistant Professor, Department of Computer Science, St. Joseph’s College, Trichirappalli, India

Keywords:

KDD, Data mining, association rules, Pre-processing algorithms

Abstract

Data pre-processing is an important and critical step in the data mining process and it has a huge impact on the success of a data mining Soil classification. Data pre-processing is a first step of the Knowledge discovery in databases (KDD) process that reduces the complexity of the data and offers better analysis and ANN training. Based on the collected data from the field as well soil testing laboratory, data analysis is performed more accurately and efficiently. Data pre-processing is challenging and tedious task as it involves extensive manual effort and time in developing the data operation scripts. There are a number of different tools and methods used for pre-processing, including: sampling, which selects a representative subset from a large population of data; transformation, which manipulates raw data to produce a single input; denoising, which removes noise from data; normalization, which organizes data for more efficient access; and feature extraction, which pulls out specified data that is significant in some particular context. Pre-processing technique for soil data sets are also useful for classification in data mining.

References

Data Pre-processing & Mining Algorithm, Knowledge & Data Mining & Pre-processing, 3rdedition, Han & Kamber.

Mohd Helmy Abd Wahab, Mohd Norzali Haji Mohd, Hafizul Fahri Hanafi, Mohamad Farhan(1998) Mohamad Mohsin

Agrawal, Rakesh and Ramakrishnan Srikant, “Fast Algorithms for Mining & Preprocessing Assosiation Rules”, Proceedings of the 20th VLDB Conference, Santiago, Chile (1994).

Salleb, Ansaf and Christel Vrain, “An Application of Assosiation Knowledge Discovery and Data Mining (PKDD) 2000, LNAI 1910, pp. 613-618, Springer Verlag (2000).

Agarwal,R and Psaila G, Active Data Mining. In Proceedings on Knowledge Discovery and Data Mining (KDD-95), 1995, 3-8 Menl

Downloads

Published

2024-02-26

How to Cite

Baskar, S., Arockiam, L., & Charles, S. (2024). A Systematic Approach on Data Pre-processing In Data Mining. COMPUSOFT: An International Journal of Advanced Computer Technology, 2(11), 335–339. Retrieved from https://ijact.in/index.php/j/article/view/59

Issue

Section

Review Article

Similar Articles

<< < 7 8 9 10 11 12 13 14 15 16 > >> 

You may also start an advanced similarity search for this article.