Big data quality factors, frameworks and challenges

Authors

  • Abdallah M Faculty of Science and Information Technology, Al-Zaytoonah University of Jordan, P.O. Box 130, Amman (11733) Jordan
  • Muhairat M Assistant Professor Department of Software Engineering, Al-Zaytoonah University, Jordan
  • Althunibat A Assistant Professor Department of Software Engineering, Al-Zaytoonah University, Jordan
  • Abdalla A Associated Professor, Department of Software Engineering, Al-Zaytoonah University, Jordan

Keywords:

Big Data, Quality Dimension, Quality Factors, Quality Frameworks, Quality Challenges

Abstract

Big Data applications are widely used in many fields such as artificial intelligence, marketing, commercial applications and health care, as demonstrated by the role of Big Data in coping with the COVID-19 pandemic. Therefore, it is essential to ensure the quality of the generation and use of Big Data applications. Consequently, Big Data applications must satisfy quality factors suited for these applications. Furthermore, quality frameworks need to be applied and tested for the quality factors of Big Data applications. Nevertheless, the quality measurement process needs to overcome some challenges for it to become applicable and trustworthy. This research lists different quality factors and dimensions and describes quality frameworks that are commonly used to measure the quality of Big Data. Furthermore, it lists the frequent challenges that researchers and data scientists face throughout the Big Data quality measurement process. Finally, it outlines the solutions that need to be developed for confronting the challenges of Big Data quality.

References

M. van Rijmenam,“A Short History Of Big Data,” 2013. Available: https://datafloq.com/read/big-data-history/239. Last Accessed: April 25, 2020.

F.Rider, The Scholar and theFuture of the Research Library: A Problem and Its Solution: Hadham Press, NewYork, 1944.

S. Sagirogluand D. Sinanc,“Big data: A review,” in 2013 International Conference on Collaboration Technologies and Systems (CTS), IEEE, 2013. doi: 10.1109/CTS.2013.6567202.

M.Kataria and M.P. Mittal, “Big data: a review,”International Journal of Computer Science and Mobile Computing, vol. 3, no. 7,pp. 106-110, July 2014. https://ijcsmc.com/docs/papers/July2014/V3I7KJ06.pdf.

D. Reinsel and J. Gantz,“The Digital Universe in 2020,”Dec. 2012. Available: https://www.emc.com/leadership/digitaluniverse/012iview/index.htm. Last Accessed: April 28, 2020.

E.Dumbill, “Making Sense of Big Data,”Big Data, vol. 1, pp. 1-2,2013, doi: 10.1089/big.2012.1503.

D.Laney, “3D Management: Controlling Data Volume, Velocity, and Variety,” in Application Delivery Strategies, META Group, 2001. Available:https://blogs.gartner.com.

N. Khan, A. Naim, M. Rashid Hussain, Q.N. Naveed, N. Ahmad, and S.Qamar,“The 51 V’s Of Big Data: Survey, Technologies, Characteristics, Opportunities, Issues and Challenges,” in Proceedings of the International Conference on Omni-Layer Intelligent Systems (COINS ’19), Association for Computing Machinery, New York, NY, USA, 2019, pp. 19–24, doi: 10.1145/3312614.3312623.

A. Gandomi and M. Haider, “Beyond the hype: Big data concepts, methods, and analytics,”International Journal of Information Management, vol. 35, no. 2,pp. 137-144,2015, doi: 10.1016/j.ijinfomgt.2014.10.007.

https://one.gov.jo [Last Accessed July 13, 2020].

A.Coelho VazHenriques, F. Meirelles, and M.A. Cunha, “Big data analytics: achievements, challenges, and research trends, ”Independent Journal of Management & Production (IJM&P),vol. 11, no. 4,pp. 1201-1222,2020, doi: 10.14807/ijmp.v11i4.1085.

M. Abdallah, “Big Data Quality Challenges,” in 2019 International Conference on Big Data and Computational Intelligence (ICBDCI), Mauritius,2019, pp. 1-3,doi: 10.1109/ICBDCI.2019.8686099.

C.Batini, A. Rula, M. Scannapieco, and G. Viscusi, “From Data Quality to Big Data Quality,”Journal of Database Management, vol. 26,pp. 60-82, 2015, doi: 10.4018/JDM.2015010103.

D.M.Strong, Y.W. Lee, and R.Y. Wang, “Data quality in context,”Commun. ACM, vol. 40, no. 5,pp. 103-110,1997.

A. Ramasamy and S. Chowdhury, “Big Data Quality Dimensions: A Systematic Literature Review,”Journal of Information Systems and Technology Management – Jistem USP,vol. 17,pp. 1-13,2020, doi: 10.4301/S1807-1775202017003.

L.L.Pipino, Y.W. Lee, and R.Y. Wang, “Data quality assessment,”Commun. ACM, vol. 45, no. 4,pp. 211-218,2002, doi: 10.1145/505248.506010.

F. Sidi, P. H. ShariatPanahy, L. S. Affendey, M. A. Jabar, H. Ibrahim, and A. Mustapha, “Data quality: A survey of data quality dimensions,” in2012 International Conference on Information Retrieval & Knowledge Management, Kuala Lumpur, 2012, pp. 300-304,doi:

1109/InfRKM.2012.6204995..

F.I.Salih, S.A. Ismail, M.M. Hamed, O.M.Yusop, A.Azm, and N.F.M.Azmi,“Data Quality Issues in Big Data: A Review,” in 3rd International Conference of Reliable Information and Communication Technology (IRICT 2018),in F. Saeed, N.Gazem, F. Mohammed, A.Busalim, Eds., Recent Trends in Data Science and Soft Computing, Cham: Springer International Publishing, 2019.

M.Mirzaie, B. Behkamal, and S. Paydar, “State of the Art on the Quality of Big Data: A Systematic Literature Review and Classification Framework,” 2019. arXiv preprint

arXiv:1904.05353.

M.Mirzaie, B. Behkamal, and S. Paydar, “Big Data Quality: A systematic literature review and future research directions,”2019, arXiv preprint arXiv:1904.05353.

N.Abdullah, S.A. Ismail, S.Sophiayati, and S.M. Sam,“Data quality in big data: A review,”Int. J. Advance Soft Compu. Appl.,vol. 7, no. 3,pp. 16-27,2015. Available: http://home.ijasca.com/data/documents/IJASCA-SI-070302_Pg16-27_Data-Quality-in-Big-Data-A-Review.pdf

H.J.Hadi, A.H. Shnain, S. Hadishaheed, and A.H. Ahmad, “BigDataand Five V'sCharacteristics,”International Journal of Advances in Electronics and Computer Science, vol. 2, no. 1,pp.16-23,2015.Available: http://www.iraj.in/journal/journal_file/journal_pdf/12-105- 142063747116-23.pdf.

Ishwarappa and J. Anuradha, “A Brief Introduction on Big Data 5Vs Characteristics and Hadoop Technology,”Procedia Computer Science, vol. 48,pp. 319-324,2015, doi: 10.1016/j.procs.2015.04.188.

Downloads

Published

2024-02-26

How to Cite

Abdallah, M., Muhairat, M., Althunibat, A., & Abdalla, A. (2024). Big data quality factors, frameworks and challenges. COMPUSOFT: An International Journal of Advanced Computer Technology, 9(08), 3785–3790. Retrieved from https://ijact.in/index.php/j/article/view/583

Issue

Section

Review Article

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.