Big data quality factors, frameworks and challenges
Keywords:
Big Data, Quality Dimension, Quality Factors, Quality Frameworks, Quality Challenges
Abstract
Big Data applications are widely used in fields such as artificial intelligence, marketing, commercial applications, and health care, as demonstrated by the role of Big Data in coping with the COVID-19 pandemic. It is therefore essential to ensure quality in both the generation and the use of Big Data applications. These applications must satisfy quality factors suited to them, and quality frameworks need to be applied and tested against those factors. The quality measurement process, however, must overcome several challenges before it becomes applicable and trustworthy. This research lists quality factors and dimensions and describes the quality frameworks commonly used to measure the quality of Big Data. It also lists the challenges that researchers and data scientists frequently face throughout the Big Data quality measurement process. Finally, it outlines the solutions that need to be developed to confront the challenges of Big Data quality.
License
Copyright (c) 2020 COMPUSOFT: An International Journal of Advanced Computer Technology
This work is licensed under a Creative Commons Attribution 4.0 International License.