Cloud Based Data Analytics: A Review
Keywords:
Cloud Computing, Data Analytics, Big Data, Hadoop, MapReduceAbstract
Large volumes of data are generated every second from various sources like social networking platforms, IoT (Internet of Things), sensory devices, wireless communications services, ecommerce platforms, government agencies, to name a few. Regular data processing paradigms yield insignificant results while dealing with data of such large volumes, and are consequently labeled as Big Data. Big Data is a blanket terminology that deals with the storage, management, processing, and most importantly, the analyzing of such data. Cloud Computing has emerged as a technology of paramount importance to modern computing, and deals with providing the infrastructure and computing resources required for such processes in an efficient and cost-effective manner. Various sectors including healthcare, education, and government agencies, are leveraging Big Data to improve decision-making. For example, the medical industry is making use of Big Data to better understand their patients and develop personalized treatment plans, while government agencies are using it to track and prevent fraud, waste, and abuse. This paper presents an in-depth description of cloud computing and big data. We then delve into Big Data analytics where we discuss various Big Data paradigms, and introduce Big Data analytics in the context of cloud computing. Lastly, we discuss the advantages of using Big Data analytics in Cloud Computing as well its limitations and future enhancements for this vast domain.
Downloads
References
Berisha, B., Mëziu, E. & Shabani, I, Big data analytics in Cloud computing: an overview, J Cloud Comp 11, 24 (2022).
Subia Saif, Samar Wazir, Performance Analysis of Big Data and Cloud Computing Techniques: A Survey, Procedia Computer Science, Volume 132, 2018, Pages 118-127, ISSN 1877-0509.
Sangeetha, K. & Prakash, Parvathy. (2015), Big Data and Cloud: A Survey, 10.1007/978-81-322-2135-7_81.
Ahmed, N., Barczak, A.L.C., Susnjak, T. et al, A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench. J Big Data, 7, 110 (2020).
R. Buyya, K. Ramamohanarao, C. Leckie, R. N. Calheiros, A. V. Dastjerdi and S. Versteeg, "Big Data Analytics-Enhanced Cloud Computing: Challenges, Architectural Elements, and Future Directions," 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS), Melbourne, VIC, Australia, 2015, pp. 75-84, doi: 10.1109/ICPADS.2015.18.
Chaowei Yang, Qunying Huang, Zhenlong Li, Kai Liu & Fei Hu (2017), “Big Data and cloud computing: innovation opportunities and challenges, International Journal of Digital Earth”, 10:1, 13-53, DOI: 10.1080/17538947.2016.1239771.
Manoj Muniswamaiah, Tilak Agerwala, Charles Tappert, “Big Data in Cloud Computing Review and Opportunities”, International Journal of Computer Science & Information Technology (IJCSIT) Vol 11, No 4, August 2019.
A. K. Sandhu, "Big data with cloud computing: Discussions and challenges," in Big Data Mining and Analytics, vol. 5, no. 1, pp. 32-40, March 2022, doi: 10.26599/BDMA.2021.9020016.
Gupta, R., Gupta, H., Mohania, M. (2012), “Cloud Computing and Big Data Analytics: What Is New from Databases Perspective?”, In: Srinivasa, S., Bhatnagar, V. (eds) Big Data Analytics. BDA 2012. Lecture Notes in Computer Science, vol 7678. Springer, Berlin, Heidelberg.
A. K. Manekar and G. Pradeepini, "Cloud Based Big Data Analytics a Review," 2015 International Conference on Computational Intelligence and Communication Networks (CICN), Jabalpur, India, 2015, pp. 785-788, doi: 10.1109/CICN.2015.160.
Zanoon, Dr. Nabeel & Alhaj, Abdullah & Khwaldeh, Sufian. (2017), “Cloud Computing and Big Data is there a Relation between the Two: A Study. International Journal of Applied Engineering Research”, 12. 6970-6982.
Ying Liu, Anthony Soroka, Liangxiu Han, Jin Jian, Min Tang, “Cloud-based big data analytics for customer insight-driven design innovation in SMEs, International Journal of Information Management”, Volume 51, 2020, 102034, ISSN 0268-4012.
Khan, S., Shakil, K.A., Alam, M. (2018), “Cloud-Based Big Data Analytics—A Survey of Current Research and Future Directions”, In: Aggarwal, V., Bhatnagar, V., Mishra, D. (eds) Big Data Analytics. Advances in Intelligent Systems and Computing, vol 654. Springer, Singapore.
Marino S, Zhao Y, Zhou N, Zhou Y, Toga AW, Zhao L, et al. (2020), “Compressive Big Data Analytics: An ensemble meta-algorithm for high-dimensional multisource datasets”, PLoS ONE 15(8): e0228520.
Ajimoko, O. J., 2018, “Considerations for the Adoption of Cloud-based Big Data Analytics in Small Business Enterprises”, The Electronic Journal Information Systems Evaluation, 21(2), pp. 63-79.
Shingyu Kim, Junghee Won, Hyuck Han, Hyeonsang Eom, and Heon Y. Yeom. 2011, “Improving Hadoop performance in intercloud environments. SIGMETRICS Perform. Eval”, Rev. 39, 3 (December 2011), 107–109.
Depeige, A., Doyencourt, D, “Actionable Knowledge as A Service (AKAAS): Leveraging big data analytics in cloud computing environments”, Journal of Big Data 2, 12 (2015).
Carretero Pérez, Jesús; et.al. (eds.), (2015) Proceedings of the Second International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2015): Krakow, Poland. Universidad Carlos III de Madrid, pp. 51-62. ISBN: 978-84-608-2581-4.
Naga Raju Hari Manikyam and Dr. S. Mohan Kumar, “Methods and Techniques To Deal with Big Data Analytics and Challenges In Cloud Computing Environment”, International Journal of Civil Engineering and Technology, 8(4), 2017, pp. 669-678.
Rai, Ibrahim. (2018), “Performance Analysis of Big Data Tools. International Journal of Advances in Computer Science and Technology”, 7. 43-48. 10.30534/ijacst/2018/05762018.