Automating Data Engineering Workflows with AI and Machine Learning

Authors

  • Aditya Banerjee Computer Vision Engineer, Zensar Technologies, India Author

DOI:

https://doi.org/10.63282/3050-9262.IJAIDSML-V5I2P102

Keywords:

AI in Data Engineering, Machine Learning, Data Quality, Data Pipelines, Automation, Big Data, Schema Inference, Pipeline Optimization, Ethical AI, Real-Time Processing

Abstract

Data engineering is a critical component of modern data-driven organizations, encompassing the extraction, transformation, and loading (ETL) of data, as well as the management and optimization of data pipelines. The increasing volume, velocity, and variety of data pose significant challenges for data engineers, who must ensure that data is accurate, timely, and available for various downstream applications. This paper explores the integration of artificial intelligence (AI) and machine learning (ML) techniques to automate and optimize data engineering workflows. We discuss the current state of data engineering, the challenges faced by data engineers, and the potential benefits of AI and ML in addressing these challenges. We present several case studies and algorithms that demonstrate the effectiveness of AI and ML in automating data engineering tasks, including data quality assessment, schema inference, and pipeline optimization. Finally, we discuss the ethical and practical considerations of deploying AI in data engineering and provide recommendations for future research and development

References

Published

2024-05-18

Issue

Section

Articles

How to Cite

1.
Banerjee A. Automating Data Engineering Workflows with AI and Machine Learning. IJAIDSML [Internet]. 2024 May 18 [cited 2025 Dec. 7];5(2):9-16. Available from: https://ijaidsml.org/index.php/ijaidsml/article/view/55