Evaluating the Performance of SQL-Based vs. Python-Based Data Processing in Cloud Computing for Machine Learning Applications

Bharath Muddarla

doi:10.52783/anvi.v28.2817

PDF

Published: Dec 16, 2024

DOI: https://doi.org/10.52783/anvi.v28.2817

Keywords:

Cloud Computing, SQL, Python, Data Processing, Machine Learning, AWS, Data Scalability, Integration Complexity.

Bharath Muddarla, Yashovardhan Chaturvedi

Abstract

In cloud computing environments, efficient data processing is essential for machine learning applications, where the choice of processing tools directly impacts performance and scalability. This study compares SQL-based and Python-based data processing to evaluate their effectiveness in handling large datasets and supporting machine learning workflows. Through experiments on AWS using Amazon Redshift for SQL and Pandas/Dask for Python, we analyzed processing speed, memory utilization, scalability, and integration complexity across different tasks. Results indicate that SQL outperforms Python in speed and memory efficiency for simple, structured data transformations, making it ideal for large-scale data cleaning and aggregation tasks. However, Python offers greater flexibility and seamless integration with machine learning frameworks, proving advantageous for complex transformations and feature engineering. Statistical analyses confirm SQL’s strength in handling high-volume structured data, while Python is better suited for tasks requiring intricate preprocessing and machine learning model integration. These findings suggest that a hybrid approach can combine the strengths of both SQL and Python for optimal data processing in cloud-based machine learning workflows.

Issue

Vol. 28 No. 2s (2025)

Section

Articles

Announcements

Call for Papers

Call for Papers for the Upcoming Issue.

Last Date of Submission: June 30^th, 2025

Call for Reviewers

Call for Editorial Member/ Reviewers Submitting your Application
If you would like to apply for the position of an Editorial Board Member on the journal, please contact the Editor including your CV and a brief covering letter detailing why you are a suitable candidate, to editor@internationalpubls.com. Your cover letter should be no longer than one page and should cover where you believe the research field is going (and the journal's place within it), as well as details of any previous relevant journal editorial and peer review management experience.