AutoETL: A Nonlinear Deep Learning Framework for ETL Automation

G. Sunil Santhosh Kumar

doi:10.52783/cana.v32.2641

Published: Dec 1, 2024

Keywords:

nonlinear ETL framework, data transformation, tokenization, transformer model, reinforcement learning, IoU, structured data alignment, TPC-DI dataset

G. Sunil Santhosh Kumar, M. Rudra Kumar

Abstract

This study presents a nonlinear framework for automating Extract, Transform, Load (ETL) processes. The framework combines natural language processing techniques, transformer-based models, and reinforcement learning to convert unstructured data into structured formats, creating and refining transformation rules based on data patterns. The research addresses challenges in ETL automation, particularly the need to handle complex data relationships without manual input. The framework is tested on the TPC-DI dataset, transforming financial newswire data into a structured warehouse format; the process follows ACID and OpenClass standards. Data is first prepared through tokenization and normalization. A transformer-based model then processes the resulting sequences to identify patterns, and reinforcement learning refines the transformation rules using feedback. Structured data alignment is measured with metrics such as Intersection over Union (IoU), mean average precision (mAP), and mean squared error (MSE). The results show consistent performance across various data thresholds, highlighting the framework's ability to handle diverse data patterns. This research outlines a method to automate data handling while reducing manual involvement, with potential applications across domains.
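
The abstract names IoU and MSE as alignment metrics but does not say how they are computed for structured records. The following is a minimal sketch under one possible reading: IoU is taken as set overlap between predicted and expected warehouse rows, and MSE as the average squared difference between numeric fields. The function names `iou` and `mse` and the sample rows are hypothetical illustrations, not taken from the paper.

```python
def iou(predicted, expected):
    """Set-based Intersection over Union of two collections of records.

    Assumption: each record is hashable (e.g. a tuple of field values).
    """
    p, e = set(predicted), set(expected)
    union = p | e
    if not union:
        # Both sides empty: treated as a perfect match (a convention, assumed).
        return 1.0
    return len(p & e) / len(union)


def mse(predicted, expected):
    """Mean squared error between two equal-length numeric sequences."""
    if len(predicted) != len(expected):
        raise ValueError("sequences must have the same length")
    if not predicted:
        raise ValueError("sequences must be non-empty")
    return sum((p - e) ** 2 for p, e in zip(predicted, expected)) / len(predicted)


# Hypothetical rows: (symbol, date, action). Illustrative data only.
predicted_rows = {
    ("AAPL", "2024-01-02", "BUY"),
    ("MSFT", "2024-01-02", "SELL"),
    ("IBM", "2024-01-03", "BUY"),
}
expected_rows = {
    ("AAPL", "2024-01-02", "BUY"),
    ("MSFT", "2024-01-02", "BUY"),
    ("IBM", "2024-01-03", "BUY"),
}

alignment = iou(predicted_rows, expected_rows)  # 2 shared rows out of 4 distinct
error = mse([10.0, 12.0], [10.0, 14.0])         # (0 + 4) / 2
```

Under this reading, a single wrong field makes the whole row count as a mismatch; a field-level IoU would be more lenient, and the paper may use either.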

Issue

Vol. 32 No. 3s (2025)

Section

Articles
