Voice Conversion using Hybrid CNN BiLSTM-WaveNet Deep Learning Models


A. Bala Raju, S. P Singh, Dhiraj Sunehra

Abstract

Voice conversion is an active area of speech processing in which deep learning models are developed to modify the vocal qualities of one speaker to resemble the voice of another without altering the linguistic content of the utterance. Its significance is hard to overstate: voice conversion is employed in a wide range of applications, including entertainment, vocal communication, and privacy enhancement. However, traditional methods have fallen short when handling large datasets and preserving subtle emotional cues, limiting the realism of simulated voices. To address these limitations, we present a novel approach that fuses speech-to-text (STT) technology with a text-to-speech (TTS) system powered by a deep learning architecture. The system incorporates phoneme embedding layers, bidirectional Long Short-Term Memory (BiLSTM) networks, and a WaveNet vocoder, which together make the converted voice more accurate and natural-sounding. The proposed model uses Python speech recognition packages and deep neural network methods to improve naturalness and clarity. Moreover, it sets a high bar for processing cost, efficiency, and conversion performance.
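The pipeline outlined above (phoneme embedding, BiLSTM layers, then a WaveNet vocoder) can be sketched as a small acoustic model. The following is a minimal illustrative sketch in PyTorch, not the authors' implementation: the class name, phoneme inventory size, and all layer dimensions are assumptions, and the WaveNet vocoder stage is represented only by a comment, since it would be a separately trained network consuming the predicted spectrogram frames.

```python
import torch
import torch.nn as nn

class PhonemeBiLSTMAcousticModel(nn.Module):
    """Hypothetical acoustic model: phoneme embedding -> BiLSTM -> mel frames.

    The predicted mel-spectrogram frames would then be passed to a WaveNet
    vocoder to synthesize the waveform in the target voice. All sizes below
    are illustrative assumptions, not the paper's published configuration.
    """

    def __init__(self, n_phonemes=70, emb_dim=128, hidden=256, n_mels=80):
        super().__init__()
        self.embed = nn.Embedding(n_phonemes, emb_dim)
        self.bilstm = nn.LSTM(emb_dim, hidden, num_layers=2,
                              batch_first=True, bidirectional=True)
        # 2 * hidden because the BiLSTM concatenates both directions
        self.proj = nn.Linear(2 * hidden, n_mels)

    def forward(self, phoneme_ids):
        # phoneme_ids: (batch, seq_len) integer phoneme indices from STT
        x = self.embed(phoneme_ids)   # (batch, seq_len, emb_dim)
        x, _ = self.bilstm(x)         # (batch, seq_len, 2 * hidden)
        return self.proj(x)           # (batch, seq_len, n_mels) mel frames

model = PhonemeBiLSTMAcousticModel()
ids = torch.randint(0, 70, (1, 12))   # one utterance of 12 phonemes
mels = model(ids)                     # frames a WaveNet vocoder would consume
```

Running the sketch on a 12-phoneme utterance yields one mel-spectrogram frame per phoneme step; in a real system, durations would be upsampled before vocoding.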
