eLanka

Saturday, 6 Dec 2025
© 2005 – 2025 eLanka Pty Ltd. All Rights Reserved.

Voice generation using text: A deep-learning method By Aditya Abeysinghe

eLanka admin
Last updated: September 17, 2021 2:28 am

The main function of a text-to-speech (TTS) system is to generate speech from text that sounds similar to a human voice. The process of converting text to speech is known as speech synthesis. Recorded speech is used to generate new speech based on the text input to the TTS system. Since the 1960s, several TTS systems have been developed for speech synthesis. However, these systems have several issues, which has led to the use of deep-learning methods to synthesize speech.

Current methods

Traditional systems use two main methods for speech synthesis: concatenative and parametric. In concatenative synthesis, recorded speech waveforms are joined to produce a speech stream. This method uses a waveform database to store and retrieve recorded speech. The recorded speech that matches each piece of input text is selected and appended to the stream to produce the final speech. In parametric speech synthesis, digital signal processing methods synthesize the speech. Different parametric types vary parameters such as phonetic features and noise over time to create a waveform. Other techniques use deep neural networks or hidden Markov models to produce waveforms.
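The two traditional approaches can be contrasted in a minimal sketch. Everything here is an assumption made for illustration: the sample rate, the unit labels, the sine tones standing in for recorded speech, and the function names concatenative_synthesis and parametric_synthesis are hypothetical and do not come from any particular TTS system.

```python
import math
import random

SAMPLE_RATE = 8000  # samples per second (assumed for this sketch)


def tone(freq_hz, duration_ms):
    """Stand-in for a recorded speech unit: a short sine tone."""
    n = SAMPLE_RATE * duration_ms // 1000
    return [math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


# Hypothetical waveform database: unit label -> stored waveform.
WAVEFORM_DB = {
    "HH": tone(200, 50),
    "AY": tone(300, 100),
}


def concatenative_synthesis(units):
    """Look up each unit in the database and join the waveforms end to end."""
    stream = []
    for unit in units:
        stream.extend(WAVEFORM_DB[unit])
    return stream


def parametric_synthesis(frames, frame_ms=10, seed=0):
    """Build a waveform from per-frame (pitch_hz, noise_level) parameters."""
    rng = random.Random(seed)
    samples_per_frame = SAMPLE_RATE * frame_ms // 1000
    phase = 0.0
    out = []
    for pitch_hz, noise_level in frames:
        for _ in range(samples_per_frame):
            # Advance the phase so the pitch can change between frames
            # without a jump in the periodic component.
            phase += 2 * math.pi * pitch_hz / SAMPLE_RATE
            periodic = math.sin(phase)
            noise = rng.uniform(-1.0, 1.0)
            out.append((1 - noise_level) * periodic + noise_level * noise)
    return out


concat_wave = concatenative_synthesis(["HH", "AY"])
param_wave = parametric_synthesis([(120, 0.1), (140, 0.1), (160, 0.5)])
print(len(concat_wave), len(param_wave))
```

The concatenative function only retrieves and joins stored waveforms, while the parametric function stores no audio and computes every sample from parameters that change from frame to frame.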

The process

The first stage of speech synthesis is to analyze the text input to the TTS system. This involves tokenizing the text and removing blank characters. The sentences are broken into tokens, which are then sent to the next stage, linguistic analysis. In this stage, phonemes, syllables and words are analyzed in a text-to-phoneme conversion. Next, the parameter prediction module predicts acoustic feature parameters from the output of the linguistic analysis. Finally, the speech synthesis module produces the speech waveform.
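The four stages can be sketched as a chain of small functions. This is only an illustration under stated assumptions: the lexicon, the acoustic parameter table, its values and all function names are hypothetical, and real systems predict far richer features than a single pitch and duration per phoneme.

```python
import math
import re

SAMPLE_RATE = 8000  # samples per second (assumed for this sketch)

# Hypothetical pronunciation lexicon: word -> phonemes.
LEXICON = {"hi": ["HH", "AY"], "there": ["DH", "EH", "R"]}

# Hypothetical acoustic table: phoneme -> (pitch_hz, duration_ms).
# A pitch of 0 stands for an unvoiced sound, rendered as silence here.
ACOUSTIC_TABLE = {
    "HH": (0, 40),
    "AY": (140, 120),
    "DH": (110, 50),
    "EH": (130, 90),
    "R": (120, 60),
}


def analyze_text(text):
    """Stage 1: drop blank characters and punctuation, return word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def linguistic_analysis(tokens):
    """Stage 2: text-to-phoneme conversion by lexicon lookup."""
    phonemes = []
    for token in tokens:
        # Words missing from the toy lexicon are skipped in this sketch.
        phonemes.extend(LEXICON.get(token, []))
    return phonemes


def predict_parameters(phonemes):
    """Stage 3: map each phoneme to its acoustic feature parameters."""
    return [ACOUSTIC_TABLE[p] for p in phonemes]


def synthesize(parameters):
    """Stage 4: turn the acoustic parameters into waveform samples."""
    wave = []
    for pitch_hz, duration_ms in parameters:
        n = SAMPLE_RATE * duration_ms // 1000
        wave.extend(
            math.sin(2 * math.pi * pitch_hz * i / SAMPLE_RATE) for i in range(n)
        )
    return wave


tokens = analyze_text("  Hi   there! ")
phonemes = linguistic_analysis(tokens)
parameters = predict_parameters(phonemes)
wave = synthesize(parameters)
print(tokens, phonemes, len(wave))
```

In this sketch the parameter prediction stage is a fixed lookup table. That stage is the part that a trained model would replace.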

Issues in synthesis methods

The main issue with non-deep-learning methods is that they are inefficient at processing complex text input that involves decision logic. Also, these methods divide the input into separate regions and use separate parameters for each region. This fragments the data input to the TTS system, which results in poorly fitted models. Therefore, the accuracy of such models when producing speech on test data is low.

Deep-learning methods in synthesis

Deep-learning methods use artificial neural networks for speech production. The models created using other methods are replaced with these neural networks in the steps between linguistic analysis and parameter generation described in the process section. In a deep neural network, linguistic features are processed by the hidden layers of the network. The network is trained by adjusting its internal parameters, or weights, so that the error is minimized at each training step. Because a single network is trained on all of the input data, the second issue described above, the division of input data, is reduced. Also, complex logic can be represented using deep-learning methods, which reduces the first issue.
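The training idea, adjusting weights so that the error shrinks, can be illustrated with a very small network written in plain Python. This is a sketch under assumptions, not the method of any specific TTS system: the two binary input features, the target values, the network size, the learning rate and the function name train_network are all hypothetical, and a real system would use a deep-learning library and many more layers and features.

```python
import math
import random


def train_network(samples, hidden_size=4, epochs=2000, lr=0.1, seed=0):
    """Train a one-hidden-layer network with full-batch gradient descent.

    Returns (predict, losses); losses holds the mean squared error
    measured before each weight update.
    """
    rng = random.Random(seed)
    n_in = len(samples[0][0])
    w1 = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(hidden_size)]
    b1 = [0.0] * hidden_size
    w2 = [rng.uniform(-1, 1) for _ in range(hidden_size)]
    b2 = 0.0

    def forward(x):
        hidden = [
            math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(w1, b1)
        ]
        output = sum(w * h for w, h in zip(w2, hidden)) + b2
        return hidden, output

    losses = []
    for _ in range(epochs):
        g_w1 = [[0.0] * n_in for _ in range(hidden_size)]
        g_b1 = [0.0] * hidden_size
        g_w2 = [0.0] * hidden_size
        g_b2 = 0.0
        loss = 0.0
        for x, target in samples:
            hidden, output = forward(x)
            err = output - target
            loss += err * err
            # Backpropagate the gradient of half the squared error.
            g_b2 += err
            for j in range(hidden_size):
                g_w2[j] += err * hidden[j]
                d_hidden = err * w2[j] * (1 - hidden[j] ** 2)
                g_b1[j] += d_hidden
                for i in range(n_in):
                    g_w1[j][i] += d_hidden * x[i]
        losses.append(loss / len(samples))
        # Move every weight a small step against its averaged gradient.
        scale = lr / len(samples)
        b2 -= scale * g_b2
        for j in range(hidden_size):
            w2[j] -= scale * g_w2[j]
            b1[j] -= scale * g_b1[j]
            for i in range(n_in):
                w1[j][i] -= scale * g_w1[j][i]

    return (lambda x: forward(x)[1]), losses


# Hypothetical data: two binary linguistic features -> a normalized
# acoustic target that depends on the combination of both features.
samples = [([0, 0], 0.0), ([0, 1], 1.0), ([1, 0], 1.0), ([1, 1], 0.0)]
predict, losses = train_network(samples)
print("first loss:", losses[0], "last loss:", losses[-1])
```

All four samples update the same set of weights, so the data is not split into regions with separate parameters. The hidden layer is what lets the network fit a target that depends on both features together.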

Image Courtesy: https://www.voicebase.com/

TAGGED: Aditya Abeysinghe, text-to-speech, TTS, Voice generation
